Gene Haur_0836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0836 
Symbol 
ID5732737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp946148 
End bp947308 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID641277968 
Producthypothetical protein 
Protein accessionYP_001543612 
Protein GI159897365 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00104213 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCGCT TCAGCGATGT CACCAACATT TGGTCAACCA TAAAAGAAAT TGATGTGCGC 
GATATTCGCG ATCAGGCCGA TTTGCCCTGT CGGATTGCGC TACTTGGCCA TGCGACCTTC
GGGCGCGATT TGATTATGCG CTTGTTGACG CTTGGCGCTC AACGTTTTCC CGCCCGTACT
CCGCAAGTCA GCATCATCGA TTTACCCTTG GGCCGCGAAC AGGCTACCGA TCTCAACCGC
GTCGATTTGA TTGTACTCAC GCTCGATGCT AGCCAAGCCT TGAGTTACGA TGAGTTTCTG
GCCTACGAAA AATTGGCGAT TCTGCCAGTG CCACTGTTGA TCGCCGTTTG GGGTACGAAT
CTCCCTAAAA GCTCCGAAAG CACCCACCAA GCTGATCTGC AAGCATCGCC AGCGGTGTTG
CTCGACCCAC AAGCCGAGCC GGCAACCCAG CGCAAAATGC TGGCCAAAGC GGTGCTCGAA
CTCGTACCAG AAGCGTTGCA TATTGCGGCT GCTCGGCGCT ACCCAGGCTT GCGCAGCGAA
GTTACCAATA ATTTGATTAG CAGCGTTTCG CTGAGCAATG CGACTTTTGC TTTTACCTCG
GGCATTCCCG AGATGATTCC CGTGCTGAAT TTGCCATTGA ACGCCGCCGA TATGCTGGTA
TTGACCAAGA ATCAGGCGCT GTTGGCCTAT CGCGTGGCCT TGGCTATGGG TGCTGAAGGC
GATTTTAGTG CCATGATTCG TGAATTGCTG CCCGTGGTTG GCGGTGGTTT CCTCTGGCGA
CAACTGGCAC GCCAATTGGT CGGTCTGATT CCAGGCATTG GCTTATTGCC CAAAGTTGCG
GTGGCCTATG CAGGCACGTT TGTGACTGGG ATTGCGGCAT GGCGCTGGTA TGAACGTGGC
GAGTTGGTCA GCAAAGCCGA ATTACAAAGC CTCGTCAAAG CAGCCTTAGA AGAAGGCCGA
CAACGAGCCA AGGCGCTGAT TGGTAATCGT AAGGCTGACG ATGATCCCAC TGCATCAGCC
AAACCCAGTT TTCGCCAACG GATCGGCGCA GTGCTGAACC CAAAAAACTG GTTCAAGGCG
CTGCGTGCCC GCCTGCGACG CAAACCCAAA TCGATCCAAA AAACAACTGA GCCAACCGAT
CAATCCAACT CTGCTGCTTA A
 
Protein sequence
MSRFSDVTNI WSTIKEIDVR DIRDQADLPC RIALLGHATF GRDLIMRLLT LGAQRFPART 
PQVSIIDLPL GREQATDLNR VDLIVLTLDA SQALSYDEFL AYEKLAILPV PLLIAVWGTN
LPKSSESTHQ ADLQASPAVL LDPQAEPATQ RKMLAKAVLE LVPEALHIAA ARRYPGLRSE
VTNNLISSVS LSNATFAFTS GIPEMIPVLN LPLNAADMLV LTKNQALLAY RVALAMGAEG
DFSAMIRELL PVVGGGFLWR QLARQLVGLI PGIGLLPKVA VAYAGTFVTG IAAWRWYERG
ELVSKAELQS LVKAALEEGR QRAKALIGNR KADDDPTASA KPSFRQRIGA VLNPKNWFKA
LRARLRRKPK SIQKTTEPTD QSNSAA