Gene Haur_3122 details

Gene Information

Locus tag: Haur_3122
Symbol:
ID: 5734994
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3939764
End bp: 3941020
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 54%
IMG OID: 641280265
Product: L-sorbosone dehydrogenase
Protein accession: YP_001545887
Protein GI: 159899640
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
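
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and the protein length follows if the CDS ends in a single stop codon. Both readings are assumptions made here for illustration; the record does not state its coordinate convention. A minimal Python sketch of that check, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3939764, 3941020)
print(length_bp)                  # 1257
print(protein_length(length_bp))  # 418
```

Both results match the reported Gene Length (1257 bp) and Protein Length (418 aa).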


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAAAG CTCTGCTTTT GATCAGTTTG CTTGGTTTGT TGGCCGCTTG TGGCGAGCAA 
CAAGTCGATG CGCCGATTGC ACCAACCACC GTGCCGCTCA ACCAAGCCGC TGGCAGTTCC
ACCACACCTG TTGCCACCGC AACAACTAGT ATTGTACCAA GCCCTGAGCC AACAACAAGT
GCAGGCAAGC CAGCACGCAA CGCGCCAACC GCCGTAGTCG AGCCAACCGA TGCGGTTACT
TTGCCCGATG GCTTTGGCAT TAGTGTGTTT CAAAGTAAGC TGGCTGGCCC GCGCATGTTG
GCGATTGGCC CTGATGGCGC GATTTATACC GCTGAGCGTG GGGAGGATCG GATTGTCCGC
TTGCCTGATC GCAACGCCGA TGGTTTGGCT GATGGCGTTG AGGTGATTGC TGATGGCTTC
GATTCACCCT CAAGCATGAT TTTCGACCAA GCTGGAAATT TATATGTCGC CGAAACCACC
AAAGTGATCA AATTAACCCA GCCTGATGCT GAAGGCAAAT ATACTCAACG CCAAACGATC
ATCGATGGCT TGCCTGCTGG CGGCCATAGC ACCCGCACCT TGCTATTCAG CCCTGATGAA
AGCAAATTGT ATGTGGCGGT TGGTTCATCG TGCAATGTTT GCAACGAAGA AGATGAGCGA
CGGGCAACCG TGATGGAATA TGATCCCGAT GGCAGCAATG GCCGAATTTA TGCCAAGGGC
TTACGCAACG CGGTGGGCAT TACTTGGCGG CCTGGCACGA ATGAATTGTG GGCTACCAAC
AATGGTCGCG ACATGTTGGG CGACGACCAA CCACCAGAAA CTGTCAACGT GGCAACCAGC
GCTGGCCTGG ATTTTGGCTG GCCTCGCTGT CACTCAGGGC GGATTGCCGA CCCTGAATTT
GGCAAAGATG CCAATGCCTG CCAAGGTGTT ACGCCGCCTG CGGTCGAGAT GCAAGCCCAC
AGCGCTCCGC TCGGTTTGGC ATTTGGCAAC GGCAGCAACT TCCCCGAACC CTATCAAAGC
GGCTTGTTTG TGGCTTTCCA CGGCTCATGG AATCGCTCAA GCCCAACGGG TTATAAAGTG
GTGTTTATTC CCGTAACTGA TGGCAAAGCT GGCAATGCCC AAGATTTTGC CACTGGCTGG
CTGACCGATG CTGGAGCGGT TTGGGGCCGA CCAGTTGATG TAATTGTGGG CCGTGACGGT
AGTTTATATA TTTCCGATGA CGCTGGCGGC GCGATTTACC GCGTCTTTGC CAAATAA
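
The GC content field can be recomputed from a sequence laid out in 10-base groups like the one above. The sketch below is one way to do it in Python; `gc_fraction` is a hypothetical helper, and how the source database rounds its percentage is not stated, so a recomputed value may differ from the reported 54% in the last digit.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces and line breaks in the input are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Only the first line of the gene sequence is used here, for brevity.
# Pass the full sequence to compare against the reported figure.
first_line = "ATGCGCAAAG CTCTGCTTTT GATCAGTTTG CTTGGTTTGT TGGCCGCTTG TGGCGAGCAA"
print(f"{gc_fraction(first_line):.1%}")
```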
 
Protein sequence
MRKALLLISL LGLLAACGEQ QVDAPIAPTT VPLNQAAGSS TTPVATATTS IVPSPEPTTS 
AGKPARNAPT AVVEPTDAVT LPDGFGISVF QSKLAGPRML AIGPDGAIYT AERGEDRIVR
LPDRNADGLA DGVEVIADGF DSPSSMIFDQ AGNLYVAETT KVIKLTQPDA EGKYTQRQTI
IDGLPAGGHS TRTLLFSPDE SKLYVAVGSS CNVCNEEDER RATVMEYDPD GSNGRIYAKG
LRNAVGITWR PGTNELWATN NGRDMLGDDQ PPETVNVATS AGLDFGWPRC HSGRIADPEF
GKDANACQGV TPPAVEMQAH SAPLGLAFGN GSNFPEPYQS GLFVAFHGSW NRSSPTGYKV
VFIPVTDGKA GNAQDFATGW LTDAGAVWGR PVDVIVGRDG SLYISDDAGG AIYRVFAK
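
The protein sequence corresponds to translating the gene sequence codon by codon. Translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, a plain codon lookup is enough for a sketch. The names below (`CODON_TABLE`, `translate`) are illustrative only and not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycle through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons are not handled, and
    ambiguous bases such as N raise KeyError.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCAAAG CTCTGCTTTT GATCAGTTTG"))  # MRKALLLISL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GCC AAA TAA) give the closing `AK` followed by the stop.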