Gene Haur_3737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3737 
Symbol 
ID5735601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4695939 
End bp4697339 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content52% 
IMG OID641280889 
Productglucose/sorbosone dehydrogenase-like 
Protein accessionYP_001546501 
Protein GI159900254 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGA ATATTGCAAC CTTGGTGCTG CTATCACTTT GGCTGGGGGC GATGATTGGC 
GGATTATTGC TTATGCGCCA AGCAGGCCGT TGGCGCTGGA TTGGCCTCGC AATTGCCGTG
CAAGGCTTGA TCTTGCTGGT ATTTCAAGGC TTGCTCAGCA CTTATCCAAT TGGGATTGGC
GATCCATTAA TTCGGGTGAA ATTTTGGGAT TTGCTGAGCC AAGCGGCGGC CGTGCTTGGT
GCATTAGTGC TTGTCGCATT TGTGTTATGG TGGCTAGCCA AACAAGTGCG CTACAGCATC
GATCAGCAAT TGCACCGACG AGGCTTAGCC TTATTGTTGC TCTTTGGCAT GCCATTAATC
GCCACGCCCG CCTTTTACGC CATGTGGCAA ACCAGCATTC CCGAGCGCGA GCGTGAACGC
AACCCTGATT TGCGGGTGAT CAACGTGCCC GAAGGCTTTG AATGGAGCGT CTACGCCCGT
GGCACGATGG ATAATCCTAC GGCGATTGCC TTTGGAGCAG AAAACGAACT ATACATCGCC
GATATTGCTG GCGATTTGTG GATTGGCCGC GACCAAAATA ATGATCAGCA GATTGATAGT
TTGAGCAAAT GGGCTGGCGA TTTTGATTTG TTGGTCGGCT TGGTTTGGCG CGATGGCGAG
TTGTATTGTG CTAGTTCAGG CAAAATCGAG GCCTTGCGCG ATAGCGATGG CGATGGCGTA
GCCGATAGTC GCCGCATCGT GGTTGATAAT TTGCCCTCAA TGATTTTGCA GCCACACTCC
AACAACGGCT TGGCCTTTGG CCCCGATGGC CGCTTGTATT TTGGGGTTGG CAGCACCACC
GACGGTAAAT TTGAGGAAAA TGAATTAGCC GCCAGCGTTT TATCGGTTAA TCCCGATGGC
ACTGATTTGC GCCCCTATGC GCGGGGCTTG GGCAATGTGT TTGATGTCGC TTTCAATGCC
GATGGGGCGC TGTTTGGGGG CGATAATGGC CCTAGCTCAG TTGAGGGCAA TGATCCGCCA
GATGAATTTA ATTATTTGGT TGAAGGCGAA CACTACGGCT ACCCCTACTT TTTTGGCGAC
CCGCCCAGCG ACGGTGGCAC ACGCGGCGCA TTGATCAGCT TTCCCGCTCA CTCGGTGCCA
ACGGGCGTTA CGTTTTATAG TGGCAATCAA TATCCGCAAA TCTATAGCGA TAGCGCCTTT
TTGACGCTGT GGCAGACGGG CGAGGTCGTA CACATTGAAG TTGGGCAAAC CAGCAACGGC
GATTATCTGG CCAAATCAAC CACCTTTGCT GATGGTATGC TCTACCCGAT TGATGTGATT
ACTGGCCCCG ATGGCAACTT GTATATCGCC GATTTCGGTA CGAGTGCAAT CTACCGAATC
ACCTATAATG GAGTGCGTTA A
 
Protein sequence
MLLNIATLVL LSLWLGAMIG GLLLMRQAGR WRWIGLAIAV QGLILLVFQG LLSTYPIGIG 
DPLIRVKFWD LLSQAAAVLG ALVLVAFVLW WLAKQVRYSI DQQLHRRGLA LLLLFGMPLI
ATPAFYAMWQ TSIPERERER NPDLRVINVP EGFEWSVYAR GTMDNPTAIA FGAENELYIA
DIAGDLWIGR DQNNDQQIDS LSKWAGDFDL LVGLVWRDGE LYCASSGKIE ALRDSDGDGV
ADSRRIVVDN LPSMILQPHS NNGLAFGPDG RLYFGVGSTT DGKFEENELA ASVLSVNPDG
TDLRPYARGL GNVFDVAFNA DGALFGGDNG PSSVEGNDPP DEFNYLVEGE HYGYPYFFGD
PPSDGGTRGA LISFPAHSVP TGVTFYSGNQ YPQIYSDSAF LTLWQTGEVV HIEVGQTSNG
DYLAKSTTFA DGMLYPIDVI TGPDGNLYIA DFGTSAIYRI TYNGVR