Gene Haur_3717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3717 
Symbol 
ID5735581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4675809 
End bp4676939 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content52% 
IMG OID641280869 
Productgalactokinase 
Protein accessionYP_001546481 
Protein GI159900234 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0664685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGG TTAAAGCTGC ATTTTTTGTT CGTTCTCCAG GTCGGGTAAA TTTAATTGGC 
GAGCATACCG ATTACAACGC TGGGTTTGTT ATGCCACTGG CGCTTGAACG TGGTACGACG
TTTCAGGTTC AACCCCGTGA TGATCAGCAG TTGATTGTGC ATGCCCTACG TTTCAACGCA
CATGATCAAG CCGATTTGGC TAATTTGGCG GCTGGCACGC ACGGCGATTG GCGTGATTAT
GTGCGCGGAA CAGCCCAATC CTTGCTTGAT GCTGGCTACG CGTTGCAAGG CGCTGAGATT
AATATTGATG GTGATTTGCC GCTGAGTGGC GGATTAAGCT CGTCAGCATC ATTAGAAGTT
GGCTTGGCTT TTAGCTTACT CTACGCCCAA GGCATCACGA TTGCTCCCGC TGAATTAGCT
AAAATTGCCC AACGCGCCGA AATTGAATAT GCCCATGTTA ATTGTGGAAT TATGGATCAG
CTTGCGATTG CCGCAGGCGT TGCCGGCCAT GCCACATTAA TCGATTGCCG CTCGTTGGAA
ATTGAGGCCG TGCCGATTCC GGCTGAAGTG GCAGTTTTGG TGATTGATAG TGGCGTGCCA
CGCACCTTGG CTGGCTCGGC TTATAATCAA CGCCGCGCCG AATGTGAACA AGCTGTAGCA
ATCTTGCGTC AACTCGACCC AAACATCAAC GATTTGCGCG ATGTTAACAG CGATTTGCTG
GCCCAAGCCG TTGAACAAGA TCGCTTTGAA GAAGTGATTT ATCGACGTGC CCGCCATGTT
GTCAGCGAAA ATGAGCGGGT GCATAAAGCC GCCGCCGCGT TTCGGGCAGG CGATTTTGGC
TACGTTGGCG AGTTGATGAA CGAATCGCAT TGGAGCCTGC GCGATGATTA TGAAGTTAGC
GGCCCTGAGC TTGATCAACT AACTGAGTTG TTGCGCGATA TGCCTGGGGT TTGGGGTGCT
CGCCTAACTG GCGCTGGCTT TGGTGGCTGC TGCGTGGCCT TGGTCGAAGC CAGCCACGTT
GATGCGGTGA TTGTGGCCTT AAGTCCAGCC TATCATGCCG CAACTGGCCG CACCTGCGAA
GCCTTTAGCA CCAAAGCCTC AGCATTAACC ATTGAAGAAC CTAGAGCATA G
 
Protein sequence
MTEVKAAFFV RSPGRVNLIG EHTDYNAGFV MPLALERGTT FQVQPRDDQQ LIVHALRFNA 
HDQADLANLA AGTHGDWRDY VRGTAQSLLD AGYALQGAEI NIDGDLPLSG GLSSSASLEV
GLAFSLLYAQ GITIAPAELA KIAQRAEIEY AHVNCGIMDQ LAIAAGVAGH ATLIDCRSLE
IEAVPIPAEV AVLVIDSGVP RTLAGSAYNQ RRAECEQAVA ILRQLDPNIN DLRDVNSDLL
AQAVEQDRFE EVIYRRARHV VSENERVHKA AAAFRAGDFG YVGELMNESH WSLRDDYEVS
GPELDQLTEL LRDMPGVWGA RLTGAGFGGC CVALVEASHV DAVIVALSPA YHAATGRTCE
AFSTKASALT IEEPRA