Gene Teth514_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1087 
Symbol 
ID5876545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1123002 
End bp1124165 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content35% 
IMG OID641541441 
Productgalactokinase 
Protein accessionYP_001662721 
Protein GI167039736 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACGG CTGTAATTGA AGCACTTGAA AAATTCTACG GTAAAAATGA TGCTGAAATA 
AGGCTTTTCT ATTCTCCGGG ACGAGTGAAT CTTATTGGAG AGCATACAGA TTACAATGGA
GGCTATGTAT TTCCTTGTGC CCTTGACTTT GGAACATATG CTGCGATTAG AAAAAGAAAT
GACAAAAAAG TCTTCATGGC TTCTTTAAAT TTCGATTTAA AGGTGGAAGT AGACCTTGAT
GCACTCAATT TTGATAAAAG CCATGATTGG GCTAATTATC CTAAAGGGGT TTTAAAAGTG
TTACAGGATG AGGGGTATGA CTTTTCTGGA TTTGAAATTG TGTTTGAAGG CAACATTCCA
AATGGCGCTG GACTTTCCTC ATCTGCTTCA ATAGAGCTGG TTACTGCTGT TGCAGTAAAT
GAAGTTTTCA ATTTAAATAT TGACAGAATA AAATTGGTGA AATTGTGTCA AAAAGCAGAA
AATACTTTTG TTGGGGTAAA TTGTGGCATA ATGGACCAAT TTGCTGTTGG AATGGGTAAA
AAAGACCATG CTATTTTATT AAAAAGCGAT ACATTAGAGT ATTCATACGT GCCTTTGAAG
TTAGAAGGTT ATAAAATTTT GATAACAAAT ACAAATAAAA GGAGAGGGCT CTTGGATTCG
AAATATAATG AAAGAAGAAG TGAATGTGAA AAGGCCCTTT CATATCTTCA AAAAGCTTTG
CCTGTAAAAA ATCTATCTGA AATTACAATT GAACAATTTG AAGAATACAA AGATTTGATA
CCTGACGAAG TGCTTAGAAA AAGGGCAAAA CATGTTATAA CTGAAAATAA AAGAGTTTTA
GATGCAGTAA AAGCACTTAA TGATAAAGAC TTAATCAAAT TTGGAGAATT AATGGTTGAA
TCTCACAATT CTTTGAGAGA TGATTACGAA GTTACAGGGA AAGAACTGGA CACTTTGGTA
GAAGAAGCGT TAAAATTAAA GGGAGTAATA GGTTCCCGTA TGACTGGAGC AGGCTTTGGT
GGCTGCACTG TAAGCATTGT AAAAGAAGAT GCAGTAGAGG AATTTATAAA AGTGGTGACT
CACAATTACA CTCAAAAAAT AGGCTACAGG CCAACAGTCT ATATAACGGG AATAGGTGAA
GGAGCAGGAG AAATTAAATA CTGA
 
Protein sequence
MKTAVIEALE KFYGKNDAEI RLFYSPGRVN LIGEHTDYNG GYVFPCALDF GTYAAIRKRN 
DKKVFMASLN FDLKVEVDLD ALNFDKSHDW ANYPKGVLKV LQDEGYDFSG FEIVFEGNIP
NGAGLSSSAS IELVTAVAVN EVFNLNIDRI KLVKLCQKAE NTFVGVNCGI MDQFAVGMGK
KDHAILLKSD TLEYSYVPLK LEGYKILITN TNKRRGLLDS KYNERRSECE KALSYLQKAL
PVKNLSEITI EQFEEYKDLI PDEVLRKRAK HVITENKRVL DAVKALNDKD LIKFGELMVE
SHNSLRDDYE VTGKELDTLV EEALKLKGVI GSRMTGAGFG GCTVSIVKED AVEEFIKVVT
HNYTQKIGYR PTVYITGIGE GAGEIKY