Gene Teth514_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1547 
Symbol 
ID5877545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1570734 
End bp1571855 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content33% 
IMG OID641541895 
Productthiamin pyrophosphokinase, catalytic region 
Protein accessionYP_001663170 
Protein GI167040185 
COG category[S] Function unknown 
COG ID[COG4825] Uncharacterized membrane-anchored protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATAA CAGGGACAAT TAAAATGGAT AAAAGAACTA AAAATTTAGT AAAAAGACTA 
AATCCTGGAG AAATAGCTAT AATTGATCAT AGAGATATAG ATGAAGTGGC TGCTGAATCC
TTAGTTGAAA AAAAAGTTTT GGCAGTCATA AATGCTGATA AGTCTATAAG TGGCAGATAT
CCTAACATGG GACCTTCTAT CTTGTGTGAG GCGGGAATTC CAATTATTGA CGATGTAGGA
AAAGACATTT TTGTTTTATT GAAAGAAAAT GACAAAATAA CTATAATAGA TAATGAAATA
TATAAAGACG GTGTGTTTAT AAAAAGCGGG AAACTTCTTA CAAAAGACGT TATAAACTAT
AAATTGCAAG AAAGCCGCGA AAACATAGGT GTTGAATTAG ATAAATTTAT TGAAAATACA
TTGGAATATG CAAAAAAAGA AAAATATTTT ATTCTAGGGG GAGTTGAATT ACCCAATATT
AAAACTCGTT TTAAAAATAG ACATGCTCTT GTAGTCGTTA GAGGAAAGGA TTACAAAGAA
GATTTATTTA CAATAAAACA GTATATCAAC GATGTAAAAC CGATTTTAAT AGGAGTAGAT
GGAGGGGCAG ATGCTCTTTT AGAGTTTGGT CTTATTCCTG ACATAGTCAT TGGAGATATG
GACAGTGTAA GTGATGAGGC TTTAAAAAAG GCTCGGGAAA TCGTAGTTCA TGCCTATCCC
GACGGAAGAG CACCTGGCTT GGAAAGAGTA ACAGCATTAG GTCTTAAAGC AGAAATTTTT
AAGGCCCCAG GTACAAGTGA AGATATAGCT ATGCTTTTGG CTTTTGAAAA AGGTGCTGAC
TTAATTGTTG CTGTAGGGAC CCATTCCAGC ATGATTGATT TTCTGGAAAA AGGAAGAAAA
GGAATGTCCA GCACTTTTTT AGTGCGATTA AAAGTAGGAG AAAAATTAAT CGATGCTAAA
GGAGTAAATA AACTGTACAG GGAAACTTTC AAAATTTCTT ATATATTTAG CATTATACTA
GCAGCATTGA TTCCTCTTGG AGTAATAGCG TATTTTTCTC CACCTATGCA GCAGTTATTG
AAGCTATTAC AACTTAGAAT TCGCCTTTTA ATCGGATTTT GA
 
Protein sequence
MQITGTIKMD KRTKNLVKRL NPGEIAIIDH RDIDEVAAES LVEKKVLAVI NADKSISGRY 
PNMGPSILCE AGIPIIDDVG KDIFVLLKEN DKITIIDNEI YKDGVFIKSG KLLTKDVINY
KLQESRENIG VELDKFIENT LEYAKKEKYF ILGGVELPNI KTRFKNRHAL VVVRGKDYKE
DLFTIKQYIN DVKPILIGVD GGADALLEFG LIPDIVIGDM DSVSDEALKK AREIVVHAYP
DGRAPGLERV TALGLKAEIF KAPGTSEDIA MLLAFEKGAD LIVAVGTHSS MIDFLEKGRK
GMSSTFLVRL KVGEKLIDAK GVNKLYRETF KISYIFSIIL AALIPLGVIA YFSPPMQQLL
KLLQLRIRLL IGF