Gene Teth514_1094 details


Gene Information

Locus tag: Teth514_1094
Symbol:
ID: 5876472
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1131026
End bp: 1133227
Gene Length: 2202 bp
Protein Length: 733 aa
Translation table: 11
GC content: 34%
IMG OID: 641541448
Product: glycoside hydrolase, clan GH-D
Protein accession: YP_001662728
Protein GI: 167039743
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
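
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values but are not stated in the record itself. The function names are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 1131026
END_BP = 1133227
REPORTED_GENE_LENGTH = 2202     # bp
REPORTED_PROTEIN_LENGTH = 733   # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 2202 733
```

Under these assumptions the computed values agree with the reported gene length (2202 bp) and protein length (733 aa).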


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTATTC ATTTTAATGA TAAAACAAAA ACTTTTTACC TAACTGCCAA AGATACAAGT 
TATGTAATTT ATGTTTTAAA AAATGGCGCT GTTTTACATG CATATTTTGG CAAAATAATA
AAGACTCCTA ATATATACCA TTTATTAAAA TTACCCCATA TAAGTATAGA TAATGATATT
ATTAACTTTG GGAATCAGTT AATGCTAGAT TTTTTGCCAC AGGAGTATCC TGCGTATGGT
AATACTGATT TTAGGAGTCC TGCATATCAA ATTCAGCTTG AAAACGGTTC TACTGTTTCT
GATCTAAGGT ACTTATCTCA CAAAATCTAT AAAGGTAAGC CTAAATTAGA AGGGCTTCCG
GCTACTTATG TTGAAAATGA AGATGAAGCT GATACTCTTG AGTTAGAATT GTATGATAAA
GTAGCTAATT TAAAGGTAAC TTTAATTTAT ACAGCATTTA GGGATTATGA TGTTATAACA
AGGTCAGTGA GATTTGAAAA CATGGGAAAA GAAGACATAA AATTGTTAAG AGCTTTGTCT
ATGAATGTTG ATTTTAATGA TAGTAATTTT GACATGCTTC AATTGTCAGG AGCTTGGGCG
AGAGAAAGAC ATGTTATAAG AAGACCGCTT GTACCAGGAG CCCAGTCTAT TGAAAGCAGA
AGAGGTGCAA GCAGCCATCA GCAAAATCCT TTTATAGCAC TTTTAAGGAA CGACGCTGAT
GAATGGCATG GAGATGTGTA TGGATTTAGC CTTGTTTACA GTGGCAATTT CTTGGCACAA
GTAGAAGTAG ATCAATATAA TATGGCAAGA GTTTCTATGG GAATCAATCC ATTTGATTTT
TCATGGCTTC TAAAACCAGG TGAAACATTT CAAACACCAG AAGTCGTTAT GGTTTATTCG
GATGGTGGCT TAAATAAAAT GTCAAATACC TATCACAAAC TGTACAGAAA TAGACTTATG
AGAAGTAAAT TTAAGGACAG AGAAACACCA ATTCTTATAA ACAACTGGGA AGCTACTTAT
TTTGATTTTA CAGAAGAAAA ACTTAAAGAA CTTGCTAAAG AAGCTAAAGA TTTGGGGATT
GAACTGTTTG TTCTTGACGA TGGATGGTTT GGAAAGAGAA ATTCTGACAA TTCTTCACTG
GGAGATTGGT TTGTAAATAA AGAAAAGATT CCAAGTGGTT TGGATGGCCT TGCAAAAGGG
ATAAATTCTT TAGGTTTAAA ATTTGGATTA TGGATGGAAC CAGAAATGGT GTCTCCTGAT
AGTGACCTCT ATAGAGAGCA TCCCGATTGG TGTATACATG TACCTAATAG GTCGAGAAGT
GAAAGCAGAA ATCAACTTGT GTTAGACTTG TCGCGCAAAG ATGTACAAGA TTATATTATA
AAAGTAGTGT CAGATATTTT GGAAAGCGCC AACATAAGTT ATGTAAAATG GGATATGAAT
AGAAATATGA CAGAGATAGG CTCTGCACTT TTGCCTCCTG AAAGACAAAG AGAAACTGCT
CACAGATATA TACTTGGACT TTACAGAATA TTAGAAGAAA TAACGACAAG GTTTCCTGAT
GTTTTGTTTG AAAGTTGTGC TGGTGGTGGT GGTCGTTTCG ATCCAGGAAT GCTTTATTAC
ATGCCACAGA CTTGGACCAG TGATGATACA GATGCAATTG AAAGACTTAA AATACAGTAT
GGTACAAGTA TAGTTTATCC TCTTATTTCA ATGGGAAGCC ACATATCCGC TGTACCAAAT
CATCAAGTTC ACAGAATTAC GCCGTTAAAA ATACGTGCAC ATGTAGCAAT GTCAGCTAAT
TTTGGCTTTG AACTTGATTT GACAAAATTG AGCAGTGAAG AAAAAGATGA AATAAAGAAA
TATGTTGAAA AGTATAAAGA GATTAGGAAA CTGGTGCAAT TTGGAGATTT TTATCGATTG
TTAAGTCCCT TTGAAGGGAA TGAAACTGCT TGGTTGATTG TTTCTGAGGA TAAAAGAGAA
TTTTTGCTCT ATTATTTTAG AGTTTTGGGA GGAGCAAATG AACCTATTAA AAGACTTCGT
CTAAAAGGGA TAAATCCAGA TTTTAATTAT GTTTTAGAAG ATGATGGTAG TGAATATAGT
GGTGATGAAC TTATGTATGC AGGGAAAGTA ATTCCAGAAC TTAAAGGGGA CTTTCAGAGC
ATTATGATGC ATTTTAAGGA GGAGAGTATT AAAGATGGGT AG
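
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The following is a minimal sketch of that step. The helper names `join_blocks` and `gc_fraction` are illustrative, and the sample is only the first line of the sequence, so its GC fraction is not expected to match the 34% reported for the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short sample.
sample = join_blocks(
    "ATGTCTATTC ATTTTAATGA TAAAACAAAA ACTTTTTACC TAACTGCCAA AGATACAAGT"
)
print(len(sample), f"{gc_fraction(sample):.0%}")
```

Passing the whole gene sequence to `join_blocks` and then to `gc_fraction` would give the figure to compare against the GC content in the Gene Information table.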
 
Protein sequence
MSIHFNDKTK TFYLTAKDTS YVIYVLKNGA VLHAYFGKII KTPNIYHLLK LPHISIDNDI 
INFGNQLMLD FLPQEYPAYG NTDFRSPAYQ IQLENGSTVS DLRYLSHKIY KGKPKLEGLP
ATYVENEDEA DTLELELYDK VANLKVTLIY TAFRDYDVIT RSVRFENMGK EDIKLLRALS
MNVDFNDSNF DMLQLSGAWA RERHVIRRPL VPGAQSIESR RGASSHQQNP FIALLRNDAD
EWHGDVYGFS LVYSGNFLAQ VEVDQYNMAR VSMGINPFDF SWLLKPGETF QTPEVVMVYS
DGGLNKMSNT YHKLYRNRLM RSKFKDRETP ILINNWEATY FDFTEEKLKE LAKEAKDLGI
ELFVLDDGWF GKRNSDNSSL GDWFVNKEKI PSGLDGLAKG INSLGLKFGL WMEPEMVSPD
SDLYREHPDW CIHVPNRSRS ESRNQLVLDL SRKDVQDYII KVVSDILESA NISYVKWDMN
RNMTEIGSAL LPPERQRETA HRYILGLYRI LEEITTRFPD VLFESCAGGG GRFDPGMLYY
MPQTWTSDDT DAIERLKIQY GTSIVYPLIS MGSHISAVPN HQVHRITPLK IRAHVAMSAN
FGFELDLTKL SSEEKDEIKK YVEKYKEIRK LVQFGDFYRL LSPFEGNETA WLIVSEDKRE
FLLYYFRVLG GANEPIKRLR LKGINPDFNY VLEDDGSEYS GDELMYAGKV IPELKGDFQS
IMMHFKEESI KDG
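
The protein sequence can be related back to the gene sequence by translating it codon by codon. NCBI translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons it permits, so a plain standard-code lookup is enough for a gene that starts with ATG, as this one does. The sketch below is a simplified illustration rather than a full implementation: it does not handle alternative start codons or ambiguous bases, and the name `translate` is chosen here for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence in this record.
first_codons = translate("ATGTCTATTC ATTTTAATGA TAAAACAAAA")
last_codons = translate("AAAGATGGGT AG")
print(first_codons, last_codons)  # MSIHFNDKTK KDG
```

The two fragments reproduce the first ten residues (MSIHFNDKTK) and the last three residues (KDG) of the protein sequence listed above.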