Gene Teth514_0266 details


Gene Information

Locus tag: Teth514_0266
Symbol:
ID: 5876518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 281032
End bp: 282342
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 36%
IMG OID: 641540607
Product: glycoside hydrolase family protein
Protein accession: YP_001661919
Protein GI: 167038934
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
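
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 281032
end_bp = 282342

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1311
print(protein_length)  # 436
```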


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAAAA AAGATTTGAA AATTGCTGTA ATTGGAGGGG GTTCTAGTTA TACCCCCGAA 
CTTATTGAGG GCTTTATCAA GAGGTATAAT GAACTACCAG TTAAAGACTT ATATTTAGTA
GATATAGAAG AAGGCCAAGA AAAACTTGAG ATTGTTGGCG GTCTTGCAAA AAGAATGGTA
GAAAAAGCTG GTGTAGGTAT AAACATTCAT TTGACTTTGG ACAGGCGAAA AGCCATAAAA
GATGCAGATT TTGTTGTTAC CCAGTTTAGA GTAGGGCTGA TAGATGCAAG AATTAGAGAT
GAAAAAATTC CTCTAAAGTA TGATGTTATA GGGCAAGAAA CAACTGGACC CGGTGGTTTT
GCAAAAGCAC AGAGAACAAT ACCTGTTATT TTAGACATAT GTAAAGACGT AGAGGAACTT
GCTCCAAATG CATGGCTTAT TAATTTTACA AATCCTTCAG GAGTGATAAC AGAAACGATT
TTAAAGCATA CAAATGTAAA AGCGATCGGA TTATGTAATG TACCTATAGG TATGGTATAC
GGTGTTGCAG AAGTACTTGG TGTTGATCCA AAAAGAGTGT ATATAGATTT TACAGGGCTT
AATCATTTAG TATGGGGTAC TCATATTTAC TTAGATGGCG AAGATATAAC CGAAAAACTA
ATAGACAGTT TCGCAGGTGG TAAATCTTTA TCAATGAAAA ATATACCTGA GTTGCCATGG
GAACCTGAAT TTATAAAATC TCTTGGTATG TATCCTTGTC CATACCACAG ATACTATTAT
TTAACAGATA AAATGCTTGA AGGACAGAAA AAAGAAGCTG CTACAGTAGG CACAAGAGGA
GAAGTCGTTA AAAAGGTAGA GCAAGAATTA TTTGAATTAT ATAAAGACCC AAATTTGAAT
ATAAAACCGC CGCAATTAGA AAAAAGGGGA GGAGCTCATT ATTCTGATGC TGCTTGCTCC
CTGATAAGTT CAATATATAA TGACAAAAAA GACATACATG TGGTCAATGT GAGAAACAAT
GGTACAATCG CAGATTTGCC AGATGATGTG GTGATAGAAA CAAATGCAAT AATAGATAGA
AATGGGGCTC ATCCGATAAA TATTGGACAT GTGCCAGCGA AAATAAGGGG TTTAATGCAA
GCAGTAAAAG CCTATGAAGA ACTTACTATA GAAGCAGGGG TAAAGGGGAA CTATTATACA
GCTTTACAGG CGTTGACAAT TCATCCATTA GTACCTTCTG CGACTGTTGC TAAAAAAATT
CTTGATGATA TACTTGAGCA AAATAAAGAG TATTTGCCAC AGTATAAATA G
 
Protein sequence
MSKKDLKIAV IGGGSSYTPE LIEGFIKRYN ELPVKDLYLV DIEEGQEKLE IVGGLAKRMV 
EKAGVGINIH LTLDRRKAIK DADFVVTQFR VGLIDARIRD EKIPLKYDVI GQETTGPGGF
AKAQRTIPVI LDICKDVEEL APNAWLINFT NPSGVITETI LKHTNVKAIG LCNVPIGMVY
GVAEVLGVDP KRVYIDFTGL NHLVWGTHIY LDGEDITEKL IDSFAGGKSL SMKNIPELPW
EPEFIKSLGM YPCPYHRYYY LTDKMLEGQK KEAATVGTRG EVVKKVEQEL FELYKDPNLN
IKPPQLEKRG GAHYSDAACS LISSIYNDKK DIHVVNVRNN GTIADLPDDV VIETNAIIDR
NGAHPINIGH VPAKIRGLMQ AVKAYEELTI EAGVKGNYYT ALQALTIHPL VPSATVAKKI
LDDILEQNKE YLPQYK
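
The gene and protein sequences above can be related with a short translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last 10-base-grouped lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_block = "ATGTCTAAAA AAGATTTGAA AATTGCTGTA ATTGGAGGGG GTTCTAGTTA TACCCCCGAA"
last_block = "CTTGATGATA TACTTGAGCA AAATAAAGAG TATTTGCCAC AGTATAAATA G"

print(translate(first_block))  # MSKKDLKIAVIGGGSSYTPE
print(translate(last_block))   # LDDILEQNKEYLPQYK
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared with the protein sequence and the GC content field in the record.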