Gene Information |
Locus tag | Teth514_1094 |
Symbol | |
ID | 5876472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1131026 |
End bp | 1133227 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641541448 |
Product | glycoside hydrolase, clan GH-D |
Protein accession | YP_001662728 |
Protein GI | 167039743 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATTC ATTTTAATGA TAAAACAAAA ACTTTTTACC TAACTGCCAA AGATACAAGT TATGTAATTT ATGTTTTAAA AAATGGCGCT GTTTTACATG CATATTTTGG CAAAATAATA AAGACTCCTA ATATATACCA TTTATTAAAA TTACCCCATA TAAGTATAGA TAATGATATT ATTAACTTTG GGAATCAGTT AATGCTAGAT TTTTTGCCAC AGGAGTATCC TGCGTATGGT AATACTGATT TTAGGAGTCC TGCATATCAA ATTCAGCTTG AAAACGGTTC TACTGTTTCT GATCTAAGGT ACTTATCTCA CAAAATCTAT AAAGGTAAGC CTAAATTAGA AGGGCTTCCG GCTACTTATG TTGAAAATGA AGATGAAGCT GATACTCTTG AGTTAGAATT GTATGATAAA GTAGCTAATT TAAAGGTAAC TTTAATTTAT ACAGCATTTA GGGATTATGA TGTTATAACA AGGTCAGTGA GATTTGAAAA CATGGGAAAA GAAGACATAA AATTGTTAAG AGCTTTGTCT ATGAATGTTG ATTTTAATGA TAGTAATTTT GACATGCTTC AATTGTCAGG AGCTTGGGCG AGAGAAAGAC ATGTTATAAG AAGACCGCTT GTACCAGGAG CCCAGTCTAT TGAAAGCAGA AGAGGTGCAA GCAGCCATCA GCAAAATCCT TTTATAGCAC TTTTAAGGAA CGACGCTGAT GAATGGCATG GAGATGTGTA TGGATTTAGC CTTGTTTACA GTGGCAATTT CTTGGCACAA GTAGAAGTAG ATCAATATAA TATGGCAAGA GTTTCTATGG GAATCAATCC ATTTGATTTT TCATGGCTTC TAAAACCAGG TGAAACATTT CAAACACCAG AAGTCGTTAT GGTTTATTCG GATGGTGGCT TAAATAAAAT GTCAAATACC TATCACAAAC TGTACAGAAA TAGACTTATG AGAAGTAAAT TTAAGGACAG AGAAACACCA ATTCTTATAA ACAACTGGGA AGCTACTTAT TTTGATTTTA CAGAAGAAAA ACTTAAAGAA CTTGCTAAAG AAGCTAAAGA TTTGGGGATT GAACTGTTTG TTCTTGACGA TGGATGGTTT GGAAAGAGAA ATTCTGACAA TTCTTCACTG GGAGATTGGT TTGTAAATAA AGAAAAGATT CCAAGTGGTT TGGATGGCCT TGCAAAAGGG ATAAATTCTT TAGGTTTAAA ATTTGGATTA TGGATGGAAC CAGAAATGGT GTCTCCTGAT AGTGACCTCT ATAGAGAGCA TCCCGATTGG TGTATACATG TACCTAATAG GTCGAGAAGT GAAAGCAGAA ATCAACTTGT GTTAGACTTG TCGCGCAAAG ATGTACAAGA TTATATTATA AAAGTAGTGT CAGATATTTT GGAAAGCGCC AACATAAGTT ATGTAAAATG GGATATGAAT AGAAATATGA CAGAGATAGG CTCTGCACTT TTGCCTCCTG AAAGACAAAG AGAAACTGCT CACAGATATA TACTTGGACT TTACAGAATA TTAGAAGAAA TAACGACAAG GTTTCCTGAT GTTTTGTTTG AAAGTTGTGC TGGTGGTGGT GGTCGTTTCG ATCCAGGAAT GCTTTATTAC ATGCCACAGA CTTGGACCAG TGATGATACA GATGCAATTG AAAGACTTAA AATACAGTAT GGTACAAGTA TAGTTTATCC TCTTATTTCA ATGGGAAGCC ACATATCCGC TGTACCAAAT CATCAAGTTC ACAGAATTAC GCCGTTAAAA ATACGTGCAC ATGTAGCAAT GTCAGCTAAT 
TTTGGCTTTG AACTTGATTT GACAAAATTG AGCAGTGAAG AAAAAGATGA AATAAAGAAA TATGTTGAAA AGTATAAAGA GATTAGGAAA CTGGTGCAAT TTGGAGATTT TTATCGATTG TTAAGTCCCT TTGAAGGGAA TGAAACTGCT TGGTTGATTG TTTCTGAGGA TAAAAGAGAA TTTTTGCTCT ATTATTTTAG AGTTTTGGGA GGAGCAAATG AACCTATTAA AAGACTTCGT CTAAAAGGGA TAAATCCAGA TTTTAATTAT GTTTTAGAAG ATGATGGTAG TGAATATAGT GGTGATGAAC TTATGTATGC AGGGAAAGTA ATTCCAGAAC TTAAAGGGGA CTTTCAGAGC ATTATGATGC ATTTTAAGGA GGAGAGTATT AAAGATGGGT AG
|
Protein sequence | MSIHFNDKTK TFYLTAKDTS YVIYVLKNGA VLHAYFGKII KTPNIYHLLK LPHISIDNDI INFGNQLMLD FLPQEYPAYG NTDFRSPAYQ IQLENGSTVS DLRYLSHKIY KGKPKLEGLP ATYVENEDEA DTLELELYDK VANLKVTLIY TAFRDYDVIT RSVRFENMGK EDIKLLRALS MNVDFNDSNF DMLQLSGAWA RERHVIRRPL VPGAQSIESR RGASSHQQNP FIALLRNDAD EWHGDVYGFS LVYSGNFLAQ VEVDQYNMAR VSMGINPFDF SWLLKPGETF QTPEVVMVYS DGGLNKMSNT YHKLYRNRLM RSKFKDRETP ILINNWEATY FDFTEEKLKE LAKEAKDLGI ELFVLDDGWF GKRNSDNSSL GDWFVNKEKI PSGLDGLAKG INSLGLKFGL WMEPEMVSPD SDLYREHPDW CIHVPNRSRS ESRNQLVLDL SRKDVQDYII KVVSDILESA NISYVKWDMN RNMTEIGSAL LPPERQRETA HRYILGLYRI LEEITTRFPD VLFESCAGGG GRFDPGMLYY MPQTWTSDDT DAIERLKIQY GTSIVYPLIS MGSHISAVPN HQVHRITPLK IRAHVAMSAN FGFELDLTKL SSEEKDEIKK YVEKYKEIRK LVQFGDFYRL LSPFEGNETA WLIVSEDKRE FLLYYFRVLG GANEPIKRLR LKGINPDFNY VLEDDGSEYS GDELMYAGKV IPELKGDFQS IMMHFKEESI KDG
|
| |
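The Start bp, End bp, Gene Length, and Protein Length fields in the record above are arithmetically related, and the sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive, and that the listed gene sequence includes its terminal stop codon (it ends in TAG); the record does not state either convention, although both are consistent with the numbers. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record (Teth514_1094, replicon NC_010320).
start_bp = 1131026
end_bp = 1133227
gene_length_bp = 2202
protein_length_aa = 733


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks hold for this record: 1133227 - 1131026 + 1 = 2202 bp, and 2202 / 3 - 1 = 733 aa. The same check would not apply directly to spliced genes or pseudogenes, which this record reports as "No" for both.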