Gene Information |
Locus tag | Athe_0468 |
Symbol | |
ID | 7407547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 538102 |
End bp | 540291 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714856 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_002572373 |
Protein GI | 222528491 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
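The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Athe_0468 record above.
length = gene_length(538102, 540291)
print(length)                           # 2190
print(expected_protein_length(length))  # 729
```

Under these assumptions the computed values agree with the Gene Length (2190 bp) and Protein Length (729 aa) fields of the record.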
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000274987 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCAATAA CCTTTAACCC ACAAACAAAT ATGTTTTTCA TAGAAGCAAA GAACACAAGC TATGTGATAA AGCTTTTCAA AGGAAAGTTT TTGTCCCATG TTTATTGGGG GAAAAAAATT AAAGAATTTG AGTGGACAGA TTTTGATGTG ACAGGAGGAA GAGCATTTGG TGCAACACCT GACCCAAATG ACAAAACATA CTCATTTGAT ACAATGCTTT TAGAATACCC TGCATATGGA AATTCAGATT TCAGACACCC TGCATATCAG ATAGAACAGG AAGACGGCTC TCGCATTACA AACTTAGTTT ACAAAACTCA CAGAATCTAT GATGGAAAGC CCAAACTTGA AGGTCTTCCA ACAACATATG TTGAGTCACC TGATGAAGCC CAGACACTGG AGATAGAGCT TTATGATGAT TTGATTGATT TGAAAGTCAC ATTGATTTAT ACAGCCTACA AAGATTATGA TGCAATAACA AGAAGCGTAA GGTTTGAGAA CTTAGGAAAA CAAACTCTCA AAATCCTTCG TGCAATGAGC GCGTGTGTTG ACTTTCCAGA AGGGGATTTT GAACTTTTGC ATCTTTGGGG TTCATGGGCA AGAGAAAGAT ACATCGAGAG AACTCCACTT ATTCACGGAA CCCAGGTAAT TGAAAGTGCA AGAGGCGAAA GCTCACATCA GCACAACCCA TTTATAGCAC TTTTGTCAAA GGATGCAACC GAAAAACATG GCGATGTGTA TGGCTTTTCT CTTGTCTATA GTGGAAACTT TGCTGCAATT GTGGAAAAAG ACCAGTACAA TCTTGTAAGA GTCACTATGG GAATAAATCC ATTTGAGTTT ACATGGGTTT TAGAGCCGCA AAGCAGTTTT CAGACACCAG AGGTTGTGAT GGTTTACTCT AATGAGGGCT TAGGAGGAAT GTCTCGCACA TACCACAAGC TTTACAGAAA AAGACTTTGC AGAGGAGCAT ATCGGGATAA AAGAAGACCA ATTCTGATTA ACAACTGGGA GGCTACATAT TTCAATTTCA ATGAAGAAAA ACTTCTTTCT TTGGCAAAAG AGGCAAAAGA TCTTGGGATT GAGCTGTTTG TTTTAGATGA TGGTTGGTTT GGTAAAAGAG ACGATGATAC AAGCTCACTT GGAGACTGGT TTGTTGACAG AAGAAAGCTT CCAAACGGTT TGGACGGGCT TGGGAAAAAG TTAAATGAAA TGGGGCTCAA ATTTGGACTG TGGTTTGAGC CTGAGATGGT TTCGCCTGAT AGCGAACTTT ACAGAAAGCA TCCTGATTGG TGCATACAGG TACGAGGAAG AACGTTGACA CAATGCAGAA ACCAGTACGT TTTGGACATC ACAAGAGAAG ATGTTAGAAA AGAAATTTTA AGGATGATGA AAGAGATTCT AAAAGCAGCT CCAATTGAAT ATATCAAGTG GGACATGAAC AGGCCCTTAA CAGAGATAGG TTCGCTTGAG CTCCCACCAG AGAGACAAAA AGAGGTCTTC CACAGATATG TTCTGGGACT TTATCAAATG ATGGAAGAGC TGACAATGGA GTTTCCACAT ATTTTGTTTG AAGGATGTTC TGGCGGTGGT GGAAGGTTTG ATCCGGGAAT TTTGTATTAC ATGCCTCAAA TTTGGACGAG TGATGACACA GACGCAATCG AAAGGCTTAA AATCCAGTTT GGAACAAGCA TAGTTTATCC TGCATCAACT ATGGGTGCGC ATGTATCAAT TGTGCCAAAC CATCAGGTTG GCAGGATAAC ACCAATGAAG ACAAGAGGGG TTGTAGCGCT TTCAGGCTGT 
TTTGGATATG AACTTGATTT AACAAAGCTA TCTCAAGAGG ACAAAGAAGA GATTAAGAGA CAAATTGAGC TTTATAAGAG AATATGGCAT ATAGTATTTG AAGGAGATTT GTACAGATTA ATTTCTCCAT TTGAGGGAAA TAGCGCTGCA TGGATGTATG TGACAGAGGA TAAGAAAGAG GCAGTTGTAT TCTATGTTGA AATTTTAAGG CAGCCAAACC CACCAATCAA AAGGTTAAAA TTAGATGGTC TTGACCCCAG CAAGAGCTAT TTAATTGAAG GTGAGCAAAA AACAAGGTTT GGCGATGAGC TTATGAACAT AGGGCTTATG ATTCCTCAGA TGTGGGGTGA TTTTAATTCT CATATGTGGA TTTTAAAAGC AGTTGATTAG
Protein sequence | MPITFNPQTN MFFIEAKNTS YVIKLFKGKF LSHVYWGKKI KEFEWTDFDV TGGRAFGATP DPNDKTYSFD TMLLEYPAYG NSDFRHPAYQ IEQEDGSRIT NLVYKTHRIY DGKPKLEGLP TTYVESPDEA QTLEIELYDD LIDLKVTLIY TAYKDYDAIT RSVRFENLGK QTLKILRAMS ACVDFPEGDF ELLHLWGSWA RERYIERTPL IHGTQVIESA RGESSHQHNP FIALLSKDAT EKHGDVYGFS LVYSGNFAAI VEKDQYNLVR VTMGINPFEF TWVLEPQSSF QTPEVVMVYS NEGLGGMSRT YHKLYRKRLC RGAYRDKRRP ILINNWEATY FNFNEEKLLS LAKEAKDLGI ELFVLDDGWF GKRDDDTSSL GDWFVDRRKL PNGLDGLGKK LNEMGLKFGL WFEPEMVSPD SELYRKHPDW CIQVRGRTLT QCRNQYVLDI TREDVRKEIL RMMKEILKAA PIEYIKWDMN RPLTEIGSLE LPPERQKEVF HRYVLGLYQM MEELTMEFPH ILFEGCSGGG GRFDPGILYY MPQIWTSDDT DAIERLKIQF GTSIVYPAST MGAHVSIVPN HQVGRITPMK TRGVVALSGC FGYELDLTKL SQEDKEEIKR QIELYKRIWH IVFEGDLYRL ISPFEGNSAA WMYVTEDKKE AVVFYVEILR QPNPPIKRLK LDGLDPSKSY LIEGEQKTRF GDELMNIGLM IPQMWGDFNS HMWILKAVD
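The gene and protein sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such a sequence could be cleaned, checked for GC content, and translated. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the
# standard code (the two differ only in which start codons are permitted).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
print(translate("ATGCCAATAA CCTTTAACCC"))  # MPITFN
```

Passing the complete Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is one way to confirm that the two fields agree. Passing it to `gc_percent` gives a value to compare with the GC content field.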