Gene Information |
Locus tag | Athe_2370 |
Symbol | |
ID | 7407789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 2517530 |
End bp | 2520610 |
Gene Length | 3081 bp |
Protein Length | 1026 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716734 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_002574213 |
Protein GI | 222530331 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
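The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Athe_2370.
length = gene_length_bp(2517530, 2520610)
residues = protein_length_aa(length)
print(length, residues)
```

With the values in this record, the sketch gives 3081 bp and 1026 aa, which agrees with the Gene Length and Protein Length fields.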
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0768675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTGACA AATCTTTTTA TCAAAATCCA AAGATTCAGC ATGTGAACAT GGCACAGCCA AGGTCATCAT TTGTTCCTTA TGCCAGTGTA GAAAATGCTT TGCTTGGTGA TTGGGAACTT TCAGAATACT TGAGATTACT TAATGGAAAC TGGTATTTTA AGTTATTTGA TATGCCCTGT AAAGTTGACC AAGAGATTAT AAAATCAGAT GCTAAATTTA CAGGATTTGA TAAAATTATT GTTCCAAGTA ATTTTCAGCT TTTTGGTTAC GACAAACCAA TATACACTAA TACACGGTAT CCCATCCCAG TTGACCCACC ATATGTACCT GATATTAATC CTACTGGGGT GTATAAGAAA GAGATTTTTA TTTCAAAAGA GGATAAAGAA AAGAGAATAT TTTTAGTCTT TGAGGGTGTG GATTGTGCTT TTTATTTATA TGTAAATCAG GAGTTTGCTG GATTTTCAAA AGTAAGCCAC ATGATGCATG AGTTTGACAT CACAGACCTA TTGACTGAAG GTAAAAATAT TATAACAGTT GCAGTTTTAA AATATGCAGA CAGTACATAT TTAGAAGACC AAGACAAGTG GCGAATGAGT GGAATATTTA GAGATGTATA TTTACTTGTG CGCCCCAAGA TATATCTCAA AGACATTTAT CTCAAGCCTG ACCTTAACGA TAACCTAACT AAAGGCAGTT TGGTAGCAGA AATCGAAATC TCAAATTCAA ATGAAGAAAA ATCAGAATTT TATGTAGATG TACAAATATA CGCAGACAAA GAACTACTAA AATCTGCCTC AAAACTTGCA AGCATTTTAC CAAAAACCAC TGAACAGTTC AGATTTGAGT TCCAAATTGA AAAACCATTG CTTTGGAGTG CTGAAACGCC AAATCTCTAC AAATTTATTG TGATTATAAA GGATACATCA GGTAAGATTT TGGAGGTCAT CCCTCAAAAC TTTGGGTTTA GGAAAATTGA AATTAAAGAT GGCGTATTTT ACCTAAATAA TGTACCCATT AAATTAAAAG GTGTCAACAG ACACGACATG CACCCAAGAG TTGGATTTGC AGTGACAAGA AAGATGATGC AAGAGGACAT AACTTTGATG AAACAGCACA ATATAAACTG TGTAAGAACT TCACACTATC CAAATCATCC TTATTTTTTG GAGCTGTGTG ACAAATTTGG GATTTACGTA ATTGATGAAG CTGATTTAGA AACACATGGA TTTGGAGCAG TTGGTGACTG GGGACTTTTA GCAAAAGACC CTGTGTGGGA AGATGCGTTT CTCGAAAGGG CAAAGATGAT GGTAATGAGG GACAAAAACC ACCCATCAAT CATTATGTGG TCGCTTGGAA ATGAATCTGG CTATGGTCCA AACCACGATA AAATGGCTCA GTGGATAAGG TCATACGATA ATAGCCGTCC TATTCATTAC GAAAGTGCCC GTGATGCAGA GGTTGTGGAT ATTGTAAGTG TAATGTACCC TCCTGTAGAA AAACTTGAAG AAGAGGGCAA AAAACAAGAA AAAAGGCCAT TTTTTATGTG TGAGTATGCA CATGCAATGG GAAATGGTCC TGGAAACCTC AAAGAGTATT GGGATGTGAT ATATAAATAC CCAAGGCTTT TAGGTGGATG TGTGTGGGAG TGGGCAGACC ACGGAATTTT GACAAAGACA CCTGATGGGA AAGAGTACTA TGCCTACGGC GGGGATTTTG GAGATGAGCC AAATGATGGT AACTTCTGTA TAGATGGACT TCTTTTCCCC GACAGAACTC CGTCGCCCGG GATGATTGAG 
CTGAAAAAAG TTTATGAGCC AGTTATGATT GAACTTTTAG ATAAAAAAAG CGGAATTTTC AAGGTAACAA ACAGGTATGA TTTTATATCT TTGAATCACA TTGAAGTTGA ATGGGAACTT TTGTTAGGCG GCAGGGTTGT GAAAGAGGGC TTTGTTGATG TGAGCGATGT ACTGCCTCAC TCTTCAAAAG AGGTCAAGAT TGATGAAGTC AAAGAAGTTT TAGAAAGTTG CAAGGAAGAG CTTTTTATTA CCTTCACGGC AAAGCTCAAA AACTCAATGC CTTGGGCAAA AAGAGGTTTT GTTATAACAA AATCTCAGAT TGCAATTAAA GAAGAAACAT CTCAAGATGC TGTGCAAAAA ATTGAGAAGA TAAATGCCAT TTTATCAAAG CAAGACAGGT TTGAGGTTTG CAAGTTACAT GACAAATTGG TAGTTTTCGC AGGTAACACA GAAGTAGAGT TTTGCCGATG GACAGGTGAT TTGGTAAGCT TGAAACATAG TGACCTTGAG CTTATAAAAT CCTCACCAAG ATTTAATATT TGGAGGGCTC CAACAGATAA TGATGTGCAT ATCAAAAACG AATGGATAAA AGCTGGATTT GACAAGCTTC AAAGAAGAAT TGTAAATGTC AGCTTTGAAG AACACAGCCA GTACTTCAAA GTACAAACAA CTTCGGTATA TGGCGCATAT TCAGTAAAAC CTGGATTTGA AGTAACCACA AGCTACAAGG TTTTCAAATC GGGAATTGTA GAGACAAATG TGTATGCGCA AGCTCTCAGA CAGCTTCCGC CACTTCCAAA GATAGGACTG CAATTTATGA TGCCAAAAGA GTTTGAGTAT GTCAAATATT ATGGACGTGG ACCTCATGAA AACTATCCTG ATATAAAACA GAGCGCAACT GTAGAAATAT ATGACATGGC TATAAAAGAC ATGTACGTTC CATATATAAT GCCCCAGGAA TATGGAAACA GGTGCGACGT TAGATGGGCT TTTGTATACA ACATCTATGG AATAGGACTT TGTATTAGAG GCATCCCTAC ATTTAACTTC AGTGCAAGAG AATACACTGA TGATGTGCTC ACAAAAGCAA AACACTCATA TGAGCTGACA AAAGCAGATG GAATTGTTGT GAATGTTGAC TTTAAGATTG GTGGTATCGG AAGCCAGAGC TGCGGTCCCG GCCCACTTGA GAAGTACTTA GTCAAGGATG ACAAATTTGA ATTTTGCTTT TATATGATAC CTGTTGATAG CAATAGTTTA GACGTTGAAA AGCTATGGTA A
|
Protein sequence | MLDKSFYQNP KIQHVNMAQP RSSFVPYASV ENALLGDWEL SEYLRLLNGN WYFKLFDMPC KVDQEIIKSD AKFTGFDKII VPSNFQLFGY DKPIYTNTRY PIPVDPPYVP DINPTGVYKK EIFISKEDKE KRIFLVFEGV DCAFYLYVNQ EFAGFSKVSH MMHEFDITDL LTEGKNIITV AVLKYADSTY LEDQDKWRMS GIFRDVYLLV RPKIYLKDIY LKPDLNDNLT KGSLVAEIEI SNSNEEKSEF YVDVQIYADK ELLKSASKLA SILPKTTEQF RFEFQIEKPL LWSAETPNLY KFIVIIKDTS GKILEVIPQN FGFRKIEIKD GVFYLNNVPI KLKGVNRHDM HPRVGFAVTR KMMQEDITLM KQHNINCVRT SHYPNHPYFL ELCDKFGIYV IDEADLETHG FGAVGDWGLL AKDPVWEDAF LERAKMMVMR DKNHPSIIMW SLGNESGYGP NHDKMAQWIR SYDNSRPIHY ESARDAEVVD IVSVMYPPVE KLEEEGKKQE KRPFFMCEYA HAMGNGPGNL KEYWDVIYKY PRLLGGCVWE WADHGILTKT PDGKEYYAYG GDFGDEPNDG NFCIDGLLFP DRTPSPGMIE LKKVYEPVMI ELLDKKSGIF KVTNRYDFIS LNHIEVEWEL LLGGRVVKEG FVDVSDVLPH SSKEVKIDEV KEVLESCKEE LFITFTAKLK NSMPWAKRGF VITKSQIAIK EETSQDAVQK IEKINAILSK QDRFEVCKLH DKLVVFAGNT EVEFCRWTGD LVSLKHSDLE LIKSSPRFNI WRAPTDNDVH IKNEWIKAGF DKLQRRIVNV SFEEHSQYFK VQTTSVYGAY SVKPGFEVTT SYKVFKSGIV ETNVYAQALR QLPPLPKIGL QFMMPKEFEY VKYYGRGPHE NYPDIKQSAT VEIYDMAIKD MYVPYIMPQE YGNRCDVRWA FVYNIYGIGL CIRGIPTFNF SAREYTDDVL TKAKHSYELT KADGIVVNVD FKIGGIGSQS CGPGPLEKYL VKDDKFEFCF YMIPVDSNSL DVEKLW
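The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, which uses the standard codon assignments but accepts alternative start codons such as GTG. The sketch below illustrates this on the first and last few codons of the record. It assumes the sequence is given as space-separated blocks of bases, as shown above, and that an initiating codon is read as methionine. The helper names and the reduced start-codon set are illustrative assumptions, not a full implementation of the table.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order; table 11 shares these.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Subset of table 11 start codons, chosen for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS fragment, reading an initial start codon as M and
    stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and last five codons of the gene sequence above.
print(translate("GTGCTTGACA AATCTTTTTA TCAAAATCCA"))
print(translate("GAAAAGCTATGGTAA"))
```

The first call reproduces the opening residues of the protein sequence (MLDKSFYQNP) and the second reproduces its final residues (EKLW). Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.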
|
| |