Gene Information |
Locus tag | Athe_0426 |
Symbol | |
ID | 7407503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 484794 |
End bp | 487097 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643714813 |
Product | peptidase U32 |
Protein accession | YP_002572331 |
Protein GI | 222528449 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
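The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; both assumptions are consistent with the values in the table but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 484794
END_BP = 487097
PROTEIN_LENGTH_AA = 767


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 2304
print(expected_protein_length(length_bp))  # 767
```

Both results agree with the Gene Length (2304 bp) and Protein Length (767 aa) fields.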
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TAGAGTTACT GGCACCAGCA GGTGGGTTTG AAGAGCTCAT TGCAGCCATA AAAGCTGGGG CTGACAGCGT GTATGTTGGT GCAAAAGAGT TCTCAGCAAG AGCGTATGCA AAGAACTTTT CAGAAGATGA GCTTAAAAAA GCCATAGATT TTTGTCATGA GAGAGGAAAA AAGATATATC TTGCAATAAA CACCCTGATT TACAATGATG AGATGCGCAA AGCTTTGAAG CTTTTAGAGT TTGCATATAA AGAAGGAATT GATGCTGTTA TTGTGCAGGA TATAGGTCTT CTTTTCATTA TGATAAATGA ATTTCCCGAC CTACCTATTT ATGCAAGCAC ACAGATGACA GTTCATAACT TGGCTCAGGT AAAGTTTTTG GAAGGCTTAG GAGTAAAGAG AGTTATACTC TCAAGAGAGC TCTCAATAGA TGAGATAAAG AACATAAGAC AGCAAAGTAG TATTGAACTT GAGGTTTTTG TGCATGGAGC TTTGTGTGTT TCATATTCTG GTCAGTGTCT TTTTTCAAGC ATAATTTTTA AAAGAAGTGG CAACAGAGGG CAGTGTGCCC AGCCTTGCAG GCTTTATTAT AAGCTATTAG ATAAGGAAAA AAAAGTAATT GATAGAGGAT ATCTTCTTTC ACCAAAGGAT ATTTGTCTTT TAGAAAACAT AGATAAACTA ATCGAAGCAG GGGTTGACTC TTTCAAGATA GAGGGAAGAT TAAAGGACCA TTATTATGTG TACACTGTGA GCTCAATCTA TAGAAAATAT ATTGATATGT ATTACGAGAA AGGCGAAATA ACAATCGACA GCGCTGATAA GCAAAAACTT CTACTTGTTT TCAACAGAGG AAACTTCAGT ACTGGATACT TAGAAAATAC TGATATAGAT AGAATAATCT TTAAGAAAGC ACCTAACAAT ACAGGTCTTT TTATTGGGAA ATTTTATTTT GAGAACGAAA CCCTTTTTTT GCAGACTTCA TATAACCTTT CAAACGGTGA TGTGATTTCT TTTAGAAACA AAAATTTTGA AGAGATTCTT CTTGAAATAA ATAACAATAT TATTAAGAAA GATGACAAAA GATTTGAGGT GAAAGTTGAT TTTGAAAGGA AAAAGAGATT GAAAGAATTT TCTCAGGGTC AGGTGTTTAT TGTAAGAAAT AAAGAACATG AAATTAGAAT AGAAAAAGAA ATGAATAAGG AGAAAAAATT TAGGAAGGTT GATTTTAAAG TATGGATAGA GAAAGAAAAA AAAATAAAAG CTTTAGCAGC ATGTGATAGA TTTGAGGTAG TGGAAGAGGG GGAAGTTGTT CAGCAGGCAA AAGAGAAAAA AGTTACATCT GCTGCTGTAA TCAGCAGCTT TTCAAAACTG GGTGGAACAA TTTTTGAGAT GGGAAATTTT GATGCGCATA TTGAAGATGG CTGTTTTGTG AAGGTCTCAG AACTAAACAG GCTGAGAAAG GTATTGATTG AAAAGCTTTC TCAAAAGATA ATTAGCTTTT ATAAGAGAAG TCTAAAACAA GATGTTGAAA TTTCAAGGTA TTTAGAAGAT GGTTGTGCAA GGTCATTTAA TAGAAGTCAC CGGTTTTCTT TCATGATAGA TTCACTCTGG CAACTTGAAA AGCTTAAAAA GTGGTGTGAG GCACGCAATC TTTCTAACTA TGAAATCTAC ATTCCTTACA ATGTAATTTT TGATATAAAG ACAGATGACA ATATGGTTGC TTATCTTGAC AGGATAACAC ACGATGAAGA TTTAAAAAAG GTTGATGTTG AAAAAATAAA AGAAAAGGGT 
ATAAAAAAAG TTTTGGTAAG AAACCTTGGG CAGTATGAGA TTTTCAAGCA CAACTTTGAA ATTTATTTTG ATTTTAGCTT AAACACTACA AATTCTGTTT CATTAAAATT TTTAGAACTA CTTGGTGGTA AAAGAATCTG TCTTTCGGTT GAGCTATCTA AAACAAGAAT TATAGAAATT TACAAAAACG CACAAGAAAG TGAGATAGAA GTAATTGTCT TTGGTAGAAT TCCTCTGATG ATAAACAGGC TTAAATTTTT CGAAAAGGGA GAATATTTGC AAGACAGGAA CGGTGAGCTT TTGAAACTTA TAAAAACCCA AAGAGGGAAA AATGAAGTTT TAAACCCTGC ATTTTTGTAT ATAGACGATA AAGATGTGCC ATCTGATGTG CTGAGATTTG ATTTCACAGG CATAAATGAA AAAGAAATGG AAAAAGCTTT GGAAGGTTAT TTTGATAACA AGGGGATTGG TCTAAAAATT ACAAAGGGGT ATTATTTGTC ATGA
|
Protein sequence | MKKVELLAPA GGFEELIAAI KAGADSVYVG AKEFSARAYA KNFSEDELKK AIDFCHERGK KIYLAINTLI YNDEMRKALK LLEFAYKEGI DAVIVQDIGL LFIMINEFPD LPIYASTQMT VHNLAQVKFL EGLGVKRVIL SRELSIDEIK NIRQQSSIEL EVFVHGALCV SYSGQCLFSS IIFKRSGNRG QCAQPCRLYY KLLDKEKKVI DRGYLLSPKD ICLLENIDKL IEAGVDSFKI EGRLKDHYYV YTVSSIYRKY IDMYYEKGEI TIDSADKQKL LLVFNRGNFS TGYLENTDID RIIFKKAPNN TGLFIGKFYF ENETLFLQTS YNLSNGDVIS FRNKNFEEIL LEINNNIIKK DDKRFEVKVD FERKKRLKEF SQGQVFIVRN KEHEIRIEKE MNKEKKFRKV DFKVWIEKEK KIKALAACDR FEVVEEGEVV QQAKEKKVTS AAVISSFSKL GGTIFEMGNF DAHIEDGCFV KVSELNRLRK VLIEKLSQKI ISFYKRSLKQ DVEISRYLED GCARSFNRSH RFSFMIDSLW QLEKLKKWCE ARNLSNYEIY IPYNVIFDIK TDDNMVAYLD RITHDEDLKK VDVEKIKEKG IKKVLVRNLG QYEIFKHNFE IYFDFSLNTT NSVSLKFLEL LGGKRICLSV ELSKTRIIEI YKNAQESEIE VIVFGRIPLM INRLKFFEKG EYLQDRNGEL LKLIKTQRGK NEVLNPAFLY IDDKDVPSDV LRFDFTGINE KEMEKALEGY FDNKGIGLKI TKGYYLS
|
| |
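The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below is one way to strip the spacing, compute GC content, and translate a fragment; the helper names are hypothetical. The codon table is written out by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons). Only the first 30 bases of the gene sequence are used here for brevity.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(field: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the Gene sequence field.
fragment = clean_sequence("ATGAAAAAGG TAGAGTTACT GGCACCAGCA")
print(translate(fragment))  # MKKVELLAPA
print(f"GC fraction of fragment: {gc_fraction(fragment):.3f}")
```

The translated fragment matches the first block of the Protein sequence field. Running the same functions on the full gene sequence would give values to compare against the GC content and Protein Length fields.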