Gene Athe_0426 details


Gene Information

Locus tag: Athe_0426
Symbol:
ID: 7407503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 484794
End bp: 487097
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 33%
IMG OID: 643714813
Product: peptidase U32
Protein accession: YP_002572331
Protein GI: 222528449
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
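
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 484794
end_bp = 487097
gene_length_bp = 2304
protein_length_aa = 767


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare the derived value with the value stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the record: the span covers 2304 bp, and 2304 / 3 = 768 codons, one of which is the stop codon.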


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGG TAGAGTTACT GGCACCAGCA GGTGGGTTTG AAGAGCTCAT TGCAGCCATA 
AAAGCTGGGG CTGACAGCGT GTATGTTGGT GCAAAAGAGT TCTCAGCAAG AGCGTATGCA
AAGAACTTTT CAGAAGATGA GCTTAAAAAA GCCATAGATT TTTGTCATGA GAGAGGAAAA
AAGATATATC TTGCAATAAA CACCCTGATT TACAATGATG AGATGCGCAA AGCTTTGAAG
CTTTTAGAGT TTGCATATAA AGAAGGAATT GATGCTGTTA TTGTGCAGGA TATAGGTCTT
CTTTTCATTA TGATAAATGA ATTTCCCGAC CTACCTATTT ATGCAAGCAC ACAGATGACA
GTTCATAACT TGGCTCAGGT AAAGTTTTTG GAAGGCTTAG GAGTAAAGAG AGTTATACTC
TCAAGAGAGC TCTCAATAGA TGAGATAAAG AACATAAGAC AGCAAAGTAG TATTGAACTT
GAGGTTTTTG TGCATGGAGC TTTGTGTGTT TCATATTCTG GTCAGTGTCT TTTTTCAAGC
ATAATTTTTA AAAGAAGTGG CAACAGAGGG CAGTGTGCCC AGCCTTGCAG GCTTTATTAT
AAGCTATTAG ATAAGGAAAA AAAAGTAATT GATAGAGGAT ATCTTCTTTC ACCAAAGGAT
ATTTGTCTTT TAGAAAACAT AGATAAACTA ATCGAAGCAG GGGTTGACTC TTTCAAGATA
GAGGGAAGAT TAAAGGACCA TTATTATGTG TACACTGTGA GCTCAATCTA TAGAAAATAT
ATTGATATGT ATTACGAGAA AGGCGAAATA ACAATCGACA GCGCTGATAA GCAAAAACTT
CTACTTGTTT TCAACAGAGG AAACTTCAGT ACTGGATACT TAGAAAATAC TGATATAGAT
AGAATAATCT TTAAGAAAGC ACCTAACAAT ACAGGTCTTT TTATTGGGAA ATTTTATTTT
GAGAACGAAA CCCTTTTTTT GCAGACTTCA TATAACCTTT CAAACGGTGA TGTGATTTCT
TTTAGAAACA AAAATTTTGA AGAGATTCTT CTTGAAATAA ATAACAATAT TATTAAGAAA
GATGACAAAA GATTTGAGGT GAAAGTTGAT TTTGAAAGGA AAAAGAGATT GAAAGAATTT
TCTCAGGGTC AGGTGTTTAT TGTAAGAAAT AAAGAACATG AAATTAGAAT AGAAAAAGAA
ATGAATAAGG AGAAAAAATT TAGGAAGGTT GATTTTAAAG TATGGATAGA GAAAGAAAAA
AAAATAAAAG CTTTAGCAGC ATGTGATAGA TTTGAGGTAG TGGAAGAGGG GGAAGTTGTT
CAGCAGGCAA AAGAGAAAAA AGTTACATCT GCTGCTGTAA TCAGCAGCTT TTCAAAACTG
GGTGGAACAA TTTTTGAGAT GGGAAATTTT GATGCGCATA TTGAAGATGG CTGTTTTGTG
AAGGTCTCAG AACTAAACAG GCTGAGAAAG GTATTGATTG AAAAGCTTTC TCAAAAGATA
ATTAGCTTTT ATAAGAGAAG TCTAAAACAA GATGTTGAAA TTTCAAGGTA TTTAGAAGAT
GGTTGTGCAA GGTCATTTAA TAGAAGTCAC CGGTTTTCTT TCATGATAGA TTCACTCTGG
CAACTTGAAA AGCTTAAAAA GTGGTGTGAG GCACGCAATC TTTCTAACTA TGAAATCTAC
ATTCCTTACA ATGTAATTTT TGATATAAAG ACAGATGACA ATATGGTTGC TTATCTTGAC
AGGATAACAC ACGATGAAGA TTTAAAAAAG GTTGATGTTG AAAAAATAAA AGAAAAGGGT
ATAAAAAAAG TTTTGGTAAG AAACCTTGGG CAGTATGAGA TTTTCAAGCA CAACTTTGAA
ATTTATTTTG ATTTTAGCTT AAACACTACA AATTCTGTTT CATTAAAATT TTTAGAACTA
CTTGGTGGTA AAAGAATCTG TCTTTCGGTT GAGCTATCTA AAACAAGAAT TATAGAAATT
TACAAAAACG CACAAGAAAG TGAGATAGAA GTAATTGTCT TTGGTAGAAT TCCTCTGATG
ATAAACAGGC TTAAATTTTT CGAAAAGGGA GAATATTTGC AAGACAGGAA CGGTGAGCTT
TTGAAACTTA TAAAAACCCA AAGAGGGAAA AATGAAGTTT TAAACCCTGC ATTTTTGTAT
ATAGACGATA AAGATGTGCC ATCTGATGTG CTGAGATTTG ATTTCACAGG CATAAATGAA
AAAGAAATGG AAAAAGCTTT GGAAGGTTAT TTTGATAACA AGGGGATTGG TCTAAAAATT
ACAAAGGGGT ATTATTTGTC ATGA
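
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any calculation such as GC content. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and only the first line of the sequence is used as demo input.

```python
def clean_sequence(text):
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a small demo input.
first_line = "ATGAAAAAGG TAGAGTTACT GGCACCAGCA GGTGGGTTTG AAGAGCTCAT TGCAGCCATA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content field reported in the record.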
 
Protein sequence
MKKVELLAPA GGFEELIAAI KAGADSVYVG AKEFSARAYA KNFSEDELKK AIDFCHERGK 
KIYLAINTLI YNDEMRKALK LLEFAYKEGI DAVIVQDIGL LFIMINEFPD LPIYASTQMT
VHNLAQVKFL EGLGVKRVIL SRELSIDEIK NIRQQSSIEL EVFVHGALCV SYSGQCLFSS
IIFKRSGNRG QCAQPCRLYY KLLDKEKKVI DRGYLLSPKD ICLLENIDKL IEAGVDSFKI
EGRLKDHYYV YTVSSIYRKY IDMYYEKGEI TIDSADKQKL LLVFNRGNFS TGYLENTDID
RIIFKKAPNN TGLFIGKFYF ENETLFLQTS YNLSNGDVIS FRNKNFEEIL LEINNNIIKK
DDKRFEVKVD FERKKRLKEF SQGQVFIVRN KEHEIRIEKE MNKEKKFRKV DFKVWIEKEK
KIKALAACDR FEVVEEGEVV QQAKEKKVTS AAVISSFSKL GGTIFEMGNF DAHIEDGCFV
KVSELNRLRK VLIEKLSQKI ISFYKRSLKQ DVEISRYLED GCARSFNRSH RFSFMIDSLW
QLEKLKKWCE ARNLSNYEIY IPYNVIFDIK TDDNMVAYLD RITHDEDLKK VDVEKIKEKG
IKKVLVRNLG QYEIFKHNFE IYFDFSLNTT NSVSLKFLEL LGGKRICLSV ELSKTRIIEI
YKNAQESEIE VIVFGRIPLM INRLKFFEKG EYLQDRNGEL LKLIKTQRGK NEVLNPAFLY
IDDKDVPSDV LRFDFTGINE KEMEKALEGY FDNKGIGLKI TKGYYLS