Gene Athe_1798 details


Gene Information

Locus tag: Athe_1798
Symbol:
ID: 7408585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1870537
End bp: 1871730
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 36%
IMG OID: 643716175
Product: carboxyl-terminal protease
Protein accession: YP_002573664
Protein GI: 222529782
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
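
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1870537
end_bp = 1871730

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.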


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GACTTCAAAC TTTTGCAATA GTTCTCATAA CTGCAATTGT AACATATATT 
GCGACCACTT ATGTCTATTT TGGAAGTCCT ATATATACAA ACAAATTAGT AACTAATCCT
AAATTATCTA AGGTTATATG GCTTTTGAAA AAATACTACT ATGAGCCTAA GGATATAAGT
GACCAGAAAA TTGTAGACGG TGCAATAGAT GGGATTGCCG CAAGTGTTGG TGATCCGTAC
ACTGAGTATT TTACTAAAAA AGAATATGAA GAGTTCATGA TACAAAGTAA AGGTACGTAT
TTTGGGGTAG GAGTAACAAT AGAGCCTGGC GAACATTATA TCGAAGTTGT AACACCCTTT
GAAGGTTCTC CGGCGTACAA GGCGGGGATA AAACCAGGGG ATAAGATTAT AAAAGTAAAT
GGAATAAGTT TGACATCAAA AGATATAGAA AAGGCTGTAA GTTTGATGAG AGGGCCAAAA
GGAACAAGCG TGACAGTTAC AATTTTGCGC GATGGCAGCT CAAAGCCTAT TGACCTTAAG
ATTGTCAGAG ACGAGATAAA AATAAAGACT GTATCTACTT CCATTTTTGA AAACAACATA
GGTTATATCA AAATCACTAA CTTTGATGAA AATACTCCTC AGGACTTTTA CAATAGCTAT
GACAAACTCA AAAGCTCTGG CTGCCGTGGA CTTGTCATTG ACCTGAGATT TAACCCTGGT
GGGCTTTTAG AGTCTGTTGT TGACATTGCA AGCAATTTTC TCAAGAAAGG ACAGCTTATA
GTGTATCTCA AGGACAGATA CAATAACAAA GAGTATTTCA AATCATACAA AAATGGTGAC
ACGGTAACAC CGCTTGTGGT GCTTACCAAT AAGTATTCAG CGTCAGCTTC AGAGATATTA
GCTGGATGTT TAAAAGACCA AAAGAGGGCA AAAATTGTTG GTGAGAAGAC TTTTGGCAAA
GGCGTTGTTC AGCAGGTATT TGACCTGGGA GATGGGTCTG CAATAAAAAT AACAGTAAGC
CAGTATCTTT TGCCAAGTGG AGCATATATT CACAAAAGAG GAATAAAGCC AGATATTAAA
GTAGTTCAAC CCAAAGAGTA TCAGGACAAA ATGAATGTTC CAATGGATAA AGATTTGCAG
CTGAAAAAAG CTATTGAGAT ATTAAAGAGT GAAATTTCAA AGAGCAAGTT TTGA
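
The GC content field of the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in blocks of ten bases separated by whitespace, as shown above. The function name `gc_fraction` is hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces and newlines between blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGAAAAAAA GACTTCAAAC TTTTGCAATA GTTCTCATAA CTGCAATTGT AACATATATT"
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

Passing the full 1194 bp sequence instead of the first line gives the value to compare with the GC content field.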
 
Protein sequence
MKKRLQTFAI VLITAIVTYI ATTYVYFGSP IYTNKLVTNP KLSKVIWLLK KYYYEPKDIS 
DQKIVDGAID GIAASVGDPY TEYFTKKEYE EFMIQSKGTY FGVGVTIEPG EHYIEVVTPF
EGSPAYKAGI KPGDKIIKVN GISLTSKDIE KAVSLMRGPK GTSVTVTILR DGSSKPIDLK
IVRDEIKIKT VSTSIFENNI GYIKITNFDE NTPQDFYNSY DKLKSSGCRG LVIDLRFNPG
GLLESVVDIA SNFLKKGQLI VYLKDRYNNK EYFKSYKNGD TVTPLVVLTN KYSASASEIL
AGCLKDQKRA KIVGEKTFGK GVVQQVFDLG DGSAIKITVS QYLLPSGAYI HKRGIKPDIK
VVQPKEYQDK MNVPMDKDLQ LKKAIEILKS EISKSKF
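
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the NCBI compact codon ordering (bases in the order T, C, A, G) and relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Table 11 differs in its set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored. Alternative start
    codons of table 11 are not handled (an assumption of this sketch).
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record.
first_line = "ATGAAAAAAA GACTTCAAAC TTTTGCAATA GTTCTCATAA CTGCAATTGT AACATATATT"
last_line = "CTGAAAAAAG CTATTGAGAT ATTAAAGAGT GAAATTTCAA AGAGCAAGTT TTGA"
print(translate(first_line))
print(translate(last_line))
```

The two fragments translate to the first 20 and last 17 residues of the protein sequence in the record. The last line can be translated on its own because the 19 preceding lines of 60 bases each keep it in frame.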