Gene Athe_0766 details

Gene Information

Locus tag: Athe_0766
Symbol:
ID: 7407953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 855507
End bp: 856787
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 34%
IMG OID: 643715144
Product: peptidase M16 domain protein
Protein accession: YP_002572654
Protein GI: 222528772
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
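
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 855507
end_bp = 856787

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add an amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1281 426
```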


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.227787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGGA TTTATGATGA GGCTCTTCAC GAAGAAGTTT ATATAAATTC GTATTCCAAC 
GGGCTAAAAG CTTTTGTTAT AAAAAAGAAA AACTTTAGCA AGGCATTTGC AGGTTTTGCA
ACAAAATACG GTTCTGTTGA TAGCAAATTT GTCCATCCAA AAACAAAAGA AATTGTTGAG
GTTCCAGATG GCATTGCACA TTTTTTGGAA CACAAATTGT TTGAGGAAGA AGAAGGAAAT
GTGTTTGACA GATTTGCAAA ATTTGGTGCA ATGGCAAATG CATTTACTTC CTTCAAAGAA
ACAGTCTACT ATTTTATATC CACTCAAAAC TTTTATGAAA ATTTTGAGAT TCTCTTAGAT
TTTGTTCAAA ATCCATATTT TACTGATCAA AATGTCGAGA AAGAAAAAGG AATAATTGGA
CAGGAGATAA GAATGTACCA GGACAATCCA AACTGGAGGG TTTATTTTAA TCTCTTGAAT
GCACTTTATG TAAACAATCC TGTGAAAATT GACATTGCAG GAACCTTAGA AAGTATTCAA
AAAATTACAA AAGAGGATTT GTATTTGTGT TATAATACAT TCTATCATCC AAGCAATATG
ATAATTGTTG TATGCGGTGA TGTGGACCCG CAAAAAGTTT TTGATATAAT TGAAAGGATG
GAAAAGACAA AAGAATATCA AAGCCTGATA GAAAGGATTT ATCCTGATGA GCCCGAAGAA
GTAAATCAAA AAAAGATAGA GGCAAGGCTT TCGGTGGCAG TGCCAATCTT CTATATAGGC
TTTAAAGACA ATCAAAATGA CCTTCCACCG TATGAGATGA TAATGAAGGA TATCCAAACA
CAGATAGTGG CTGAGATGTT ATTTGGAAAA TCCACAGATT TTTATGAGAA GCTTTATAAA
GAAGGGCTTA TCAATCAAAA CTTTGGGTTT GAGTACAACT GTGAGCCTGA ATATTCATTT
TTTATGATTG GTGGAGAAAG CAAAGACCCT GAAGAGGTCT ACAAGAGAAT AATAGAACAC
ATAGAGGGGG TCAAGAAAAA ACGAATTGAC AGGGCAGAGT TTGAGAGGGC AAAAAAGGTT
GTGCTGGGAA GCCACCTGAG AAAGTTTGAC AATCCTGAAA AACTCTCTGT TGAGTTTATA
TACAGCTATT TCAAAGGAGT CAATATTTTT GAATATGTTA AGGAAATCAC TTCTGTATCA
TTTGAAATGT GCGAAAAAAG GCTCAAAGAA TTTTTTGATG AGAGCTTGAG CTGTATATCA
ATTGTATGGC CTGCAGATTG A
 
Protein sequence
MERIYDEALH EEVYINSYSN GLKAFVIKKK NFSKAFAGFA TKYGSVDSKF VHPKTKEIVE 
VPDGIAHFLE HKLFEEEEGN VFDRFAKFGA MANAFTSFKE TVYYFISTQN FYENFEILLD
FVQNPYFTDQ NVEKEKGIIG QEIRMYQDNP NWRVYFNLLN ALYVNNPVKI DIAGTLESIQ
KITKEDLYLC YNTFYHPSNM IIVVCGDVDP QKVFDIIERM EKTKEYQSLI ERIYPDEPEE
VNQKKIEARL SVAVPIFYIG FKDNQNDLPP YEMIMKDIQT QIVAEMLFGK STDFYEKLYK
EGLINQNFGF EYNCEPEYSF FMIGGESKDP EEVYKRIIEH IEGVKKKRID RAEFERAKKV
VLGSHLRKFD NPEKLSVEFI YSYFKGVNIF EYVKEITSVS FEMCEKRLKE FFDESLSCIS
IVWPAD
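
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be recomputed the same way. The following is a minimal sketch in plain Python, not the tool that generated this record. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons (this gene begins with ATG, so none is needed here).

```python
# Standard genetic code in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid assignments; it differs in allowed starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGAAAGGA TTTATGATGA GGCTCTTCAC"))  # MERIYDEALH
print(translate("ATTGTATGGC CTGCAGATTG A"))           # IVWPAD
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content reported in the record.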