Gene Anae109_3346 details

Gene Information

Locus tag: Anae109_3346
Symbol:
ID: 5374977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3909208
End bp: 3910965
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 74%
IMG OID: 640844859
Product: peptidase M61 domain-containing protein
Protein accession: YP_001380514
Protein GI: 153006189
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
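
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3909208, 3910965)
print(length, protein_length_aa(length))  # 1758 585
```

Both results match the Gene Length and Protein Length fields in the record.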


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCTACC GGCTCTCGCT CCCCGAACCG CACACTCACC TGTTCCACGT CGAGGCGATC 
CTGGAGCGGC CCGGCGCCGC ACCCGTGCTC GCGCTCCCCG TCTGGACCCC CGGCAGCTAT
CTCGTGCGGG AGTTCGCGCG CCACCTCGAG GGCGTGCAGG CCGAGGACGG CCAGGGCCGC
CGTCTCGGGA TCGAGCGACT CGACAAGCAG CGCTTTCGCG TCTCCGCCGG GGACGCGGAG
CGCGTGGTGG TGCGCTACCG GGTTTACGCG AACGAGCTCA CCGTGCGGAC CTGTCACCTC
GACGGCACGC ACGGGTTCCT GAACGGCGCG GCGGTGTTCC TGTTCGCGCC CGGGCGGGAG
GGCGAGCCGC ACGTGGTGGA GGTCTCCCCG CCCGAGGGCT GGGCGGTCTC GACCGCGCTC
CCAGGCGGCC CGACGACGTT CACGGCGCGC GATTACGACG AGCTCGCCGA CTCGCCCTTC
GAGATCGGGC GCCACCGGCT CGTGACGTTC GCCGCGCTCG GCAGGCCGCA CGAGATCGCC
ATCTGGGGCA GGGGGAACCT GGACGAGGCG CGGCTCGCGC AGGACGCGCG GCGCATCGTC
GAGACGCTCG GGGGGCTCAT GGGCGGGCTC CCCTACGAGC GGTACCTGTT CATCCTGCAC
CTCACCGAGA AGCGGCGCGG CGGCCTCGAG CACGCGGCCT CGACCACGCT GAACGTGGGC
CGGATGCAGT TCTTCCCGCG CGACGTCTAC GAGGAGACGC TCGGGCTCTT CTCGCACGAG
TTCTTCCACC TCTGGAACGT GAAGCGCATG CGGCCGGCGG CGCTCGTGCC GTTCGACTAC
GGGCAGGAGC AGTACACGCG GCTCCTCTGG TGGTTCGAGG GGGCGACGTC CTACTACGAG
GGGCTGGCGC TCTCGCGCGC CGGGATCGTC GACGCGAAGA AGCACCTGCG CAACCTCGGT
CGCGCCCTCA CGCAGCTCGA GCGCACGCCG GGAGCGGGCA AGATGAGCGT CGAGGAGTCG
AGCTTCCTCG CGTGGGTGAA GCTCTACCGG CCGGACGAGA ACAGCGTGAA CAGCACCGTC
TCCTACTACC TGAAGGGCGA GCTCGTGGCG CTCGCGCTCG ATCTCGCGCT GCGGCGGGCG
GGCTCCTCGC TCGACGCGCT CCTCCGCGCG CTGTACGCGC GTCACGAGGG ACGGGGCGTG
CCCGAGGACG GCGTCGAGCG GGCCGCGGCG GAGCTCCTGG GCGAAGAGGC CGCGCGCCGC
TTCTTCGATC GCCACGTGCG GGGCACCGAT CCCGTGGATC TCGATCTCGA GGTGGTCGGG
CTGCGGCTGC GCCGCCGCCC GGCGCAGGCG TTCGACGACA AGGGCGGCAC GCCGCCGAAG
GAGGGCGACG AGCGGCCCGC GCCGGGGTGG CTCGGCGTCG AGCTCGCCCC GGGCCCGAAG
CTCCAGGTCG CGGCGGTGCG GGAGGGGAGC CCCGCGCACC GCGCCGGGCT GTACGCGGAG
GACGAGCTGG TGGCCGAGGG CGGCTTCCGG GTCGACCGGG CCGCCCTGTG GGACCGCCTC
TGCGAGAAGG GGCCCGCGGG CGCGCTGCGG CTCACCGTCT TCCGCCGCGA CGAGCTCGTC
GAGGTGGAGG TGCCGCTCGC GGCGTGGCCG GAGGACACGG TCTGGCTCGA GCCCCTCGAG
AACGCCGGTG CCGCCCAGCG GGCGGCGTTC GAGGCGTGGT CGGGGGCGAG GTGGCCGGGG
GTGACCCGCA CGCCCTGA
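
The GC content reported in the Gene Information section (74%) can in principle be recomputed from the gene sequence. The sketch below shows one way to do this, assuming the sequence is pasted as a string in the 10-base blocks used above; `gc_fraction` is a hypothetical helper name, and only the first 20 bases of the gene are used as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and line breaks between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above, as sample input only.
sample = "ATGCGCTACC GGCTCTCGCT"
print(f"{gc_fraction(sample):.0%}")
```

Passing the full gene sequence instead of `sample` gives the value to compare against the GC content field.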
 
Protein sequence
MRYRLSLPEP HTHLFHVEAI LERPGAAPVL ALPVWTPGSY LVREFARHLE GVQAEDGQGR 
RLGIERLDKQ RFRVSAGDAE RVVVRYRVYA NELTVRTCHL DGTHGFLNGA AVFLFAPGRE
GEPHVVEVSP PEGWAVSTAL PGGPTTFTAR DYDELADSPF EIGRHRLVTF AALGRPHEIA
IWGRGNLDEA RLAQDARRIV ETLGGLMGGL PYERYLFILH LTEKRRGGLE HAASTTLNVG
RMQFFPRDVY EETLGLFSHE FFHLWNVKRM RPAALVPFDY GQEQYTRLLW WFEGATSYYE
GLALSRAGIV DAKKHLRNLG RALTQLERTP GAGKMSVEES SFLAWVKLYR PDENSVNSTV
SYYLKGELVA LALDLALRRA GSSLDALLRA LYARHEGRGV PEDGVERAAA ELLGEEAARR
FFDRHVRGTD PVDLDLEVVG LRLRRRPAQA FDDKGGTPPK EGDERPAPGW LGVELAPGPK
LQVAAVREGS PAHRAGLYAE DELVAEGGFR VDRAALWDRL CEKGPAGALR LTVFRRDELV
EVEVPLAAWP EDTVWLEPLE NAGAAQRAAF EAWSGARWPG VTRTP