Gene Anae109_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3343 
Symbol 
ID5375251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3906583 
End bp3907827 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content72% 
IMG OID640844856 
Productpeptidase M24 
Protein accessionYP_001380511 
Protein GI153006186 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTCC CCGCCGCCGA GGAGAAGCTC CGGAACGCCG ACACCGAGCA CCTGTTCCGG 
CAGGACAGCG ACTACCACTA CGTCGTCGGG CTCGACGAGC CGGAGGGCTG CGCGGTCCTC
CTCGCCGCGC CGTCGGGAGA GGTGAAGCTG GTGCTCTTCG TGCGTCCGCG CGACCGGGAG
AAGGAGATCT GGACCGGGCG GCGCGCCGGG GTCGAGGGCG CCAAGGAGCG CTACGGCGCC
GACGAGGCGT ACACCGTGGC GGAGATGGAC GAGAAGCTGC CCGCGCTCCT CGAGGGCGCG
GAGACGCTCT GGTTCCGCCT CGGCGAGGAC GCCGGCTGGG ACGCGCGGGT CGCGCGAATC
CTCCGCGGGC TGCGCGGCGG CGCTCGCCTC GGCAAGCACC CGCCGCGCGC GATCGTGGAG
CCCGGCCGCG TGCTGCACGA GCAGCGGCTC GTGAAGTCGC CGGAGGAGCT GAAGCGGCTC
CGCAAGGCCG CCGAGATCAC CGCCGAGGCG CACATGGCGG CGATGCGCGA CGGCCAGCCC
GGCCGCCGCG AGCACCAGGT GCAGGCGGAG ATCGAGTACG CCTTCCGCCG CCGCGGCGGC
TCCGGTCCCG GGTACGGCAC CATCGTCGCC ACCGGCGCGA ACTCGACCAT CCTCCATTAC
CGCGCCGGCC CCGACGTCCT GAAGGACGGC GACGTGTGCC TGGTGGACGC GGGCGGCGAG
TACGACTTCT ACACGGCCGA CGTGACGCGT ACGTTCCCGG TCTCGGGCGA CTTCACGAAG
CCGCAGCGCG TGCTCTACGA GCTGTGCCTC GACGTGCAGA AGCAGGCCAT CGAGGCGGTG
AAGCCCGGCA CGACGCTCGA CGCCATCCAC GATCTCGTGG TGCGCAAGCT GACCGAGGGC
TTCATCTCCC TCGGCCTGCT CCAGGGGAAC GTCGAGGAGC GCATCGCCGA CAAGTCGTTC
CGCAAGTACT ACATGCACCG GACCAGCCAC TGGCTCGGCA TGGACGTCCA CGACGTGGGC
GACTACTACG TGGACGGCAA GCCCCGCCCG CTCGTCCCCG GCATGGTGCT CACCGTCGAG
CCCGGCATCT ACGTCGCCGA GGACGACGAG ACCGCGCCCC CCGAGATGCG CGGCGTCGGC
ATCCGCATCG AGGACGACGT GCTCGTGACG CCCGAGGGCC ACGAGAACCT GACCGCCGCG
GTGCCGAAGG AGGTCGCGGA GGTCGAGGCG GTCTGCGTGA GGTAG
 
Protein sequence
MLLPAAEEKL RNADTEHLFR QDSDYHYVVG LDEPEGCAVL LAAPSGEVKL VLFVRPRDRE 
KEIWTGRRAG VEGAKERYGA DEAYTVAEMD EKLPALLEGA ETLWFRLGED AGWDARVARI
LRGLRGGARL GKHPPRAIVE PGRVLHEQRL VKSPEELKRL RKAAEITAEA HMAAMRDGQP
GRREHQVQAE IEYAFRRRGG SGPGYGTIVA TGANSTILHY RAGPDVLKDG DVCLVDAGGE
YDFYTADVTR TFPVSGDFTK PQRVLYELCL DVQKQAIEAV KPGTTLDAIH DLVVRKLTEG
FISLGLLQGN VEERIADKSF RKYYMHRTSH WLGMDVHDVG DYYVDGKPRP LVPGMVLTVE
PGIYVAEDDE TAPPEMRGVG IRIEDDVLVT PEGHENLTAA VPKEVAEVEA VCVR