Gene M446_6607 details


Gene Information

Locus tag: M446_6607
Symbol:
ID: 6135887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 7270723
End bp: 7272798
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 75%
IMG OID: 641646698
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001773297
Protein GI: 170744642
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
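
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(7270723, 7272798)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.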


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0858273
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCA CGCCCGATCC CCGCCCGACC CTCGCGGCGC CCGACGACGA TCCCTACCTG 
TGGCTGGAGG AGATCGACGG GGCGCGGGCC CTGGCCTGGG TCGAGGCGCA GAACGCCGCC
ACCCTCGAGG CGCTCGCGGA CGGGCGCCTC GCCGCCGACC GGGACGGGCT CAAGGCCGCC
CTCGACCGGC CCGACAAGAT CCCGGGCGTC ACGCGGCGGG GCGGGCTGCT CTACAATCAC
TGGCAGGATG CCGACCATCC CCGCGGCCTG TGGCGGCGCA CCACCCTCGC CTCCTACCGG
GCGCCGGACA CGGAGTGGGA GCTCCTCCTC GACCTCGACG CCCTCGCCCG CGAGGAGGGC
GAGGACTGGG TCTGGGCCGG GGCCATCAGC CTGCCGGGAT CGCACGACCG GGCGCTGCTC
AAGCTCTCCC GCGGCGGCGG CGACGCCGCC GTGGTGCGCG AATTCGACCT GCCCTCCCGC
GCCTTCGTGC CGGACGGCTT CGTGCTGCCG GAGGGCAAGA GCTATCCGGC CTGGCTCGAC
CGCGACACGG TGCTGCTGGC GAGTCCCCTC GGTGAGGGCA TGGCGACCCT GTCGGGCTAC
GCCCGCACCG TCCGGCTCTG GACCCGCGGC GGCGATCCCC TGGCGGCGCC GGTCATCTTC
GAGGCGCCGC CCGAGAGCAT GGCGGTCCAT GCCAGCCACG ACCGGGAGGC GGCGCCCGAG
CGCGTCGTCT TCGTCGAGCG CACCGGCTTC TTCGACGGCG TGACCCATCT CGGCGACCGG
TCCGGCGCCA AGATCCGCCT CGACCTGCCG ACCGATGCCG ACGCGCAGTG GAGCCGGGGC
GTCCTGGTCG TGCGGACCCG CTCGCCCTGG ACCCTCGGCG GCACGACCCA CCCGCCCGAC
ACCCTGCTCG GCATCGGCCT CGACGCCTTC CTGGCCGGCG CGCGCGATCT CCGCGTGCTG
TTCGAGCCCG GTCCCCGGCG GGCGCTGCAG GGCTTCTTCT GGTCCGGCCC CTTCCTCGTC
CTGTCGGTGC TCGACGACCT GCGGGCGCGG TTCCCGGTCT TCCGGCCGGA CGAGGACTGG
GCGCGCGGCG AGGTCGGGGG CCTGCCGGAA CTCGGGATGG TCGGCGTCTG GTCCCTCGAC
GCCGAGGAGG ACGAGGCGAA CGGCGACCTC CTCGCCGCCG CCAACGACCC GGTCACGCCG
GCGACCCTGA TGCTGACCCG GCCCGGCCCC GGCGGGCCGA CGATCCTGCG GCAGGCGCCC
GCCACCTTCT CGGCCGAGGG GCTGGTGGTG ACCCGGCACG AGGCGGTCTC GGTCGACGGC
GAGCGCATTC CCTACGTGCA GGCGGGGCCG CCGGGCGAGA CCGGCGAGGC GCCGGTCCAC
CTCTCGGGCT ACGGCGGCTT CCAGGTCTCG AACCTCGCCG GCTACTCGGC GGTGCTCGGC
CGGCTCTGGC TGGAGAAGGG CGGCACCCGC GTGGTGGCCA ACATCCGCGG CGGCGGCGAG
TTCGGCACGA CCTGGCACGA GGCCGGCCGC CGCGAGGGCA AGGCGCGCTC GCACGACGAT
TTCGCCGCGG TCGCGGCCGA CCTCGTGCGC CGCGGCGTGA CCCGGCCCGA CCGGATCGCC
GCCGAGGGCG GCTCGAATGG CGGCCTGCTC GTCGCCAACA TGCTGACCCG CTACCCGGAG
CGGTTCGGGG CGCTGCTCTG CACGATCCCC CTCATCGACA TGCGCCGCTA CCACCGGCTG
CTCGCCGGGG CGAGCTGGGT GGCCGAGTAC GGCGACCCGG ACGCGGCGGA GGATTGGGCC
TTCCTCCGGC ACATCTCCGC CTACCACGTC GCCGCGCCCG GGCGGCCCTA CCCGCCGATC
CTGATCGCCA CGACGCGGCG GGACGACCGC GTCCATCCGG GCCACGCCCG CAAGATGGCG
GCGAAGCTGC AGGCCATGGG CTATCCGGCC CGCTTCTACG AGCCGGAGGC GGGCGGGCAT
TCCTACGGCA AGAACAGCCA GGAGACCGCG ACCTTCGCGG CGCTCGGGGC GGCCTTCCTG
CGGCGCGCCA TCGGCTGGGA GCCGGAGGTG GCCTGA
 
Protein sequence
MTPTPDPRPT LAAPDDDPYL WLEEIDGARA LAWVEAQNAA TLEALADGRL AADRDGLKAA 
LDRPDKIPGV TRRGGLLYNH WQDADHPRGL WRRTTLASYR APDTEWELLL DLDALAREEG
EDWVWAGAIS LPGSHDRALL KLSRGGGDAA VVREFDLPSR AFVPDGFVLP EGKSYPAWLD
RDTVLLASPL GEGMATLSGY ARTVRLWTRG GDPLAAPVIF EAPPESMAVH ASHDREAAPE
RVVFVERTGF FDGVTHLGDR SGAKIRLDLP TDADAQWSRG VLVVRTRSPW TLGGTTHPPD
TLLGIGLDAF LAGARDLRVL FEPGPRRALQ GFFWSGPFLV LSVLDDLRAR FPVFRPDEDW
ARGEVGGLPE LGMVGVWSLD AEEDEANGDL LAAANDPVTP ATLMLTRPGP GGPTILRQAP
ATFSAEGLVV TRHEAVSVDG ERIPYVQAGP PGETGEAPVH LSGYGGFQVS NLAGYSAVLG
RLWLEKGGTR VVANIRGGGE FGTTWHEAGR REGKARSHDD FAAVAADLVR RGVTRPDRIA
AEGGSNGGLL VANMLTRYPE RFGALLCTIP LIDMRRYHRL LAGASWVAEY GDPDAAEDWA
FLRHISAYHV AAPGRPYPPI LIATTRRDDR VHPGHARKMA AKLQAMGYPA RFYEPEAGGH
SYGKNSQETA TFAALGAAFL RRAIGWEPEV A
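
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, together with a GC-fraction calculation for the nucleotide sequence. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the short input string is a toy example, not data from this record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout as the record (not real data).
toy = join_blocks("ATGC GGCC")
print(len(toy), gc_fraction(toy))
```

Applied to the full gene sequence, the joined length would be expected to match the Gene Length field, and the GC fraction to be close to the reported GC content, provided the sequence was copied without loss.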