Gene M446_4223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4223 
Symbol 
ID6135643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4660907 
End bp4662229 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content68% 
IMG OID641644367 
Productcarboxyl-terminal protease 
Protein accessionYP_001771006 
Protein GI170742351 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.502517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0644787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAG TTTCCCTCGT CCTGTTGGGT GCGATCCTGG GCGCCGGGAC CGCCACGGTG 
GCGACCCAGA CCCACTTCCT CTCGGGCACC AGCGCGGTAG CGGCCTCCGC CGAAACGTAC
CGTCAGCTCA GCCTGTTCGG CGACGTCTTC GAGAAGATTC GCACCGACTA CATCGAGAAG
CCCGAGGAAT CGAAGCTGAT CGAGGCGGCC GTCAACGGCA TGCTGACCTC GCTCGACCCG
CATTCGAGCT ACATGGATGC GAAGAGCTTC CGCGACATGC AGGTGCAGAC CCGCGGCGAG
TTCGGCGGCC TCGGGATCGA GGTGACGATG GAGGACGGCC TCATCAAGGT CGTCACGCCG
ATCGACGACA CGCCCGCCGC CCGCGCCGGC CTGCTCGCCA ACGACATCAT CACCCAGATC
GACAACGAGC AGGTCCAGGG CCTGACCCTC AACCAGGCCG TCGAGAAGAT GCGCGGCCCG
GTCAACTCGC CGGTGAAGCT CAAGGTCACC CGCAAGGAGG TCAAGGAGCC GCTCGAGATC
ACGCTGAACC GCGACCTCAT CCGCATCAAG CCGGTGCGCT CGCGGGTCGA GGGCGGCGAC
GTCGGCTACA TCCGGCTGAC CCAGTTCAAC GAGCAGACCT TCGACGGGCT GAAGGCGGCG
ATCGACAAGA TCTCCACCGA CGTGCCGAGC GACAAGCTCA AGGGCTACGT CCTCGACCTG
CGCAACAACC CGGGGGGCCT GCTCGATCAG GCCGTGATGG TCTCGGACGC GTTCCTCGAC
CGCGGCGAGA TCGTCTCCAC CCGCGGCCGC AACCCGGACG AGACGCAGCG CTTCTCGGCC
AAGTCCGGCG ACCTGACCAA GGGCAAGCCG ATCGTGGTGC TGGTCAACGG CGGCTCGGCC
TCGGCCTCCG AGATCGTCGC CGGCGCCCTG CAGGACCACA AGCGCGCGAC CGTGCTCGGC
ACGCGCTCCT TCGGCAAGGG CTCGGTGCAG TCGATCATCC CGCTCGGCGG GAACGGCGCC
CTGCGCCTGA CCACGGCGCG CTACTACACG CCGTCCGGCC GCTCGATCCA GGCCAAGGGC
ATCGAGCCGG ACCAGGAGGT GCTGCAGGAG GTGCCCGACG ACCTGAAGGG CAAGGACGAG
ACCAAGGGCG AGGCCGGCCT GAAGGGCCAC CTCAAGCAGA AGGACCAGGA AGAGCGCGGC
GGCTCCTCGG CCTACGTCCC GCCGGATCCG GCCAAGGACA AGCAGCTCAC CGCCGCGGTC
GACCTGCTGC ACGGCGTGCA GAAGGGCGCC GCCAACACCG CCAAGAACGG CGTCCCGAAC
TGA
 
Protein sequence
MRKVSLVLLG AILGAGTATV ATQTHFLSGT SAVAASAETY RQLSLFGDVF EKIRTDYIEK 
PEESKLIEAA VNGMLTSLDP HSSYMDAKSF RDMQVQTRGE FGGLGIEVTM EDGLIKVVTP
IDDTPAARAG LLANDIITQI DNEQVQGLTL NQAVEKMRGP VNSPVKLKVT RKEVKEPLEI
TLNRDLIRIK PVRSRVEGGD VGYIRLTQFN EQTFDGLKAA IDKISTDVPS DKLKGYVLDL
RNNPGGLLDQ AVMVSDAFLD RGEIVSTRGR NPDETQRFSA KSGDLTKGKP IVVLVNGGSA
SASEIVAGAL QDHKRATVLG TRSFGKGSVQ SIIPLGGNGA LRLTTARYYT PSGRSIQAKG
IEPDQEVLQE VPDDLKGKDE TKGEAGLKGH LKQKDQEERG GSSAYVPPDP AKDKQLTAAV
DLLHGVQKGA ANTAKNGVPN