Gene M446_4143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4143 
Symbol 
ID6133012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4587005 
End bp4588450 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content73% 
IMG OID641644294 
Productpeptidase S49 
Protein accessionYP_001770934 
Protein GI170742279 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.437804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTATC ACCGCATCGC CGGCCGGTTC TACAACCGGC CGCTTCTCGT CGCGCCCGCG 
ACCGCCGAGA CGATCTCGGC CTTCCTCCTC TCGCGCATGT CCGCCGGCCC GGCCGCGGGC
GGCAACGTCG GCGGGATCGA GCACGACGCG GGCGAGAGCC TGCAGATCTT CCGCGGGCAC
GAGCGCGCGG ACGGCTCGGT CGAGGTCCAC ACCCCGCGCG CCAGCCGCTT CTACGGCGAT
TACCCCCTGG CCGAGGATGG TTCGAAGCGC CCGCTCCCCT TCCGCCGGAC CGCCGAGGGC
GTCGCCATCC TCACCCTCGT GGGCGAGTGG GTGAACCGGG GCGGCTGGGT CGGCGCGTCC
TCGGGGCTCA TCTCCTACGA GGGCTTCGCC TACCAGATGC GGATGGCCGC CGCGGACCCG
CGGACGAAGG CGATCCTGCT CGACCTGGAG AGCCCCGGCG GCGAGGCGGT CGGCGCCTTC
GAGGCGGCCG AGCTCGTCCG GCAAGTGGCG TCCCAGAAGT CCGTGACCGC GCTCGTCAAC
GGCATGGCCT CCTCGGCTGC CTACGCGATC GCCTCCGGCG CCAGCCGGAT CGTCTCGATC
CCGACCGGCC TCGCCGGCTC GATCGGCGTC GTCCTGATGC ACCTCGACAT CAGCGAGTAC
CTGCGGGCCG AGGGGATGAA GCCGACCCTG ATCTTCGCGG GCGACCACAA GGTCGACTGG
AACCCCTTCG AACCCTTGCC CGACGCGGTC CGCGCCGACC TCCAGAAGGA GGTTGAAGGC
TTCTACGCGA AGTTCGTCAC GACGGTCGCG GCCGGGCGGC CGGGCCTCAG CGAGCAGGCG
ATCCGGGACA CCGAGGCCCG CACCTTCATG GGCGAGGAGG CCATCAAGGC GGGCCTCGTC
GATGCGATCG GCACCTTCGA CGCGGTGCTG GCCGACCTTT CCGCCGCGCC CGCAGCCGGG
CGCTCCTTCC CGTCGCGACC CACTGGAGCT TCCATGTCCG ACAACACCCC CACGCCCGGC
GCCTCTGCGG GCTTCACCCA GTCCGATCTC GACACCGCCC GCGCCGAGAG CTTCGCGGCC
GGCAAGGCCG AGGGCCTGAC CGAGGGCACC AAGGCCGGTG CCAGTACCGA GCGGGAGCGC
ATCGGCGCGA TCCTCGGCAG CGACGAGGCC AAGGGCCGCG AGGCGAGCGC CCGCCACCTG
GCGCTCTCCA CCGACCTGTC GCTGGACGCG GCCAAGGGCG TCCTCTCGGG GCTCCAGGCT
GCGGCCCCGG CGGGCCCCAG CCTCGACGCC CGGATGGCCG GGCGCGCCGA CCTGAACCTC
GACCCCGACA CGCCCCCGGC CGCTCAGACG GACCCCAAGG CGGACGCCGC GGCGTCCTGG
GACGAGATCG TGGCCGGGAT GAACGCGCGC CTGCCGGCAG CGGCCCGCAT CCCGGGCGTC
CGCTAA
 
Protein sequence
MHYHRIAGRF YNRPLLVAPA TAETISAFLL SRMSAGPAAG GNVGGIEHDA GESLQIFRGH 
ERADGSVEVH TPRASRFYGD YPLAEDGSKR PLPFRRTAEG VAILTLVGEW VNRGGWVGAS
SGLISYEGFA YQMRMAAADP RTKAILLDLE SPGGEAVGAF EAAELVRQVA SQKSVTALVN
GMASSAAYAI ASGASRIVSI PTGLAGSIGV VLMHLDISEY LRAEGMKPTL IFAGDHKVDW
NPFEPLPDAV RADLQKEVEG FYAKFVTTVA AGRPGLSEQA IRDTEARTFM GEEAIKAGLV
DAIGTFDAVL ADLSAAPAAG RSFPSRPTGA SMSDNTPTPG ASAGFTQSDL DTARAESFAA
GKAEGLTEGT KAGASTERER IGAILGSDEA KGREASARHL ALSTDLSLDA AKGVLSGLQA
AAPAGPSLDA RMAGRADLNL DPDTPPAAQT DPKADAAASW DEIVAGMNAR LPAAARIPGV
R