Gene M446_0736 details

Gene Information

Locus tag: M446_0736
Symbol:
ID: 6130437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 842548
End bp: 845475
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 69%
IMG OID: 641641054
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001767729
Protein GI: 170739074
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
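
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 842548
end_bp = 845475
gene_length_bp = 2928
protein_length_aa = 975


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2928
print(expected_protein_length(gene_length_bp))  # 975
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption.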


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATG CGGGCGAGGC GGATCTCGGG GCCGCCGATG CGGGATTCGC GACCACGGCG 
GCGCATAGCG AGCGCGACGT CGCCCAGGGC GTCACGCGCT GGTTGATCGC CAATTTCCGA
TCCGAACTCG CGGACGCGAC GCGATCGACG CCGTTCACGG TGCCGCTGCT GTGCGCCATC
GCCTGCCGCG AGGCCGGCAT GTACTGGCTC CCGCTGACGC CGCACAGGGG GGGCGCGGAG
ATCCTCGGTC TGTGCGTGTA CGACGCCAGC GGCGACGTGG CCGGCGCGCC CCGAACCGCG
TTCCCGATCA ATACCGCTCA GTTCAGACTC ACGTATGGGG ATGCATTCAC GCAGCTCCTG
ATCGCCGAAA CGAACAAGGC GAGGGCCGCG CGCGGCCTCA GCCCGGCCTC GATGATTTAC
AAAGGTTACG GTATATTTCA GTACGATCTG CAGCATGTCC GCACGGACGA AGCGTTCTTT
CGTCAGAAGA AATGGTATGT CTTCGAAGAG TGTGTGAGCA GAGTCGTATC TGAGCTTACA
AGCAAGTATG AAGCGACCGG GAACATTCAG GAAGCTGTCC GCGCTTACAA CGGATCTGGC
CAGAAAGCCC GTCAATACGC TCAGGATGTG ATGAGATTGC TGCCATATTG TGAGGAGGCG
GCGGGTTCTC CACAGATTGC ATCGCTCGCC TCTGCATCGC TCTCCTCTGC GGCGCGCGAT
GGCGGGGCGG TCCTCGGCGC GATGGATCCG CAGGACGCCT CGGACGACGA CCCGGGCGCC
CCCGCTCCGA CCGACGTCAC CGAGGTCGCG GACGAGGACA CCGCCCGCCT CCTCGCCAAT
CTCGGATACT CGGTCGATGC GGAGAGTGCC GCGCCGGCGG AACCGGGCGT CGCGGCCGTC
GGCGACACGG CCTTCGACCT CGCGCGGGCG CGGGCCTTCC TCGACGCGTG CCGGACCGCG
AGGCCGCGCG TCACCTACGG CCTCGGGCAG AAGGTGCCGT TCCTCGATGC CGTCCCGGGA
CGCGACTTCA CGCAGGTCGA TTGCAGCGGT TTCGTCCGAC AGGTCGTCCG GCTCGCCACG
ACCCCGTCGC TGCGTTTCCC CGACGGCTCG GTCAATCAGC ACCAATGGGC GCGCGCGAGG
GGTTTGGAGA CCTCGTCCGT GGCGGAGGGC AGGGCCACGG ACGACGTGGT CCGGATCGCG
TTCCTGCGGC CGCAGGACGC CGGTCGCAAG AGGATCGGGC ACGTCGTCCT GATCGCCAAC
GGCGAGACGC TCGAATCCCA CGGCGGCGTG GGTCCGGATT CGCGGCCCTG GACGGGGACC
GGCTGGCAGG CGAAGGCCTT CGTCTACGTC CTGGCTCGCG ACGCTCGGAT CCGCGCGGCG
GCTTCGGCGG AGCGGCTCGC GGCGGCGCGG ACCGAGTCCG TGGCCAAGCC CAGCATCGTT
CGGACCCTCG AGAGCGCGAC GATGCGCCGG ACCACGGCCA GCATCTTCAC CACGCAGGCT
CTGCCGCAGC ACCACGACGA CATCCTCGTC GTGACCATGC GGCCCGGACC GGCGGAGGCT
CCCGCGGCGG CGGGGATGGC GATGGGGATG GGGATGGCGC CGGCGCCGGA GACGCCCGGC
CTGGGTGCGC TGTCGTATTT CGCCCGCGCG GGACGCATCA AGCGGGTCGT TCCGCTGCGC
GCGAGCGAGG AGGCGGTGAC GGCGCCGTCC CCGATGGCCG CGGCCGCCGC GATGATGGGG
TTCCACCGTC CCGCCGGAGC ACCCGACGTC GGCGCCCCGG TCCGCTTCAT CGAGATGATG
GACGGTCAGG ACGCGAAGCA GCTGCACGGC GCTCTGGCGA GCGACCCGAG CGTTCTCTCG
GTGTCGCAGG TTCCGGTCCG CTACCTCGCC GCGCGGCGCG CGGGGCGCAC GGCCGCCGGC
GGCGGTCTCG GGATCGCCGC GGCGCCGCCC GCGGCATCGC TGCTCTGGAA CCTCGCCAAG
ATCCGCTGGC AGGAGGCGCG TGCGGCTGCC GGCTTCCAGG AGGCAACCCG GGTGAGGGTC
GCGGTGCTGG ACACCGGCGT CGATGCCAAG CACCCGAGCC TGCGGGTGTC CAATTATTAC
TGGCAGAACG CTGATCTGAC GCGGCCCGTC TCGGAGCTCG ATCTGATCGG GCACGGGACG
CATGTCTCGG GAACGATCGC TGCGCTGATC GCCAGCGGCG TCTCGGTTCA AGGGGTGTGT
GCCTGCCAAC TTGATGTCTG GAAGATCTTC GATGATGAGC CGACCTACGC GCCCGGCCAA
GGAGCTTTCG TCTATTACGT GAATCCGATC CTGTACCGTC GTGCCTTGGC CGCGTGCGTG
GACGACCCGC CCCACGTCGT GAACCTGAGC ATCGGCGGGC CGGCCGTTCC CGATCCGACC
GAGCGGACCC TGTTCGAACA GCTGCTCGCG TCCGGCGTGA CGATCTGCGC GGCGATGGGC
AATGATCGCC AGTATGGCAG CCCGACCTCG TACCCGGCCG CGATACCGGG GGTCGTCGCG
GTCGGCGCGA CGGGGCTGGA CGACAGGGTG ACGCTCTTCT CGAACAGCGG AAACCACATC
GCGGTCGCGG CGCCCGGGAA GGCCATCTGG TCGACGCTCC CCCGGTATGA CGGCCAGACC
GCCTTCGGCA TCGCGTACGG CCCGGACGGG CGGCCGCAGC CGGGAGCCAG GGTCCGCCGC
GAGTGCAACT ACGACGCCTG GGATGGAACT TCCATGGCAA CACCCCATGT GACGGGGAGC
GCGGCGCTCC TGATCGCCAA GAGCATCGCT GCCGGTGGCG AACTCAAGCC CGATCAGGTG
AGAGCCGCCC TGATGACGTC GGCCGACAAG GTGCAGGCGA TGAATGGAGC GGATTTCAGC
GCCGACTACG GTGCGGGGCG GATCAACCTG CTCAAATTGT TGCAATGA
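
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(text):
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = "ATGACGGATG CGGGCGAGGC"
joined = clean_sequence(example)
print(joined)               # ATGACGGATGCGGGCGAGGC
print(gc_fraction(joined))  # 0.7
```

Passing the full gene sequence text through the same two functions gives a value that can be compared with the GC content field.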
 
Protein sequence
MTDAGEADLG AADAGFATTA AHSERDVAQG VTRWLIANFR SELADATRST PFTVPLLCAI 
ACREAGMYWL PLTPHRGGAE ILGLCVYDAS GDVAGAPRTA FPINTAQFRL TYGDAFTQLL
IAETNKARAA RGLSPASMIY KGYGIFQYDL QHVRTDEAFF RQKKWYVFEE CVSRVVSELT
SKYEATGNIQ EAVRAYNGSG QKARQYAQDV MRLLPYCEEA AGSPQIASLA SASLSSAARD
GGAVLGAMDP QDASDDDPGA PAPTDVTEVA DEDTARLLAN LGYSVDAESA APAEPGVAAV
GDTAFDLARA RAFLDACRTA RPRVTYGLGQ KVPFLDAVPG RDFTQVDCSG FVRQVVRLAT
TPSLRFPDGS VNQHQWARAR GLETSSVAEG RATDDVVRIA FLRPQDAGRK RIGHVVLIAN
GETLESHGGV GPDSRPWTGT GWQAKAFVYV LARDARIRAA ASAERLAAAR TESVAKPSIV
RTLESATMRR TTASIFTTQA LPQHHDDILV VTMRPGPAEA PAAAGMAMGM GMAPAPETPG
LGALSYFARA GRIKRVVPLR ASEEAVTAPS PMAAAAAMMG FHRPAGAPDV GAPVRFIEMM
DGQDAKQLHG ALASDPSVLS VSQVPVRYLA ARRAGRTAAG GGLGIAAAPP AASLLWNLAK
IRWQEARAAA GFQEATRVRV AVLDTGVDAK HPSLRVSNYY WQNADLTRPV SELDLIGHGT
HVSGTIAALI ASGVSVQGVC ACQLDVWKIF DDEPTYAPGQ GAFVYYVNPI LYRRALAACV
DDPPHVVNLS IGGPAVPDPT ERTLFEQLLA SGVTICAAMG NDRQYGSPTS YPAAIPGVVA
VGATGLDDRV TLFSNSGNHI AVAAPGKAIW STLPRYDGQT AFGIAYGPDG RPQPGARVRR
ECNYDAWDGT SMATPHVTGS AALLIAKSIA AGGELKPDQV RAALMTSADK VQAMNGADFS
ADYGAGRINL LKLLQ