Gene M446_5887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5887 
Symbol 
ID6133125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp6475852 
End bp6477366 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content73% 
IMG OID641645996 
Productprotease Do 
Protein accessionYP_001772608 
Protein GI170743953 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.685816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC CCGTTCCCGC CTCCCGTCTC CATCGCAAGG CGCTCGCGTC CGTGGCCGCC 
GTGACGCTCG TGGCGACGGG CGCGCTCGGC TCCGCCTTCC TGCCGCCGAG CACGCCGGTC
CTCGCCCAGG CCCTGCCGCA GACCCCGATC ACCGCCCCCG AGCACCCGCC GGGCAGCTTC
GCGCCGATCG TCAACCGGGT GAAGCCCGGC GTCGTCTCGG TGAAGGTCAA GCTCAAGGAC
GACGCGGCGG ATGACGAGGA GGGCGGCCCC GGCGGCCAGA ACGTGCCGCC GCAGCTGCGC
GAGTTCTTCC GCCGCTTCGG CGAGAACGGC ATGCCCAACC GGCCGCACCG CAACGGCGGC
CGCGCCGCGC AGGGCTCGGG CTTCTTCATC TCGGCGGACG GCTACGTCGT CACCAACAAC
CACGTGGTCG AGAACGCCAA GTCGGTGGAG GTCACCCTCG ACGACGGCCG CACCCTCGAC
GCCAAGGTGG TCGGCACCGA CCCGAAGACC GACCTCGCGC TGCTCAAGGT CACGGAGGGC
AACGGCTCCT TCCCCTACGT GAGGCTCGCG CACGGCGCGC CGCAGGTCGG CGACTGGGTG
GTGGCGATCG GCAACCCCTT CGGCCTCGGC GGCACCGTCA CGGCCGGCAT CGTCTCGGCC
CGCGGCCGCG ACATCGGCGC CGGGCCCTAC GACGACTTCC TGCAGATCGA TGCGCCGATC
AACAAGGGCA ATTCGGGCGG CCCGACCTTC AACGTGTCGG GCGAGGTCGT CGGCGTGAAC
ACTGCCATCG CCTCGCCGTC GGGCGGCAAT GTGGGCCTCG CCTTCGCCAT CCCCTCCGAG
ACGGTGCAGG CGGTCGTCGA CCAGCTGCGG ACGGACGGCA AGGTCGCCCG CGGCTATCTG
GGCCTCCAGA TCCAGCCCGT CACCAAGGAC ATCGCCGAGG GGCTCGGCCT CGACAAGGCG
AAGGGCGCGC TCGTCACCAG CGCCCAGGAC GGCACGCCGG CCGCCAAGGC GGGCCTGAAG
TCCGGCGACG TGGTCCAGGC GGTGAACGGG GATCCGGTCG GCGACGCGCG CGAATTGTCG
CGCCGGATCG CCTCGATGAA GCCGGGCACC AAGGTCCAGC TGTCCTACCT GCGCGGCGGC
AAGACCGACA CCGCGACGGT CGAACTCGCG ACCCTGCCGA ACGACACCCG GGTCGCGGCC
CGGGAGGAGC GCGGGCGGGG CTCGGACGCG CAGCCGCGGC TCGGCCTGAG CCTCGCGCCC
GCCGACGCGG TGGGCGCCGG CCAGGAGGGC GTGGCGGTGG TGAACGTCGA TCCGGACGGC
CCGGCGGCGG CCAAGGGCAT CGAGCCCGGC GACGTCATCC TCGACGTCGG CGGCCAGCCG
GTCTCCTCGG TCTCCGACGT GCAGGGCCGG ATCCGGGCCG CCGAGCGCGA CGGCCGCAAG
GCCGTGCTGA TGCGGGTGAA GAGCGACAAG GGCACGCGCT TCGTCGCCAT CGCCCTCCAG
AACCGCAACG GCTGA
 
Protein sequence
MSEPVPASRL HRKALASVAA VTLVATGALG SAFLPPSTPV LAQALPQTPI TAPEHPPGSF 
APIVNRVKPG VVSVKVKLKD DAADDEEGGP GGQNVPPQLR EFFRRFGENG MPNRPHRNGG
RAAQGSGFFI SADGYVVTNN HVVENAKSVE VTLDDGRTLD AKVVGTDPKT DLALLKVTEG
NGSFPYVRLA HGAPQVGDWV VAIGNPFGLG GTVTAGIVSA RGRDIGAGPY DDFLQIDAPI
NKGNSGGPTF NVSGEVVGVN TAIASPSGGN VGLAFAIPSE TVQAVVDQLR TDGKVARGYL
GLQIQPVTKD IAEGLGLDKA KGALVTSAQD GTPAAKAGLK SGDVVQAVNG DPVGDARELS
RRIASMKPGT KVQLSYLRGG KTDTATVELA TLPNDTRVAA REERGRGSDA QPRLGLSLAP
ADAVGAGQEG VAVVNVDPDG PAAAKGIEPG DVILDVGGQP VSSVSDVQGR IRAAERDGRK
AVLMRVKSDK GTRFVAIALQ NRNG