Gene MCA1467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1467 
Symbol 
ID3102844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1557968 
End bp1559389 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content62% 
IMG OID637170642 
Productserine protease, MucD 
Protein accessionYP_113924 
Protein GI53804465 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.627151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGACC GATACCGTTT TGTCGGCGCC ATTTGTGCGG CGCTGGTGCT GGCCGTTTCG 
CCGCCAGCCC GGGCCCAGCT TCCCGATTTC ACCCAGCTGG TTGAACAGAA CAACGCCGCC
GTCGTGAACA TCAGCACCAC CCAGAAAGTG GCCGCCAACG AACAGCAGAT GCCGGAGGGT
CTGGAGATTC CGGAGGGCAC GCCGTTCGAC GATTTTTTCC GGCATTACTT TGGCGAAGGC
GGGGGTAGCG ACGGCCAGCC GAGCGAGGCG AAGTCCCTGG GTTCGGGTTT CATCATGTCG
GCGGATGGCT ATATCATCAC CAATCACCAC GTCGTGAAAG GGGCCGATGA GATCGTGGTG
CGACTGCAGG ACCGGCGGGA GCTGGTGGCG AAGATCGTTG GGTCCGACAA GCGTAGTGAC
GTCGCCCTGC TCAAGATCGA GGCCAGCCAG CTGCCCACGG TGAAGCTGGG GTCGTCCGAA
AAGCTCAAGG TCGGCGAATG GGTGCTCGCC ATCGGCTCGC CTTTCGGATT CGACCATTCC
GCCACCGCCG GCATCGTCAG CGCCAAGGGG CGCAGCCTTC CCAGCGACAA TTACGTGCCG
TTCATACAGA CCGACGTGGC CATCAACCCT GGCAACTCGG GAGGACCGCT GTTCAATCTG
AACGGCGAAG TGGTCGGCGT CAATTCCCAG ATCTACAGCC GTACCGGCGG CTTCATGGGG
CTGTCCTTCG CCATTCCCAT CGAAGTCGCC ATGCAGGTGG TGGACCAGCT CAAAGCCAGC
GGAAGGGTTT CCCGCGGCTG GCTGGGTGTC CAGATCCAAG ACGTGACGCG AGAGCTGGCC
GAGTCCTTCG ACATGAAGAA ACCACAAGGC GCCCTGGTAT CCAAGGTTCT TTCGAAGAGC
CCGGCCGAAG CGGCCGGCGT CCAGATCGGC GACATCGTGC TGGAATTCAA CGGCCAGGCG
GTGGACACGT CGGCTGCGCT GCCGCCCATG GTAGGCATGA CCAAGGTCGG CGAAGTCGCC
AAAATCAAGT TGTTACGTAA CGGTGCGATC AAGGAGCTGA GTATCAAGAT CGGAGCGCTC
CCCGATGAGG AAGAACCGGC GATGGGTACC GCCGAGCCCG ATGCGGTACC GCTGAAGCGC
ATGGGAGCCA GCGTGGCCGA TCTGACCCCG GAACTGCGCG AGCAGTTCGA GGTGCCACGG
GGCGGGGTGC TCGTCTACGG CGTCAATCCC GGTCCCGCCT ACGAGGCAGG GCTGCGGCGC
GGAGACGTGA TCCTCCGGAT TCAGGACAAG GAAATCAACG GCGTGAAACA ATTGGTAGAG
TTGGAGAAGA CCCTGCCGGC AGGGAAATCG CTGGCGGTGC TGGTGCAGCG GCGCGATGGC
TCCATCTTCC TGGCGATGAA ATTGAAAGAC GAAAAGCAGT GA
 
Protein sequence
MFDRYRFVGA ICAALVLAVS PPARAQLPDF TQLVEQNNAA VVNISTTQKV AANEQQMPEG 
LEIPEGTPFD DFFRHYFGEG GGSDGQPSEA KSLGSGFIMS ADGYIITNHH VVKGADEIVV
RLQDRRELVA KIVGSDKRSD VALLKIEASQ LPTVKLGSSE KLKVGEWVLA IGSPFGFDHS
ATAGIVSAKG RSLPSDNYVP FIQTDVAINP GNSGGPLFNL NGEVVGVNSQ IYSRTGGFMG
LSFAIPIEVA MQVVDQLKAS GRVSRGWLGV QIQDVTRELA ESFDMKKPQG ALVSKVLSKS
PAEAAGVQIG DIVLEFNGQA VDTSAALPPM VGMTKVGEVA KIKLLRNGAI KELSIKIGAL
PDEEEPAMGT AEPDAVPLKR MGASVADLTP ELREQFEVPR GGVLVYGVNP GPAYEAGLRR
GDVILRIQDK EINGVKQLVE LEKTLPAGKS LAVLVQRRDG SIFLAMKLKD EKQ