Gene MCA1599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1599 
Symbol 
ID3103410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1706633 
End bp1708147 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content66% 
IMG OID637170767 
Productserine protease, MucD, putative 
Protein accessionYP_114049 
Protein GI53804357 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.210077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCATCC AGCGGCAAAC GCCGCTGATT CGTGAGGAAG AAACCGCCAT GCCAAAAAAC 
GTCATGCTTT CCGCCCTGAT CGCCGCCTCC ATCCTCGCCG CCATGGGCGC TGGATATGTC
CGCTACGGCG GCTACACCCT GGTGCCCAAA TCGGATCTCA TGCAGCCGCC CCCGCAGACG
CCCCAACTGC CGGAGGCCGC AGCCCGTGCC ACGCCGTCTC TGCCCGATTT CGCCGCCATC
GTCCGGCGTA ATGGACCGGC GGTCGTCAAC ATCAGCGTCA CCGGCACCAG CCGGGTCAAT
CTGCCGGAGA TGCCCCAGTT CGACCCCAAT GATCCGTTCG GGGAATTTTT CCGGCGGTTC
CAACCGCAGA TTCCCCGTGG CGAGGATGCT CCCGCGCACG GACTGGGCTC GGGATTCATC
ATCCGTCCCA ATGGCCTGAT CCTGACCAAC GCCCATGTGG TCAACGGCGC CCAGGAAGTC
ACGGTGAAGT TGAATGACCG GCGCGAATTC AAGGCCCGGA TCATCGGTAT CGACAAACCC
ACCGACGTCG CCCTGCTCAA GATCGAGGCC GACGGTCTGC CGGTGGTGCC GCTGGGAGAT
CCCGCGCGAT CCGGGCCCGG CGACTGGGTG GTGGCGATCG GCTCGCCGTT CGGTTTCGAA
AACAGCGTGA CGGCGGGCAT CATTTCGGCG AAATCGCGCA GCCTGCCCGA GGAAACCTAT
GTGCCGTTCA TCCAGACCGA TGTCGCCGTC AATCCGGGCA ACTCCGGCGG CCCGCTGTTC
AATCTGAGCG GCGAAGTCAT CGGCATCAAC TCGCAGATCT ACAGTCGCAC CGGCGGCTAC
CAGGGCCTGT CGTTCGCCAT CCCCATCGAT GTCGCGCTGA AGGTCGAAAA GCAGCTGCTG
GCGGACGGCA AGGTCAGCCG AGGACGGCTC GGCGTCGGCA TCCAGGAATT GAACCAGTCA
CTGGCCGAAT CCTTCGGGCT GGATCGCCCG ACGGGCGCCC TGGTCGATTC GGTGCCCAAC
GACGGGCCGG CGGCCAAGGC CGGTATCAAG CCGGGCGACG TCATCCTGAG CCTCAACGGC
CAGCCGATCG AAAATTCCGG CCAATTGCCG CCACTGGTTG CGGACATCAA GCCAGGCAGC
GAAGCGAAGG TGGGCATCTG GCGTAACGGC AAGCGCGAGG AGATCACGGT CCAGGTCGGC
GAAATGCCGG ACACACAGCA GGCCGCCGCG ACCGGCAACG GGCTGGCCAA GGGCCGGCTG
GGGCTGGCGG TGCGGCCCCT ATCGCCGGAA GAACAGCGGG CGGCCGACGT CGACGGCGGT
GTGCTGGTCG AGGCGTCCGC GGGCCCGGCC GAACGCGCCG GCATCCGGCC GGGCGACGTG
ATCCTCGCAC TCAACGGCCA TGCGGTGGCC AATCCCGGCG AGCTGCGCGA ACTGGCGGAC
CGGGCCGACA AGCACGTGGC CCTGCTGGTG CAGCGCGGCG GTACGAGGAT TTTCGTGCCG
CTGGATTTGG GCTGA
 
Protein sequence
MFIQRQTPLI REEETAMPKN VMLSALIAAS ILAAMGAGYV RYGGYTLVPK SDLMQPPPQT 
PQLPEAAARA TPSLPDFAAI VRRNGPAVVN ISVTGTSRVN LPEMPQFDPN DPFGEFFRRF
QPQIPRGEDA PAHGLGSGFI IRPNGLILTN AHVVNGAQEV TVKLNDRREF KARIIGIDKP
TDVALLKIEA DGLPVVPLGD PARSGPGDWV VAIGSPFGFE NSVTAGIISA KSRSLPEETY
VPFIQTDVAV NPGNSGGPLF NLSGEVIGIN SQIYSRTGGY QGLSFAIPID VALKVEKQLL
ADGKVSRGRL GVGIQELNQS LAESFGLDRP TGALVDSVPN DGPAAKAGIK PGDVILSLNG
QPIENSGQLP PLVADIKPGS EAKVGIWRNG KREEITVQVG EMPDTQQAAA TGNGLAKGRL
GLAVRPLSPE EQRAADVDGG VLVEASAGPA ERAGIRPGDV ILALNGHAVA NPGELRELAD
RADKHVALLV QRGGTRIFVP LDLG