Gene MCA1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1812 
Symbol 
ID3103875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1945078 
End bp1946508 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content63% 
IMG OID637170972 
ProductU32 family peptidase 
Protein accessionYP_114250 
Protein GI53804107 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGGC CCAGGCATGG GCTGCGTCAA GCGGCGTCAG GTTCTGGTAT AGTGCCGTCG 
AGCTATGCCC TCTTCATATC GCTGCTCCCG ATGCCTTCTG TCGAACTGCT CTCCCCTGCC
GGCACCCTGA AGAACCTGCG TTATGCCTTC GCATACGGCG CCGATGCCGT CTATGCCGGC
CAGCCGCGCT ACTCGCTGCG GGTCCGCAAC AACGACTTCC TGGAGAAAAA CCTGGCGCTG
GGCATCGCCG AGGCCCATCG CCTCGGCAAG AAGTTTTATC TCGCGGCCAA CGTGCTGCCC
CACAACGCCA AGATCAAGAC CTTCATCAAA GACATCGCGC CCGTGGTTGC GATGGGACCG
GATGCGCTGA TCATGGCCGA TCCCGGCCTC ATCCTGCTGG CGCGCGAGGC CTGGCCGGAA
ATGCCCATCC ATCTGTCGGT GCAGGCCAAC ACGTTGAACT TTGCGGCGGT GAAGTTCTGG
CAGTCGGCCG GCATCTCGCG GATCATCCTG TCGCGCGAGC TCTCGCTCGA CGAGATCGCC
GAAATCCGCC AGCAATGCCC GGACACGGAG CTCGAAGTCT TCGTCCACGG CGCGCTGTGC
ATCGCCTACT CCGGACGCTG CCTGCTCTCC GGCTATTTCA ATCACCGTGA TCCCAACCAG
GGCACCTGCA CCAATGCCTG CCGCTGGAAA TACGACGTCG CCCCGGCGCA CGAGAACGCC
GAAGGAGACT ATGTTCCGGG AGCATTGCGG CTGAACCTCG CGGAGTTGAA TGGCAGCCTG
GCGGACTGCG GCGCCGCGGA GCGCCACCCG CTGGCCGACG AAGTGTATTT TCTGGAAGAG
GAAAACCGCC CGGGCGAACT GATGCCGGTG ATGGAAGACG AGCACGGCAC CTACATCATG
AACTCCAAGG ACCTTCGAGC AGTCGAGCAC GTGCAGCGGC TGGTGGAGAT CGGGGTAGAC
AGCCTGAAGA TCGAAGGCCG CACCAAGTCT CATTACTATG TGGCGCGCAC CGCCCAGGTC
TACCGCCGAG CCATCGACGA CGCGCTCGCC GGGCGGCAAT TCGACTGGAA CCTGCTCGGC
GTGCTGGAAA ATCTGGCGCA GCGCGGCTAC ACGGACGGAT TCTACCAGCG CCACCATACC
CAGGATTACC AGAACTATGT GCGGGGCTGC TCCGAAAGCC ACCGCCAGCA GTTCGTCGCC
GAAGTCACCA GGTTCGATGG CGGGATGGCC GAAGTCACCG TCAAGAACAA GTTCGGCGTC
GGCGACCGGC TAGAGCTGAT CCGCCCCGAA GGCAACCTCG ATTTCTTGCT CACCGGCATG
GAGGACTCCC AGGGCAATGC CATGCAGGAA GCTCCCGGCG GCGGCTGGGA GGTCAGGATT
CCCTTGCCCG CGCCCTGCGA CGATACTGCC TTGCTGTCAC GCTATCTCTG A
 
Protein sequence
MTRPRHGLRQ AASGSGIVPS SYALFISLLP MPSVELLSPA GTLKNLRYAF AYGADAVYAG 
QPRYSLRVRN NDFLEKNLAL GIAEAHRLGK KFYLAANVLP HNAKIKTFIK DIAPVVAMGP
DALIMADPGL ILLAREAWPE MPIHLSVQAN TLNFAAVKFW QSAGISRIIL SRELSLDEIA
EIRQQCPDTE LEVFVHGALC IAYSGRCLLS GYFNHRDPNQ GTCTNACRWK YDVAPAHENA
EGDYVPGALR LNLAELNGSL ADCGAAERHP LADEVYFLEE ENRPGELMPV MEDEHGTYIM
NSKDLRAVEH VQRLVEIGVD SLKIEGRTKS HYYVARTAQV YRRAIDDALA GRQFDWNLLG
VLENLAQRGY TDGFYQRHHT QDYQNYVRGC SESHRQQFVA EVTRFDGGMA EVTVKNKFGV
GDRLELIRPE GNLDFLLTGM EDSQGNAMQE APGGGWEVRI PLPAPCDDTA LLSRYL