Gene MCA2704 details


Gene Information

Locus tag: MCA2704
Symbol:
ID: 3104402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 2888751
End bp: 2890445
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 67%
IMG OID: 637171836
Product: serine protease
Protein accession: YP_115106
Protein GI: 53803184
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
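
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2888751, 2890445)
print(length_bp)                  # 1695
print(protein_length(length_bp))  # 564
```

Both results agree with the Gene Length and Protein Length fields of the record.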


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0794852
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGAAA GCGATGTCGA CCGGGATGAG CTCGGGGCGG TGAGCGCCGC CGAGCTTGGT 
CGGCTGCAGG CGGACACCGG TTTGGATTTC GAATGGGTGG GGAATACCCG GACCAACGCG
AGGATTTTCC GGACGCCCGC TGCGTTGACC CGAGATGAGG CGACGGCCCT GGCACGAAGG
ATCGGTGTCA TGGAGGGCGT GCTGTGGGCT GCCGTCGAAG CCGGGCAGGG CACGGTATAT
GCCCGGTCGG CCTTGCCCGC CGTTGCCGGT ATCGAGGACT TCATCATCAG ATTCAAACGC
GGGGCTGGCT CGCTGCCGCT GCATGTCGCG AAACTCTCGG CCGTCGCCGG CGCAGGTCTG
GTCCCGACAT CGGAAACCGT AGGCACGGCG CAGGTGTACC GCCTCCTTCG TCCGGTGAGC
CTGACGGAGG CGGGCGAGAT CGAGCGGCGC CTCGAAGCCC TGCCGGAGGT GCGCTATGCC
GATCCGGTGA CACGCCTGAA GGCCCAGGCG ACCGTCGAGC CGGTCACCCC CAGCGACGAA
TATTTCCCCC AGCAGTGGCA TCTTCAGAGC GGTCCCGGGG CGGTTGCCGG CTCGGCCAAT
GTGCAGGCCG CCTGGTCGCT CATCCGGGGG GCGCCCGAAA TCACGGTTGC CGTGCTGGAC
AGCGGCGTCC TGTTCGCCCC GACCCATCCC GACCTCGAAA ACCGGCTGGC ATACCGGAAT
GCGGAGAAAA CCGTGATCGT CGGCTGGGAC ATGATCCGGA ATCCGAGATT CGCGCGGGAC
GGCAATGCCC GCGACAAAAA GCCGAAGGAC GAGGGCGACT GGGAGACCGT GGGCTGGTGC
GGGAAAGAGG ATGGAAGACC CGTGCCTTCG GAGTTCAAGC CTGCCGAATG GCACGGCAGC
CACGTCGCCG GCATCATCGG CGCCGTCACC GACAACGGTA TCGGTGTGAG CGGCGTCGAC
TGGTTCGCCC GCATCGAACC CGTGCGGGTG CTCGGCCCCT GCGATGGCAA CGCCAAAGAC
ATCATCCAGG GGATATGGTG GGCCGCCGGC AAGAAAGGGG TGGGAGGAAC CCAGCCGAAT
CCCAGGCCGG CCCATGTCAT CAACATGAGC CTGGGGCCCG ATGCGCCCGG TTCGGGACCC
TGTCCGGACG CCATGCAGGA AGCCATCGAC TACGCGCTCT CGCGCAACGC CGTGGTCGTC
GTATCGGCGG GCAATGAAAA GGACGATACC CGCAACTATG CGCCGGCGAA TTGCCGCGGC
GTGATCGCAG TCAGTGCCGT GGGGCCGTCC GGCGACCTGG CCTCGTACAG CAATTTCGGG
GCGGAGGTCG CCGTCGCGGC GCCCGGCGGA GACACCGGCG CGAATGGCGA ACGTCCGCAG
GACGGCATCC TCTCGACCCT GAATACCAGC CGGAAGCGGC CATCCGCCGA GGGCATGTCC
TATGGGGCGA TGGACGGTAC CAGCATGGCG GCGCCCGTGG TTTCCGGCGT CGTGTCGCTA
ATGCTGGCCG CGGATACCGA GAAAAAGCTC GATCCACAAC GGATCGTGGC CATCCTTCGG
GACACGGCGC GGCCGTTTCC GGAAGGTTCA CGGTGCGCCA CCGAGCTCGA GGGACGGTGC
GGAGCGGGGA TCGTGGACGC CTATGCCGCG GTGAAGGCTG CGATGTCTCA GACGATCGGC
CCCCGTGCGC AATGA
 
Protein sequence
MFESDVDRDE LGAVSAAELG RLQADTGLDF EWVGNTRTNA RIFRTPAALT RDEATALARR 
IGVMEGVLWA AVEAGQGTVY ARSALPAVAG IEDFIIRFKR GAGSLPLHVA KLSAVAGAGL
VPTSETVGTA QVYRLLRPVS LTEAGEIERR LEALPEVRYA DPVTRLKAQA TVEPVTPSDE
YFPQQWHLQS GPGAVAGSAN VQAAWSLIRG APEITVAVLD SGVLFAPTHP DLENRLAYRN
AEKTVIVGWD MIRNPRFARD GNARDKKPKD EGDWETVGWC GKEDGRPVPS EFKPAEWHGS
HVAGIIGAVT DNGIGVSGVD WFARIEPVRV LGPCDGNAKD IIQGIWWAAG KKGVGGTQPN
PRPAHVINMS LGPDAPGSGP CPDAMQEAID YALSRNAVVV VSAGNEKDDT RNYAPANCRG
VIAVSAVGPS GDLASYSNFG AEVAVAAPGG DTGANGERPQ DGILSTLNTS RKRPSAEGMS
YGAMDGTSMA APVVSGVVSL MLAADTEKKL DPQRIVAILR DTARPFPEGS RCATELEGRC
GAGIVDAYAA VKAAMSQTIG PRAQ
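
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own tooling: the names `CODON_TABLE`, `clean`, `translate`, and `gc_content` are hypothetical, and it assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted). Whitespace in the 10-base display blocks is removed before processing.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional T, C, A, G base ordering.
# Assumption: translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGTTCGAAA GCGATGTCGA CCGGGATGAG"))  # MFESDVDRDE
print(translate("CCCCGTGCGC AATGA"))                  # PRAQ
```

The two fragments translate to the first ten and last four residues of the protein sequence shown above. Passing the full gene sequence to `gc_content` gives the fraction that the GC content field reports as a rounded percentage.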