Gene MCA2343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2343 
SymbolhtrA 
ID3105048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2534666 
End bp2536063 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content61% 
IMG OID637171486 
Productprotease DO 
Protein accessionYP_114759 
Protein GI53803452 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family
[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGACCCTC TTGCTATGCG GTTTAGAAAG ATAATCGATA GAGGTAATTC CATGCGCATG 
AAATCCATCG GCGCGCTGTT GCTACTGACC GCCAGCGGCC TGTCCCTCTG CCCCCCGGTG
TGGGCCGATT TGCCCGCCAG CGTCAACGGT CTGCCACTGC CCAGCCTCGC GCCCGTCTTG
AAGAAGGCCA TGCCCGCGGT GGTCAACATC TCGACGAAGA CCCAGATCGA AATCGCCGAG
AATCCCCTGA TGCAAGACCC CTTCTTCCGA CATTTCTTCG GTATTCCGAA TCAGCCGCGG
CGCCGTGAGA GCTCCAGCCT CGGCTCCGGG GTGATCGTCG ACGCCCGCCG AGGCTACATC
CTCACCAACA ACCATGTGAT CGACAAGGCG GACGAGATAA GCGTGACCTT GAGAGACGGA
CGCCAGTTGA GCGCAAAACT GGTCGGCGCC GATCCGGAGT CGGATTTGGC CGTCATCAAG
GTCGAACCCA AGAATCTGAC GGAACTGCCC ATCGGTGATT CGAGTCAGCT CGAAGTCGGC
GACTTCGTGG TAGCCATCGG CAATCCCTTT GGCCTGGGGC AGACAGTGAC CTCCGGCATC
GTCAGTGCCC TGGGGCGATC CGGACTCGGC ATTGAGGGGT ACGAGGATTT CATCCAGACC
GACGCCTCGA TCAACCCCGG CAACTCGGGA GGCGCCTTGA TCAATCTCCG TGGCGAGCTG
GTCGGCGTGA ATACGGCCAT CATCGCTCCC ACCGGCGGAA ACGTGGGCAT CGGTTTCGCC
ATCCCGTCCA ACATGGCCGC CAGCATCATG ACCCAACTGG TCGAAAAGGG CGAAATCCGC
CGCGGCCAGA TCGGCATCAC CATCCAAGAC CTGACGCCGG ATCTCGCTCA GGCCTTTGGC
CTGAAGCAGA GCCAGGGCGC GGTGATCACC GGCGTCCAAA AGGATTCCCC GGCCGCATCT
TCGGGCCTGG AAGCCGGCGA CGTCGTCGTC AGCGTCAATG ACCGCCCGGT CAAAAACAGC
GCGGACGTCC GCAACACCAT CGGCCTCCTG CCCATAGGCG AAGAAGTCCG GGTCGAAGTG
ATGCACAAGG GGGAGAGAGT GGTACGCGAG GTGGTGATCC GCGCCCCCAA ACTGGTCCAG
GAAGAGGGCA ATAAAATCCA TCCCAGACTG TCCGGTGTCA TACTTAAGAA CAACGAGGAG
GGCGGTGTCC AGGTGGAAAA AATCCACACG AGTTCTTACG CCTTCCAGGC CGGCCTGCGC
CCCGGCGACG TGATCGTGAT GGCGAACCGC GAGGAAATCG AAACGCTCGA TGACCTGAAG
CGCGCCACCA AGGGCCGCTC GGAGCTGCTC CTCAGCGTCC AGCGGGGCAG CGGCTCGTTC
TTCTTGATGC TGAAGTAG
 
Protein sequence
MDPLAMRFRK IIDRGNSMRM KSIGALLLLT ASGLSLCPPV WADLPASVNG LPLPSLAPVL 
KKAMPAVVNI STKTQIEIAE NPLMQDPFFR HFFGIPNQPR RRESSSLGSG VIVDARRGYI
LTNNHVIDKA DEISVTLRDG RQLSAKLVGA DPESDLAVIK VEPKNLTELP IGDSSQLEVG
DFVVAIGNPF GLGQTVTSGI VSALGRSGLG IEGYEDFIQT DASINPGNSG GALINLRGEL
VGVNTAIIAP TGGNVGIGFA IPSNMAASIM TQLVEKGEIR RGQIGITIQD LTPDLAQAFG
LKQSQGAVIT GVQKDSPAAS SGLEAGDVVV SVNDRPVKNS ADVRNTIGLL PIGEEVRVEV
MHKGERVVRE VVIRAPKLVQ EEGNKIHPRL SGVILKNNEE GGVQVEKIHT SSYAFQAGLR
PGDVIVMANR EEIETLDDLK RATKGRSELL LSVQRGSGSF FLMLK