Gene MCA2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2023 
SymbolspoT 
ID3105018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2171099 
End bp2173306 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content62% 
IMG OID637171178 
Productguanosine-3,5-bis(diphosphate) 3-pyrophosphohydrolase 
Protein accessionYP_114455 
Protein GI53803680 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases 
TIGRFAM ID[TIGR00691] (p)ppGpp synthetase, RelA/SpoT family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCGT ACGCCGTCGC CGATGCCCCG CCGCCGCAAG AGCCGGCGGA AACGCCTCAG 
GAAAAACTGG CGCGCCGGCT CGGCCATCTG ACTTCGAGCT ACCTCGAACC GGCCCAGGTC
GCGGAAATCC AGAAGGCTTA CGAAGTCGCC GCCAAGGCGC ATGAGGGGCA GTTTCGCCTC
AGCGGCGAGC CCTACGTCTG TCACCCCCTT TCGGTCGCCA TCATCCTTGC CGAAATGCGG
ATGGATGCCA AGGGCATCAT GGCGGCCCTC CTCCACGACG TCATCGAAGA CACGCCCTTG
ACCAAGGAGG ATCTGAGCGC CCGCTTCGGA GCCGAAGTGG CGGAACTGGT CGATGGCGTC
AGCAAGCTCA CCCAGCTCGA TTGCAAATCG CGCGCCGAGG CTCAGGCGGA GAATGTGCGC
AAGATGGTCC TGGCGATGGT CAAGGACCTG CGCGTGATCA TGGTCAAGCT GGCGGACCGG
CTCCACAACA TGCGCACGCT GGACGTGATG AACCCGGGAC GGCGCTGTCG CATCGCCCGG
GAAACCCTGG ACATTTATGC CCCCATCGCC CACCGGCTGG GAATGAACGA GGTCCGGCTG
GAACTGGAGA ATCTCGGCTT CTCCGCCATG TACCCGATGC GGCACCGGGT ACTCGAGCAG
TGCGTCAAGA AAGCTCGAGG CAACCGCAAG GAAGTCCTCG CCACCATCGA ATCAACGCTC
AAAGCGCGTC TCAAGGATTG CGGCATGCCG GACGCCCGGA TCATCGGCCG GGAGAAGCAT
CTCTACAGTC TCTACCAGAA GATGCGCAGC AAGCATCTAC CCTTCGCCCA GGTCTACGAC
GTCTATGCGT TCCGCATCGT GGTCGACTCG CCCGACGAGT GTTACCGGGC GCTGGGCGCC
GTCCACAACC TGTACAAACC GATTCCGGGC CGCTTCAAAG ACTACATCGC CCTGCCCAAG
GCCAACGGCT ACCAGTCGCT CCACACCGTC CTGATCGGCC CCTTCGGCCT GCCGCTGGAA
ATCCAGATCC GCACCCATGC GATGCACCAC ATGGCGGAGT CGGGCATCGC TGCCCATTGG
CTGTACAAAT CGGAAACCGA TCCCGCCGCC GGCAGCCAGG CGCGAGCCCG GGAATGGCTG
CGAGACCTGC TAGAAATCCA GAAGAGTGCC GGCGATTCCC TGGAATTCCT CGACAACCTC
AAGGTCGACC TGTTCCAGCA CGAGTGCTAC GTGTTCACCC CCAAGGGCCG CATCATCAAG
CTGCCGCGTG GCGCCACTAT CGTCGATTTC GCCTACGCGG TACATACCGA CATCGGCAAT
TCCTGTGTCT CAGCCCGCGT CAACCGCATT CTGGCGCCGC TGCACAGCGT GCTCGAGAAT
GGCCAGACCA TCGAGATCAT CACCGCGCCT TGGGCCAGAC CGAATCCCCT GTGGCTGAAC
TATGTGGTAA CGGCCAAAGC TCGGGCCGCA ATCCGCAGCC ATTTGAGAAA CTTCAAGAAA
CAGGAAGCGG TCAACCTCGG ACGGCGCCTG TTGGAAAAAG AACTGGCCAA TCACGGGCTG
AATCTGGAAG CGATTCCGCC GCAGCAACTG GAAAACTTTA CCCGCTCCCT GGAGTTGGCC
TCGTTCGAAT CCCTGCTGGA AGATCTCGGG CTCGGCAACC GGCTCCCCTT CCTCGTCGTG
CAGCAAATGC TGCATGGTGA ACGCGAAAGC GGCAAAGGCG CGCCGTCGCC AGCTGCCGAA
AAAAGCGCCA GGATGCCGCT CGTCATCAAG GGAACCGAGG GCGTCGTGGT CAACCTGGCA
AAGTGCTGCC GGCCTATTCC AGGCGACCCC ATCGTCGGTT TCTTCAACCC AGGCAAGGGC
ATCGTGGTGC ATCTCACCGA CTGCAAAAAT GCTGCCGAGC TGCGGCGCAA GCAGATCAAC
AGCCTGGACG TCGAGTGGGA TCGCGCGGCG AGCGGGATGT TCCCGGCGAT GATTCGGCTG
GAACTGATGA ACCGGGTTGG CACCCTCGCC CAGGTGGCAT CGGCGATCTC CCGGATGGAA
GCCGACATCG AAAACGTCCA GATCACCAAC CAGGACGACC AGATATCCAC GGACGTCATC
ACGATCGGCG TCAAGGACAG GGTCCACCTC GCGCGGGTCA TGCGCGAATT GCGCCGACTG
AGCATCGTCC TGAAAATATC CAGAGTCAAA TCGGAACTGA GAAAGTAG
 
Protein sequence
MSSYAVADAP PPQEPAETPQ EKLARRLGHL TSSYLEPAQV AEIQKAYEVA AKAHEGQFRL 
SGEPYVCHPL SVAIILAEMR MDAKGIMAAL LHDVIEDTPL TKEDLSARFG AEVAELVDGV
SKLTQLDCKS RAEAQAENVR KMVLAMVKDL RVIMVKLADR LHNMRTLDVM NPGRRCRIAR
ETLDIYAPIA HRLGMNEVRL ELENLGFSAM YPMRHRVLEQ CVKKARGNRK EVLATIESTL
KARLKDCGMP DARIIGREKH LYSLYQKMRS KHLPFAQVYD VYAFRIVVDS PDECYRALGA
VHNLYKPIPG RFKDYIALPK ANGYQSLHTV LIGPFGLPLE IQIRTHAMHH MAESGIAAHW
LYKSETDPAA GSQARAREWL RDLLEIQKSA GDSLEFLDNL KVDLFQHECY VFTPKGRIIK
LPRGATIVDF AYAVHTDIGN SCVSARVNRI LAPLHSVLEN GQTIEIITAP WARPNPLWLN
YVVTAKARAA IRSHLRNFKK QEAVNLGRRL LEKELANHGL NLEAIPPQQL ENFTRSLELA
SFESLLEDLG LGNRLPFLVV QQMLHGERES GKGAPSPAAE KSARMPLVIK GTEGVVVNLA
KCCRPIPGDP IVGFFNPGKG IVVHLTDCKN AAELRRKQIN SLDVEWDRAA SGMFPAMIRL
ELMNRVGTLA QVASAISRME ADIENVQITN QDDQISTDVI TIGVKDRVHL ARVMRELRRL
SIVLKISRVK SELRK