Gene SAG1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1940 
Symbol 
ID1014750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1926442 
End bp1928658 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content37% 
IMG OID637317107 
ProductGTP pyrophosphokinase family protein 
Protein accessionNP_688928 
Protein GI22538077 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases 
TIGRFAM ID[TIGR00691] (p)ppGpp synthetase, RelA/SpoT family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.891198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAAAG AAATTAATTT AACAGGAGAA GAGGTTGTTG CAATCACATC CCAGTACATG 
AGTGAGACGG ATGTTGCTTT TGTAAAATTT GCCTTAAATT ACGCAACCGC AGCTCATTAT
TATCAAGCAA GAAAGTCGGG CGAGCCCTAC ATTATTCATC CAATTCAAGT AGCAGGTATT
CTTGCAGATT TACACCTTGA TGCCGTTACG GTAGCTTGTG GTTTTTTACA TGATGTTGTA
GAAGATACGG AAATTACCCT TGATGAGATT GAAACTGATT TTGGCAAGGA TGTTCGCGAT
ATTATTGACG GCGTAACAAA ATTGGGTAAA GTGGAGTACA AATCTCATGA AGAACAGTTA
GCAGAAAATC ACCGTAAAAT GTTAATGGCT ATGTCTAAGG ATATCCGTGT TATTTTGGTT
AAGCTCGCTG ACCGCTTACA TAATATGAGA ACCCTTAAAC ACCTCAGAAA AGATAAACAA
GAGCGTATTT CACGTGAGAC TATGGAAATA TACGCTCCTT TAGCGCATCG TTTGGGGATT
AGTCGTATCA AGTGGGAATT AGAAGATTTA TCTTTTCGTT ACTTGAATGA GACGGAATTT
TATAAGATTT CTCATATGAT GAGTGAAAAA CGTCGTGAAC GCGAAGAGTT GGTTGATATA
ATTGTCGATA AAATCAGATC CTATACTGAG GAACAGGGTT TATATGGTGA TATTTATGGT
AGACCAAAAC ACATTTATTC TATTTACAGA AAAATGCGTG ATAAAAAGAA ACGCTTTGAT
CAGATTTATG ATTTAATTGC GATACGCTGT ATCATGGAAA CTGCTAGTGA TGTTTATGCC
ATGGTGGGGT ATATCCATGA GTTATGGCGT CCTATGCCAG GAAGGTTTAA AGATTATATT
GCGGCACCAA AGGCCAATGG TTATCAATCT ATTCACACGA CGGTTTATGG ACCTAAAGGG
CCAATTGAAA TTCAGATTCG AACTAAAGAA ATGCATCAAG TTGCCGAGTT TGGTGTTGCA
GCCCATTGGG CCTATAAAAA AGGAATCACC AGTAAGGTTA ATCAAGCAGA GCAATCGGTT
GGTATGGGAT GGATTCAAGA ATTAGTTGAG CTTCAAGATG AATCAAAAGA CGCCAAGGAT
TTTGTTGACT CTGTTAAGGA AGATATCTTT ACAGAACGCA TCTATGTTTT TACCCCAAAT
GGTGCTGTTC AGGAATTGCC AAGGGAATCT GGACCAATTG ATTTTGCTTA TGCTATTCAC
ACGCAAGTTG GAGAAAAAGC TACTGGCGCT AAAGTAAATG GACGTATGGT TCCATTGACT
GCTAAGTTAA AAACAGGAGA TGTTGTTGAG ATTATCACAA ACCCTAATTC ATTTGGACCA
AGTCGTGACT GGATTAAGAT TGTTAAGACC AATAAAGCTC GCAATAAAAT TCGTCAGTTC
TTCAAAAACC AAGATAAAGA AACTTCTATT AATAAAGGTA GAGAATTACT GGTTGATTAT
TTCCAAGAAC AAGGTTATGT GCCTAACAAA TATTTAGATA AGAAACATAT TGAAGAAATC
CTTCCACGTG TCAGTGTTAA AAGTGAGGAA GCTTTGTATG CAGCCGTAGG TTTTGGTGAT
TTGAGTCCGA TTAGTATTTT TAATAAACTG ACGGAAAAAG AACGCCGTGA GGAGGAACGT
GCTAAGGCTA AAGCGGAAGC AGACGAACTT ATTAACGGTG GTGAAATCAA AACCGATAAG
CGTGATGTTC TCAAAGTTAA GAGTGAAAAT GGTGTTATCA TTCAAGGAGC TTCAGGTTTA
TTAATGCGTA TTGCAAAATG TTGTAATCCC GTTCCTGGAG ATCTCATTGA AGGCTATATA
ACTAAAGGTA GAGGCGTGGC TATCCATCGT TCAGATTGTC AAAATTTAAA GAGTCAGGAG
AACTACGAAC AACGTTTGAT TGATGTTGAG TGGGATGATG ATGGATCTAA AAAAGAGTAC
ATGGCTGAAA TCGATATTTA TGGTTTGAAT CGTAGTGGCC TCCTTAATGA TGTTCTTCAA
ACCTTATCAA ATGCAACAAA GCTAGTATCG ACAGTAAATG CACAACCAAC AAAAGATATG
AAATTTGCTA ATATTCATGT TAGCTTTGGA ATTTCAAATT TAGCACAATT GACAACTGTG
GTTGATAAAA TCAAAATTAT CCCAGATGTT TATTCTGTTA AACGTACGAA TGGTTAG
 
Protein sequence
MVKEINLTGE EVVAITSQYM SETDVAFVKF ALNYATAAHY YQARKSGEPY IIHPIQVAGI 
LADLHLDAVT VACGFLHDVV EDTEITLDEI ETDFGKDVRD IIDGVTKLGK VEYKSHEEQL
AENHRKMLMA MSKDIRVILV KLADRLHNMR TLKHLRKDKQ ERISRETMEI YAPLAHRLGI
SRIKWELEDL SFRYLNETEF YKISHMMSEK RREREELVDI IVDKIRSYTE EQGLYGDIYG
RPKHIYSIYR KMRDKKKRFD QIYDLIAIRC IMETASDVYA MVGYIHELWR PMPGRFKDYI
AAPKANGYQS IHTTVYGPKG PIEIQIRTKE MHQVAEFGVA AHWAYKKGIT SKVNQAEQSV
GMGWIQELVE LQDESKDAKD FVDSVKEDIF TERIYVFTPN GAVQELPRES GPIDFAYAIH
TQVGEKATGA KVNGRMVPLT AKLKTGDVVE IITNPNSFGP SRDWIKIVKT NKARNKIRQF
FKNQDKETSI NKGRELLVDY FQEQGYVPNK YLDKKHIEEI LPRVSVKSEE ALYAAVGFGD
LSPISIFNKL TEKERREEER AKAKAEADEL INGGEIKTDK RDVLKVKSEN GVIIQGASGL
LMRIAKCCNP VPGDLIEGYI TKGRGVAIHR SDCQNLKSQE NYEQRLIDVE WDDDGSKKEY
MAEIDIYGLN RSGLLNDVLQ TLSNATKLVS TVNAQPTKDM KFANIHVSFG ISNLAQLTTV
VDKIKIIPDV YSVKRTNG