Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2882 |
Symbol | guaA |
ID | 6269720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2680405 |
End bp | 2681982 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726825 |
Product | GMP synthase |
Protein accession | YP_001881298 |
Protein GI | 187731505 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0113861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAA ACATTCATAA GCATCGCATC CTCATTCTGG ACTTCGGTTC TCAGTACACT CAACTGGTTG CGCGCCGCGT GCGTGAGCTG GGTGTTTACT GCGAACTGTG GGCGTGGGAT GTGACAGAAG CACAAATTCG TGACTTCAAT CCAAGCGGCA TTATTCTTTC CGGCGGCCCG GAAAGTACTA CTGAAGAAAA CAGTCCGCGT GCGCCGCAGT ATGTCTTTGA AGCAGGCGTA CCGGTATTCG GCGTTTGCTA TGGCATGCAG ACCATGGCAA TGCAGTTGGG CGGTCACGTT GAAGCCTCTA ACGAACGTGA ATTTGGCTAC GCGCAGGTTG AAGTCGTAAA CGACAGCGCA CTGGTTCGCG GTATCGAAGA TGCGCTGACC GCAGACGGTA AACCGCTGCT CGATGTCTGG ATGAGCCACG GCGATAAAGT TACCGCTATC CCGTCCGACT TCGTCACCGT AGCCAGCACC GAAAGCTGCC CGTTTGCCAT TATGGCTAAC GAAGAAAAAC GCTTCTATGG CGTACAGTTC CACCCGGAAG TGACTCACAC CCGCCAGGGT ATGCACATGC TGGAGCGTTT TGTGCGTGAT ATCTGCCAGT GTGAAGCCCT GTGGACGCCA GCGAAAATTA TCGACGATGC TGTAGCTCGC ATCCGCGAGC AGGTAGGCGA CGATAAAGTC ATCCTCGGCC TCTCTGGTGG TGTGGATTCC TCCGTAACCG CAATGCTGCT GCACCGCGCT ATCGGTAAAA ACCTGACTTG CGTATTCGTC GACAACGGCC TGCTGCGTCT CAACGAAGCA GAGCAGGTTC TGGATATGTT TGGCGATCAC TTTGGTCTGA ACATTGTTCA CGTACCGGCA GAAGATCGCT TCCTGTCAGC GCTGGCTGGC GAAAACGATC CGGAAGCAAA ACGTAAAATC ATCGGGCGCG TTTTCGTTGA AGTGTTCGAT GAAGAAGCGC TGAAACTGGA AGACGTGAAG TGGCTGGCGC AGGGCACCAT CTACCCTGAC GTTATCGAAT CTGCGGCTTC TGCAACCGGT AAAGCACACG TCATCAAATC TCACCACAAC GTGGGCGGCC TGCCGAAAGA GATGAAGATG GGCCTGATTG AACCGCTGAA AGAGCTGTTC AAAGACGAAG TGCGTAAGAT TGGTCTGGAG CTGGGCCTGC CGTACGACAT GCTGTACCGT CACCCGTTCC CGGGACCAGG CCTTGGCGTT CGTGTGCTGG GTGAAGTGAA GAAAGAGTAC TGTGACCTGC TGCGCCGTGC TGACGCCATC TTCATTGAAG AACTGCGTAA AGCGGACCTG TACAACAAAG TCAGCCAGGC GTTCACTGTG TTCCTGCCGG TACGTTCCGT TGGCGTAATG GGCGATGGTC GTAAGTATGA CTGGGTTGTC TCTCTGCGTG CTGTCGAAAC CATCGACTTT ATGACCGCAC ACTGGGCGCA TCTGCCGTAC GATTTCCTCG GTCGCGTTTC CAACCGCATT ATCAATGAAG TGAACGGTAT TTCCCGCGTG GTGTATGACA TCAGCGGCAA GCCGCCAGCT ACCATTGAGT GGGAATGA
|
Protein sequence | MTENIHKHRI LILDFGSQYT QLVARRVREL GVYCELWAWD VTEAQIRDFN PSGIILSGGP ESTTEENSPR APQYVFEAGV PVFGVCYGMQ TMAMQLGGHV EASNEREFGY AQVEVVNDSA LVRGIEDALT ADGKPLLDVW MSHGDKVTAI PSDFVTVAST ESCPFAIMAN EEKRFYGVQF HPEVTHTRQG MHMLERFVRD ICQCEALWTP AKIIDDAVAR IREQVGDDKV ILGLSGGVDS SVTAMLLHRA IGKNLTCVFV DNGLLRLNEA EQVLDMFGDH FGLNIVHVPA EDRFLSALAG ENDPEAKRKI IGRVFVEVFD EEALKLEDVK WLAQGTIYPD VIESAASATG KAHVIKSHHN VGGLPKEMKM GLIEPLKELF KDEVRKIGLE LGLPYDMLYR HPFPGPGLGV RVLGEVKKEY CDLLRRADAI FIEELRKADL YNKVSQAFTV FLPVRSVGVM GDGRKYDWVV SLRAVETIDF MTAHWAHLPY DFLGRVSNRI INEVNGISRV VYDISGKPPA TIEWE
|
| |