Gene Information |
Locus tag | EcSMS35_2656 |
Symbol | guaA |
ID | 6146677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2715255 |
End bp | 2716832 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617527 |
Product | GMP synthase |
Protein accession | YP_001744692 |
Protein GI | 170683481 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
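
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2715255, 2716832)
print(length_bp)                  # 1578
print(protein_length(length_bp))  # 525
```

Both values agree with the Gene Length and Protein Length fields in this record.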
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0518373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAAA ACATTCATAA GCATCGCATC CTCATTCTGG ACTTCGGTTC CCAGTATACT CAACTGGTTG CGCGCCGCGT GCGTGAGCTG GGCGTTTACT GCGAACTGTG GGCGTGGGAT GTGACAGAAG CACAAATTCG TGACTTCAAT CCAAGCGGCA TTATTCTTTC CGGCGGCCCG GAAAGCACCA CCGAAGAAAA CAGCCCGCGT GCGCCGCAGT ATGTCTTTGA AGCAGGCGTA CCGGTATTCG GCGTTTGCTA TGGCATGCAG ACCATGGCGA TGCAGTTGGG CGGTCACGTT GAAGCCTCTA ACGAACGTGA ATTTGGCTAC GCGCAGGTTG AAGTCGTAAA CGACAGCGCA CTAGTTCGCG GTATCGAAGA TGCGCTGACC GCAGACGGTA AACCTCTGCT CGATGTCTGG ATGAGCCACG GCGATAAAGT TACCGCTATC CCGTCCGACT TCGTCACCGT AGCCAGCACC GAAAGCTGCC CGTTTGCCAT TATGGCTAAC GAAGAAAAAC GCTTCTATGG CGTACAGTTC CACCCGGAAG TGACTCACAC CCGCCAGGGT ATGCGCATGC TGGAGCGTTT TGTGCGTGAT ATCTGCCAGT GTGAAGCCCT GTGGACGCCA GCGAAAATTA TCGACGATGC TGTAGCCCGT ATCCGCGAGC AGGTAGGCGA CGATAAAGTC ATCCTCGGCC TCTCTGGTGG CGTGGATTCC TCCGTGACCG CAATGCTGCT GCACCGCGCT ATCGGTAAAA ATCTGACTTG CGTATTCGTC GATAACGGCC TGCTGCGTCT CAACGAAGCA GAGCAGGTTC TGGATATGTT TGGCGATCAC TTTGGTCTGA ACATTGTTCA CGTTCCGGCA GAAGATCGCT TCCTGTCAGC GTTGGCCGGC GAAAACGATC CGGAAGCAAA ACGTAAAATC ATCGGGCGCG TTTTCGTTGA AGTATTCGAT GAAGAAGCAC TGAAACTGGA AGACGTGAAG TGGCTGGCGC AGGGCACTAT CTACCCTGAC GTTATTGAAT CTGCCGCATC TGCAACCGGT AAAGCACACG TCATCAAATC TCACCACAAC GTGGGCGGCC TGCCGAAAGA GATGAAGATG GGCCTGGTTG AACCGCTGAA AGAGCTGTTC AAAGACGAAG TGCGTAAGAT TGGTCTGGAG CTGGGCCTGC CGTACGACAT GCTGTACCGT CACCCGTTCC CGGGACCAGG CCTTGGCGTT CGTGTTCTGG GTGAAGTGAA GAAAGAGTAT TGTGACCTGC TGCGCCGTGC TGACGCCATC TTCATTGAAG AACTGCGTAA AGCGGACCTG TACGACAAAG TCAGCCAGGC GTTCACCGTC TTCCTGCCGG TACGTTCCGT TGGCGTAATG GGCGATGGTC GTAAGTATGA CTGGGTTGTC TCTCTGCGTG CTGTCGAAAC CATCGACTTT ATGACCGCAC ACTGGGCGCA TCTGCCGTAC GATTTCCTCG GTCGCGTTTC CAACCGCATT ATCAATGAAG TGAACGGTAT TTCCCGCGTG GTGTATGACA TCAGCGGCAA GCCGCCAGCT ACCATTGAGT GGGAATGA
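
The gene sequence is displayed in space-separated blocks of ten bases. A minimal Python sketch for joining those blocks, computing GC content, and checking that the sequence looks like a complete CDS is given below. The helper names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 permits additional start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognised stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# The first two display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGACGGAAA ACATTCATAA")
print(len(prefix))         # 20
print(gc_percent(prefix))  # 30.0
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` gives the value to compare with the GC content field. How the database rounds that figure is not stated in this record.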
|
Protein sequence | MTENIHKHRI LILDFGSQYT QLVARRVREL GVYCELWAWD VTEAQIRDFN PSGIILSGGP ESTTEENSPR APQYVFEAGV PVFGVCYGMQ TMAMQLGGHV EASNEREFGY AQVEVVNDSA LVRGIEDALT ADGKPLLDVW MSHGDKVTAI PSDFVTVAST ESCPFAIMAN EEKRFYGVQF HPEVTHTRQG MRMLERFVRD ICQCEALWTP AKIIDDAVAR IREQVGDDKV ILGLSGGVDS SVTAMLLHRA IGKNLTCVFV DNGLLRLNEA EQVLDMFGDH FGLNIVHVPA EDRFLSALAG ENDPEAKRKI IGRVFVEVFD EEALKLEDVK WLAQGTIYPD VIESAASATG KAHVIKSHHN VGGLPKEMKM GLVEPLKELF KDEVRKIGLE LGLPYDMLYR HPFPGPGLGV RVLGEVKKEY CDLLRRADAI FIEELRKADL YDKVSQAFTV FLPVRSVGVM GDGRKYDWVV SLRAVETIDF MTAHWAHLPY DFLGRVSNRI INEVNGISRV VYDISGKPPA TIEWE
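
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons, and it stops at the first stop codon. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three display blocks of the gene sequence in this record.
print(translate("ATGACGGAAA ACATTCATAA GCATCGCATC"))  # MTENIHKHRI
```

The output matches the first ten residues of the protein sequence listed above.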