Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2658 |
Symbol | guaA |
ID | 5591890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2668598 |
End bp | 2670175 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640921773 |
Product | GMP synthase |
Protein accession | YP_001459300 |
Protein GI | 157161982 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.00000100783 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAA ACATTCATAA GCATCGCATC CTCATTCTGG ACTTCGGTTC CCAGTATACT CAACTGGTTG CGCGCCGCGT GCGTGAGCTG GGCGTTTACT GCGAACTGTG GGCGTGGGAT GTGACAGAAG CACAAATTCG TGACTTCAAT CCAAGCGGCA TTATTCTTTC CGGCGGCCCG GAAAGTACTA CTGAAGAAAA CAGTCCGCGT GCGCCGCAGT ATGTCTTTGA AGCAGGCGTA CCGGTATTCG GCGTTTGCTA TGGTATGCAG ACCATGGCGA TGCAGTTGGG CGGTCACGTT GAAGCCTCTA ACGAACGTGA ATTTGGCTAC GCGCAGGTTG AAGTCGTAAA CGACAGCGCA CTGGTTCGCG GTATCGAAGA TGCGCTGACC GCAGACGGTA AACCGCTGCT CGATGTCTGG ATGAGCCACG GCGATAAAGT TACCGCTATC CCGTCCGACT TCGTCACCGT AGCCAGCACC GAAAGCTGCC CGTTTGCCAT TATGGCTAAC GAAGAAAAAC GCTTCTATGG CGTACAGTTC CACCCGGAAG TGACTCACAC CCGCCAGGGT ATGCGCATGC TGGAGCGTTT TGTGCGTGAT ATCTGCCAGT GTGAAGCCCT GTGGACGCCA GCGAAAATTA TCGACGATGC TGTAGCCCGC ATCCGTGAGC AGGTAGGCGA CGATAAAGTC ATCCTCGGCC TCTCTGGTGG CGTGGATTCC TCCGTAACCG CAATGCTACT GCACCGCGCT ATCGGTAAAA ACCTGACTTG CGTATTCGTC GACAACGGCC TGCTGCGTCT CAACGAAGCA GAGCAGGTTC TGGATATGTT TGGCGATCAC TTTGGTCTGA ACATTGTTCA CGTTCCGGCA GAAGATCGCT TCCTGTCAGC GCTGGCTGGC GAAAACGATC CGGAAGCAAA ACGTAAAATC ATCGGTCGCG TTTTCGTTGA AGTATTCGAT GAAGAAGCGC TGAAACTGGA AGACGTGAAG TGGCTGGCGC AGGGCACCAT CTACCCTGAC GTTATCGAAT CTGCGGCGTC TGCAACCGGT AAAGCACACG TCATCAAATC TCACCACAAC GTGGGCGGCC TGCCGAAAGA GATGAAGATG GGCCTGGTTG AACCGCTGAA AGAGCTGTTC AAAGACGAAG TGCGTAAGAT TGGTCTGGAG CTGGGCCTGC CGTACGACAT GTTGTACCGT CACCCATTCC CGGGACCAGG CCTTGGCGTT CGTGTGCTGG GTGAAGTGAA GAAAGAGTAC TGTGACCTGC TGCGCCGTGC TGACGCCATC TTCATTGAAG AACTGCGTAA AGCGGACCTG TACGACAAAG TCAGCCAGGC GTTCACTGTG TTCCTGCCGG TACGTTCCGT TGGCGTAATG GGCGATGGTC GTAAGTATGA CTGGGTTGTC TCTCTGCGTG CTGTCGAAAC CATCGACTTT ATGACCGCAC ACTGGGCGCA TCTGCCGTAC GATTTCCTCG GTCGCGTTTC CAACCGCATT ATCAATGAAG TGAACGGTAT TTCCCGCGTG GTGTATGACA TCAGCGGCAA GCCGCCAGCT ACCATTGAGT GGGAATGA
|
Protein sequence | MTENIHKHRI LILDFGSQYT QLVARRVREL GVYCELWAWD VTEAQIRDFN PSGIILSGGP ESTTEENSPR APQYVFEAGV PVFGVCYGMQ TMAMQLGGHV EASNEREFGY AQVEVVNDSA LVRGIEDALT ADGKPLLDVW MSHGDKVTAI PSDFVTVAST ESCPFAIMAN EEKRFYGVQF HPEVTHTRQG MRMLERFVRD ICQCEALWTP AKIIDDAVAR IREQVGDDKV ILGLSGGVDS SVTAMLLHRA IGKNLTCVFV DNGLLRLNEA EQVLDMFGDH FGLNIVHVPA EDRFLSALAG ENDPEAKRKI IGRVFVEVFD EEALKLEDVK WLAQGTIYPD VIESAASATG KAHVIKSHHN VGGLPKEMKM GLVEPLKELF KDEVRKIGLE LGLPYDMLYR HPFPGPGLGV RVLGEVKKEY CDLLRRADAI FIEELRKADL YDKVSQAFTV FLPVRSVGVM GDGRKYDWVV SLRAVETIDF MTAHWAHLPY DFLGRVSNRI INEVNGISRV VYDISGKPPA TIEWE
|
| |