Gene Information |
Locus tag | EcSMS35_4143 |
Symbol | gppA |
ID | 6145004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4242392 |
End bp | 4243876 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618966 |
Product | guanosine pentaphosphate phosphohydrolase |
Protein accession | YP_001746098 |
Protein GI | 170681890 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
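The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state outright: that Start bp and End bp are 1-based inclusive positions, and that the CDS length counts one terminal stop codon (the gene sequence listed in this record does end in TAA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4242392
END_BP = 4243876
GENE_LENGTH_BP = 1485
PROTEIN_LENGTH_AA = 494


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, 4243876 - 4242392 + 1 gives 1485 bp, and 1485 / 3 - 1 gives 494 aa, matching the Gene Length and Protein Length fields.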
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.263273 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCCA CCTCGTCGCT GTATGCAGCC ATTGATCTCG GTTCGAATAG TTTTCATATG CTGGTTGTGC GCGAGGTGGC TGGAAGCATC CAGACGCTGA CGCGAATTAA ACGCAAAGTG CGTCTGGCTG CTGGCCTGAA TAGCGAAAAT GCCCTGTCTA ATGAAGCAAT GGAGCGCGGT TGGCAATGTC TGCGCCTGTT TGCTGAACGT CTGCAAGATA TCCCTCCCTC GCAAATTCGC GTTGTCGCTA CGGCGACATT ACGACTAGCC GTCAATGCGG GTGATTTTAT CGCCAAAGCA CAGGAAATCC TCGGTTGTCC GGTACAGGTG ATCAGCGGTG AAGAGGAAGC ACGTCTGATT TATCAGGGCG TTGCTCACAC AACTGGCGGT GCCGATCAGC GCCTGGTGGT GGATATAGGC GGTGCCAGTA CGGAACTGGT AACCGGCACG GGTGCACAAA CCACCTCGTT GTTCAGCCTG TCGATGGGCT GCGTCACCTG GCTGGAACGC TATTTTGCCG ATCGTAATCT GGGGCAGGAA AATTTTGATG CTGCAGAAAA AGCGGCACGC GAAGTGTTAC GTCCGGTTGC CGATGAATTA CGGTATCACG GCTGGAAAGT GTGCGTTGGC GCTTCCGGCA CCGTGCAGGC GTTACAGGAA ATCATGATGG CACAGGGGAT GGATGAACGC ATTACCCTGG AAAAGTTGCA GCAACTGAAA CAACGCGCCA TTCATTGCGG TCGGCTGGAA GAGCTAGAGA TTGACGGGCT GACGCTGGAA CGTGCGTTAG TGTTCCCGAG TGGTCTGGCG ATCCTGATAG CCATTTTTAC CGAACTGAAT ATTCAGTGTA TGACCCTGGC GGGCGGTGCG CTGCGTGAAG GCCTGGTCTA CGGTATGTTA CATCTTACCG TCGAGCAGGA TATTCGCAGC CGTACGCTGC GTAATATTCA GCGCCGCTTT ATGATCGATA TTGATCAGGC ACAGCGCGTA GCCAAAGTTG CGGCTAACTT CTTCGATCAG GTGGAAAATG AATGGCATCT TGAAGCAATA AGCCGCGATT TGCTCATCAG CGCCTGCCAG CTTCATGAAA TCGGCCTGAG CGTTGACTTC AAACAAGCGC CGCAACACGC TGCTTATCTG GTACGTAATC TGGATCTTCC CGGTTTTACC CCCGCACAGA AAAAACTGCT GGCGACGCTA CTGCTCAACC AGACTAATCC GGTCGATCTC TCATCGCTGC ATCAGCAAAA TGCCGTACCA CCGCGCGTCG CAGAACAACT CTGCCGTTTA CTCCGCCTGG CTATCATTTT TGCCAGCCGT CGCCGTGACG ATCTCGTGCC AGAGATGACA TTACAGGCTA ACCATGAACT GTTGACCTTG ACGCTTCCGC AAGGTTGGCT AACCCAACAT CCGCTGGGTA AAGAGATTAT TGATCAGGAA AGCCAGTGGC AGAGCTATGT CCACTGGCCG CTGGAAGTGC ATTAA
|
Protein sequence | MGSTSSLYAA IDLGSNSFHM LVVREVAGSI QTLTRIKRKV RLAAGLNSEN ALSNEAMERG WQCLRLFAER LQDIPPSQIR VVATATLRLA VNAGDFIAKA QEILGCPVQV ISGEEEARLI YQGVAHTTGG ADQRLVVDIG GASTELVTGT GAQTTSLFSL SMGCVTWLER YFADRNLGQE NFDAAEKAAR EVLRPVADEL RYHGWKVCVG ASGTVQALQE IMMAQGMDER ITLEKLQQLK QRAIHCGRLE ELEIDGLTLE RALVFPSGLA ILIAIFTELN IQCMTLAGGA LREGLVYGML HLTVEQDIRS RTLRNIQRRF MIDIDQAQRV AKVAANFFDQ VENEWHLEAI SRDLLISACQ LHEIGLSVDF KQAPQHAAYL VRNLDLPGFT PAQKKLLATL LLNQTNPVDL SSLHQQNAVP PRVAEQLCRL LRLAIIFASR RRDDLVPEMT LQANHELLTL TLPQGWLTQH PLGKEIIDQE SQWQSYVHWP LEVH