Gene Information |
Locus tag | EcSMS35_1305 |
Symbol | |
ID | 6145524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1293156 |
End bp | 1294664 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641616183 |
Product | putative deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_001743363 |
Protein GI | 170684108 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0232] dGTP triphosphohydrolase |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
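
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(1293156, 1294664)
print(length, protein_length(length))  # 1509 502
```

With the values in this record, the result agrees with the Gene Length (1509 bp) and Protein Length (502 aa) rows.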
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0227641 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATGGC AACAATTGCT TAACTGCAAC CGTCGGAAAG ATAATCTAAA AAAAACCAGC ATAGAATTTT TACATCCCCT TGCTAGCGGA AGACAAGAGA TCGAGCGGGA TTACGACCGA TTACTTTTTG CTGCGCCAAC TCGTCGGCTT GCGGATAAAA CACAAGTTTT CCCGTTAGAT CAAAATGACA GTATACGGAC GAGGCTGACT CATTCTCACG AAGTAGCAAA CTTTGCTCGC GGAATAGGAA TGAGACTCGC TTTCGAGATG AAGGAACACA TCTTTGGCGA AATACCCACG GGGATTGTCG TTGAACGCGA CGTTCCTGCT CTGTTAGCAG CGATCGGACT GGCACACGAT CTCGGTAATC CACCATTTGG GCATCAAGGT GAAGCAGCGA TGCGCGCATG GTTTAATAAA AATCTTCCAG GCCTTCTTGA TAAAGAAGTA AACAGCGAAA TATATAACGA CTTTCTGCAA TTTGATGGAA ACTCACAAAC CCTACGCCTG GTTACCAAAT TACAAATTAT TAACGATAAC TTTGGTCTGA ATCTGACCTA TGCCACGCTG GCTGCACTGA TCAAATATCC CCGAGCTTCA TATTCATCGG ACAAACATTG GAAAAAGCAC GGTTTTTTCT ATTCTGAGCA TGAGGTTGTC ATGGATGTCT GGCAGCAAAC GGGCCTTGGC GTACCAGGTA ATGATGCCGG TTGCCGCCAC CCATTTACCT GGATAATGGA AGCTTGCGAC GATATTGCCT ACTCTGTGCT TGACGCAGAA GATACCATCA AGAAAGGATT TGCCTCATAC CAGGATTTGC GTGATTTTTT GATGTCACAA TCCGAGCCGA ACGATGTAAT TCAACATGTT ATTGAAGCTG TAGAGGCCAA AAATGACGCC TGGAAAGAGC AAAGAAATGC ACTTTCACCG GCGGAAATTA ATGAACTCAG TATGCAAATG TTCAGAGTTT ATGCCACCAG TGCATTGATT AATGCCGTAG TAGAAGCCTT TCGCGAACAA TTACCCACGC TGATGCAGCG TGACTGCCCG TTCAAAGATT TAATCTCGCA AAGCAAAGGT CACGCTCTCT GCACTGCCCT GAAAATGTTC GACCGAACAC GAGGCTACCT TCATCGTTCA GTCTTGCAAC TGGAACTTAG GGGATCTAAT TACATTACTA ACTTGATGGA TATGCTCTGG ATTGGCATCC ATGGACACAA AACAAAGAAA CAACGAGACG GAGAACGCGA GTCCGCGTCT GCAGATAACC TTTTTGTTTC TCGTGAAGAA TACCTGTCGG ATACTCCCTT TGGTCGCTAT GCTTACGGGA CAATCTCTGA AAATTATCGC CGTGTGTTTG AAGATCCCGC CAATACACTC CCTGCTCTTT ATAAAGAAGC GCAGTTGCTT ACCGACGCAA TCTCAGGCAT GACGGATAGT TATTTAATGC GCCTTCACGA TGAGCTAAAA TCACTGTATG AATGTGGTCT CAAACACCCA ACAGCTTGA
|
Protein sequence | MEWQQLLNCN RRKDNLKKTS IEFLHPLASG RQEIERDYDR LLFAAPTRRL ADKTQVFPLD QNDSIRTRLT HSHEVANFAR GIGMRLAFEM KEHIFGEIPT GIVVERDVPA LLAAIGLAHD LGNPPFGHQG EAAMRAWFNK NLPGLLDKEV NSEIYNDFLQ FDGNSQTLRL VTKLQIINDN FGLNLTYATL AALIKYPRAS YSSDKHWKKH GFFYSEHEVV MDVWQQTGLG VPGNDAGCRH PFTWIMEACD DIAYSVLDAE DTIKKGFASY QDLRDFLMSQ SEPNDVIQHV IEAVEAKNDA WKEQRNALSP AEINELSMQM FRVYATSALI NAVVEAFREQ LPTLMQRDCP FKDLISQSKG HALCTALKMF DRTRGYLHRS VLQLELRGSN YITNLMDMLW IGIHGHKTKK QRDGERESAS ADNLFVSREE YLSDTPFGRY AYGTISENYR RVFEDPANTL PALYKEAQLL TDAISGMTDS YLMRLHDELK SLYECGLKHP TA
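
The Gene sequence and Protein sequence rows can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python. It makes three assumptions. First, the 10-character grouping is purely cosmetic. Second, the gene sequence is already in coding orientation, since it begins with ATG even though the gene lies on the minus strand. Third, the amino-acid assignments of translation table 11 match the standard genetic code, because the two tables differ only in permitted start codons. The helper names are hypothetical.

```python
# Standard codon table in NCBI order (T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the Gene sequence row above.
prefix = "ATGGAATGGC AACAATTGCT TAACTGCAAC"
print(translate(prefix))  # MEWQQLLNCN
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the Protein sequence row. Passing the full Gene sequence value to the same functions would be the way to check the whole record.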