Gene EcSMS35_1305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1305 
Symbol 
ID6145524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1293156 
End bp1294664 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content45% 
IMG OID641616183 
Productputative deoxyguanosinetriphosphate triphosphohydrolase 
Protein accessionYP_001743363 
Protein GI170684108 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0227641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATGGC AACAATTGCT TAACTGCAAC CGTCGGAAAG ATAATCTAAA AAAAACCAGC 
ATAGAATTTT TACATCCCCT TGCTAGCGGA AGACAAGAGA TCGAGCGGGA TTACGACCGA
TTACTTTTTG CTGCGCCAAC TCGTCGGCTT GCGGATAAAA CACAAGTTTT CCCGTTAGAT
CAAAATGACA GTATACGGAC GAGGCTGACT CATTCTCACG AAGTAGCAAA CTTTGCTCGC
GGAATAGGAA TGAGACTCGC TTTCGAGATG AAGGAACACA TCTTTGGCGA AATACCCACG
GGGATTGTCG TTGAACGCGA CGTTCCTGCT CTGTTAGCAG CGATCGGACT GGCACACGAT
CTCGGTAATC CACCATTTGG GCATCAAGGT GAAGCAGCGA TGCGCGCATG GTTTAATAAA
AATCTTCCAG GCCTTCTTGA TAAAGAAGTA AACAGCGAAA TATATAACGA CTTTCTGCAA
TTTGATGGAA ACTCACAAAC CCTACGCCTG GTTACCAAAT TACAAATTAT TAACGATAAC
TTTGGTCTGA ATCTGACCTA TGCCACGCTG GCTGCACTGA TCAAATATCC CCGAGCTTCA
TATTCATCGG ACAAACATTG GAAAAAGCAC GGTTTTTTCT ATTCTGAGCA TGAGGTTGTC
ATGGATGTCT GGCAGCAAAC GGGCCTTGGC GTACCAGGTA ATGATGCCGG TTGCCGCCAC
CCATTTACCT GGATAATGGA AGCTTGCGAC GATATTGCCT ACTCTGTGCT TGACGCAGAA
GATACCATCA AGAAAGGATT TGCCTCATAC CAGGATTTGC GTGATTTTTT GATGTCACAA
TCCGAGCCGA ACGATGTAAT TCAACATGTT ATTGAAGCTG TAGAGGCCAA AAATGACGCC
TGGAAAGAGC AAAGAAATGC ACTTTCACCG GCGGAAATTA ATGAACTCAG TATGCAAATG
TTCAGAGTTT ATGCCACCAG TGCATTGATT AATGCCGTAG TAGAAGCCTT TCGCGAACAA
TTACCCACGC TGATGCAGCG TGACTGCCCG TTCAAAGATT TAATCTCGCA AAGCAAAGGT
CACGCTCTCT GCACTGCCCT GAAAATGTTC GACCGAACAC GAGGCTACCT TCATCGTTCA
GTCTTGCAAC TGGAACTTAG GGGATCTAAT TACATTACTA ACTTGATGGA TATGCTCTGG
ATTGGCATCC ATGGACACAA AACAAAGAAA CAACGAGACG GAGAACGCGA GTCCGCGTCT
GCAGATAACC TTTTTGTTTC TCGTGAAGAA TACCTGTCGG ATACTCCCTT TGGTCGCTAT
GCTTACGGGA CAATCTCTGA AAATTATCGC CGTGTGTTTG AAGATCCCGC CAATACACTC
CCTGCTCTTT ATAAAGAAGC GCAGTTGCTT ACCGACGCAA TCTCAGGCAT GACGGATAGT
TATTTAATGC GCCTTCACGA TGAGCTAAAA TCACTGTATG AATGTGGTCT CAAACACCCA
ACAGCTTGA
 
Protein sequence
MEWQQLLNCN RRKDNLKKTS IEFLHPLASG RQEIERDYDR LLFAAPTRRL ADKTQVFPLD 
QNDSIRTRLT HSHEVANFAR GIGMRLAFEM KEHIFGEIPT GIVVERDVPA LLAAIGLAHD
LGNPPFGHQG EAAMRAWFNK NLPGLLDKEV NSEIYNDFLQ FDGNSQTLRL VTKLQIINDN
FGLNLTYATL AALIKYPRAS YSSDKHWKKH GFFYSEHEVV MDVWQQTGLG VPGNDAGCRH
PFTWIMEACD DIAYSVLDAE DTIKKGFASY QDLRDFLMSQ SEPNDVIQHV IEAVEAKNDA
WKEQRNALSP AEINELSMQM FRVYATSALI NAVVEAFREQ LPTLMQRDCP FKDLISQSKG
HALCTALKMF DRTRGYLHRS VLQLELRGSN YITNLMDMLW IGIHGHKTKK QRDGERESAS
ADNLFVSREE YLSDTPFGRY AYGTISENYR RVFEDPANTL PALYKEAQLL TDAISGMTDS
YLMRLHDELK SLYECGLKHP TA