Gene EcSMS35_0946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0946 
Symbol 
ID6146051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp959894 
End bp961711 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content51% 
IMG OID641615833 
ProductD5 family nucleoside triphosphatase 
Protein accessionYP_001743025 
Protein GI170680147 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG CGCCAAATTT AAAACACCAG CCGCGTGACA AAATGACGGA AGTCATCATT 
TTTGCGGGTA GTGATGCGTG GGCACATGCG AAACAGTGGC AGGAACAGGA CGGGCGACTG
GCTGGCGATA ACGTGCCACC TGTCTGGCTT GGAGAGCAAC AACTTGCCGA ACTGGACAAC
CTGCAAATCG TACCGGACGG ACGCTATCGC GTGCGTCTCT ATCAGGCGGG GTTATTGCGT
CCGGGGCTTG TTAATACCAT CGGGCAGAAA CTGGCAGTGG CAGGTGTCAG GGATGCTGAT
TATTACCCTG AAGGAATGCA CAGCCAGAAA CGGGAGAACT GGCGCGAATA TCTGGAACGT
GAACGGGCAG AGCAGGCGGA AAAGAAAAAG GTAGTTGAAC TGCCTGTAAA GAAAAAAGAG
CGGGTAAAAG ACGATAACGC TTCATCACTG GCGCTTAACC AGATGGGAGC AAGTCAACGC
GGCGAAGTTC TCCTGGCACA TTATGGCGGT GAACTGGCGA TTCATGCTGA CTCTGACACT
GTTCACCATT ACAACGGCGT TGTATGGGAG CCAGTACAGG ATAAAGAATT ACAGCGAGCT
ATGGCACAGA TTTTCATTGA TGCGGAGATC AGCTATTCGC AGAACGCCAT TAAATCGGCG
GTCGATACCA TGAAGTTAAG TTTGCCTGTA ATGGGGAATA CAGCCCGTAA CCTGATTGGA
TTCAGTAACG GGGTATTTGA TACCAGAACA GGTAATTTTC GGGAGCATAA CAAAAACGAC
TGGTTGTTAA TTGCCAGTGA ATTACCTTTC AGCCCACCAG CAGAGGGGGA AACGCTGGCA
ACACATGCGC CGAATTTCTG GAAGTGGTTA CGCCGTTCGG TGGCTGAGAA TGACCGCAAG
GCGGATCGCG TACTGGCTGC ATTATTCATG GTGCTGGCGA ACCGGTACGA CTGGCAGTTA
TTCATTGAGG TAACAGGTCC AGGGGGAAGT GGTAAAAGCG TGATGGCGGA GATTTGCACC
ATGCTGGCGG GTAAGGCCAA CACAGTATCG GCAAGCATGA AGGCGCTGGA AGATGCAAGG
GAACGCGCGT TAGTGGTTGG CTTTTCGCTG ATTATCATGC CGGATATGAC CCGCTACGCT
GGTGATGGGG CAGGGATTAA GGCTATTACA GGCGGTGACA AGGTGGCAAT TGACCCGAAA
CACAAAGCCC CCTACTCAAC GCGTATTCCG GCAGTAGTGC TGGCGGTTAA CAATAACGCC
ATGTCATTCA GTGACCGCAG CGGGGGGATC TCACGTCGTC GGGTGATATT CAATTTTTCG
GAAGTTGTAC CGGAGAACGA ACGCGATCCA ATGCTGGCGG AAAAAATAGA AGGTGAGCTG
GCGGTAGTGA TTCGCCATCT GCTTACACGG TTTGCTGACC AGGACGAAGC CAGACGCCTG
TTATATGAGC AGCAGAAATC TGAAGAAGCA CTGGCGATAA AGCGAGAGGG GGATTCGCTG
GTGGACTTCT GCGGCTATCT CATGGCGTCG GTAATGTGTG ATGGCCTGTT AGTGGGTAAT
GCTGAAATTG TGCCATTCAG CCCACGCAGG TATCTCTATC ATGCCTATCT GGCTTATATG
AGGGCACATG GGTTTGGTAA ACCTGTAACA CTGACGCGCT TCGGTAAAGA TATGCCGGGG
GCAATGGCGG AATATGGCAG GGAGTATATG AAACGGAAAA CGAAGCACGG TTTTCGTTCA
AACGTGACAC TGACGGAGGA ATCAGAAGAC TGGATGCCAT CATGTGTATC GGTCACTAAT
GACGATAGCA AAAATTAA
 
Protein sequence
MKKAPNLKHQ PRDKMTEVII FAGSDAWAHA KQWQEQDGRL AGDNVPPVWL GEQQLAELDN 
LQIVPDGRYR VRLYQAGLLR PGLVNTIGQK LAVAGVRDAD YYPEGMHSQK RENWREYLER
ERAEQAEKKK VVELPVKKKE RVKDDNASSL ALNQMGASQR GEVLLAHYGG ELAIHADSDT
VHHYNGVVWE PVQDKELQRA MAQIFIDAEI SYSQNAIKSA VDTMKLSLPV MGNTARNLIG
FSNGVFDTRT GNFREHNKND WLLIASELPF SPPAEGETLA THAPNFWKWL RRSVAENDRK
ADRVLAALFM VLANRYDWQL FIEVTGPGGS GKSVMAEICT MLAGKANTVS ASMKALEDAR
ERALVVGFSL IIMPDMTRYA GDGAGIKAIT GGDKVAIDPK HKAPYSTRIP AVVLAVNNNA
MSFSDRSGGI SRRRVIFNFS EVVPENERDP MLAEKIEGEL AVVIRHLLTR FADQDEARRL
LYEQQKSEEA LAIKREGDSL VDFCGYLMAS VMCDGLLVGN AEIVPFSPRR YLYHAYLAYM
RAHGFGKPVT LTRFGKDMPG AMAEYGREYM KRKTKHGFRS NVTLTEESED WMPSCVSVTN
DDSKN