Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4012 |
Symbol | |
ID | 6147438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4089863 |
End bp | 4092196 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641618836 |
Product | D5 family nucleoside triphosphatase |
Protein accession | YP_001745974 |
Protein GI | 170679915 |
COG category | [R] General function prediction only |
COG ID | [COG3378] Predicted ATPase |
TIGRFAM ID | [TIGR01613] phage/plasmid primase, P4 family, C-terminal domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA ACGTAACCGC CACCGTCAGC CATGCGCTCG GCCACTGGCC GCGTATCCTC CCGGCGCTGG GGATTCAGGT GCTGAAAAAC CGTCATCAGC CCTGTCCGGT CTGTGGCGGG AGTGACCGCT TCCGTTTTGA TGACAGGGAG GGGCGCGGCA CCTGGTACTG CAATCAGTGT GGTGCCGGTG ACGGCCTGAA ACTGGTTGAA AAGGTGTTTG GTGTCTCCCC GTCCGACGCG GCCACAAAGG TGGCTGCCGT GACCGGCAGC CTGCCCCCGG CTGACCCGGC AGTGACGGCC GCCGCCGGTG CTGAAACAGA CGCTGCCCGG AAGAACGCCG CCGCACTGGC ACAAACCCTG ATGGCGAAAA CCCGTACCGG AACCGGTAAC GCCTACCTGA CCCGCAAGGG CTTTCCCGGC CGGGAATGCC GGATGCTGAC CGGCACACAC AGAGCCGGTG GCGTGAGCTG GCGTGCCGGT GACCTTGTGG TGCCACTGTA TGACGACAGC GGCGAACTGG TTAACCTTCA GTTAATCAGT GCTGACGGCC GTAAGCGCAC CCTGAAAGGC GGACAGGTCA GGGGCACCTG TCACACCCTT GAAGGACAGA ATCAGGCCGG AAAACGTCTG TGGATAGCGG AGGGATACGC GACCGCACTT ACCGTGCATC ACCTGACCGG TGAAACGGTG ATGGTGGCGC TTTCTTCCGT GAACCTCCTT TCTCTGGCCA GCCTTGCCCG GCAGAAGCAT CCGGCCTGTC AGATTGTCCT TGCCGCAGAC CGTGACCTCA GCGGTGACGG CCAGAAAAAA GCCGCCGCAG CCGCAGATGC GTGTGAGGGC GTTGTTGCCC TGCCGCCGGT CTTCGGTGAC TGGAATGATG CCTTCACGCA GTACGGCGGG GAAGCCACCC GTAAGGCCAT TTACGATGCC ATCCGGCCAC CGGCTGAAAG CCCGTTCGAC ACCATGAGCG AAGCAGAGTT TTCCGCCATG AGTACCAGCG AAAAGGCCAT GCGTATCTAT GAGCATTACG GTGAGGCGCT CGCGGTCGAT GCCAACGGCC AGCTTCTGTC CCGCTATGAA AATGGTGTCT GGAAGGTGCT GCCGCCACAG GACTTTGCCC GGGATGTGGC CGGGCTGTTT CAGCGTCTGC GCGCGCCGTT CTCCTCCGGG AAGGTGGCCT CCGTGGTGGA CACCCTGAAG CTGATTATTC CACAGCAGGA AGCCCCCTCC CGCCGCCTGA TTGGCTTTCG TAACGGCGTG CTCGACACGC AGAACGGCAC GTTCCACCCG CACAGTCCGT CACACTGGAT GCGCACCCTG TGCGATGTGG ATTTCACCCC GCCGGTGGAC GGTGAAACGC TGGAAACCCA CGCTCCCGCG TTCTGGCGCT GGCTTGACCG TGCTGCCGGT GGTCGTGCGG AAAAACGCGA CGTAATTCTG GCCGCACTGT TTATGGTGCT GGCAAACCGC TACGACTGGC AGCTCTTTCT GGAGGTGACC GGTCCCGGCG GCAGCGGCAA AAGTATCATG GCCGAAATAG CCACCCTGCT GGCCGGGGAG GATAACGCCA CGTCGGCCAC CATTGAGACG CTGGAATCCC CGCGTGAACG TGCCGCGTTA ACTGGCTTCT CACTGATACG CCTGCCGGAC CAGGAAAAAT GGAGCGGCGA CGGTGCCGGA CTCAAGGCCA TCACCGGCGG CGATGCGGTG TCCGTGGACC CGAAATACCG GGATGCGTAC TCCACGCATA TCCCGGCGGT GATTCTGGCC GTGAACAATA ACCCGATGCG CTTCACCGAC CGCAGCGGCG GCGTGTCACG CCGGCGGGTG ATTATTCACT TCCCGGAACA GATAGCCCCG CAGGAGCGCG ACCCGCAGCT TAAGGACAAA ATCACCCGCG AGCTGGCGGT CATCGTGCGT CACCTGATGC AGAAGTTCAG CGACCCGATG CTCGCCCGGT CACTGCTTCA GTCCCAGCAG AACTCAGACG AGGCACTGAA CATCAAACGG GATGCCGACC CGACGTTTGA TTTTATCGGC TATCTGGAAA CCCTGCCGCA GACCAGCGGC ATGTATATGG GGAACGCCAG TATCATCCCG CGTAATTACC GTAAATACCT CTATCACGCC TATCTGGCCT ACATGGAGGC AAACGGCTAC CGGAATGTAC TCAGTCTGAA AATGTTCGGG CTGGGGCTGC CGGTGATGCT GAAGGAATAC GGACTGAATT ACGAGAAGCG CCATACCAAA CAGGGGATAC AGACCAACCT GACGCTGAAA GAGGAAAGCT ACGGCGACTG GCTGCCAAAA TGTGACGACC CTGCGACAGC CTGA
|
Protein sequence | MKMNVTATVS HALGHWPRIL PALGIQVLKN RHQPCPVCGG SDRFRFDDRE GRGTWYCNQC GAGDGLKLVE KVFGVSPSDA ATKVAAVTGS LPPADPAVTA AAGAETDAAR KNAAALAQTL MAKTRTGTGN AYLTRKGFPG RECRMLTGTH RAGGVSWRAG DLVVPLYDDS GELVNLQLIS ADGRKRTLKG GQVRGTCHTL EGQNQAGKRL WIAEGYATAL TVHHLTGETV MVALSSVNLL SLASLARQKH PACQIVLAAD RDLSGDGQKK AAAAADACEG VVALPPVFGD WNDAFTQYGG EATRKAIYDA IRPPAESPFD TMSEAEFSAM STSEKAMRIY EHYGEALAVD ANGQLLSRYE NGVWKVLPPQ DFARDVAGLF QRLRAPFSSG KVASVVDTLK LIIPQQEAPS RRLIGFRNGV LDTQNGTFHP HSPSHWMRTL CDVDFTPPVD GETLETHAPA FWRWLDRAAG GRAEKRDVIL AALFMVLANR YDWQLFLEVT GPGGSGKSIM AEIATLLAGE DNATSATIET LESPRERAAL TGFSLIRLPD QEKWSGDGAG LKAITGGDAV SVDPKYRDAY STHIPAVILA VNNNPMRFTD RSGGVSRRRV IIHFPEQIAP QERDPQLKDK ITRELAVIVR HLMQKFSDPM LARSLLQSQQ NSDEALNIKR DADPTFDFIG YLETLPQTSG MYMGNASIIP RNYRKYLYHA YLAYMEANGY RNVLSLKMFG LGLPVMLKEY GLNYEKRHTK QGIQTNLTLK EESYGDWLPK CDDPATA
|
| |