Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0043 |
Symbol | |
ID | 6068457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 45542 |
End bp | 47875 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641599446 |
Product | P4 family phage/plasmid primase |
Protein accession | YP_001723056 |
Protein GI | 170018102 |
COG category | [R] General function prediction only |
COG ID | [COG3378] Predicted ATPase |
TIGRFAM ID | [TIGR01613] phage/plasmid primase, P4 family, C-terminal domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA ACGTAACCGC CACCGTCAGC CATGCGCTCG GCCACTGGCC GCGTATTCTC CCGGCGCTGG GGATTCAGGT GCTGAAGAAC CGTCATCAGC CCTGTCCGGT CTGTGGCGGG AGTGACCGCT TCCGTTTTGA TGACAGGGAG GGGCGCGGCA CATGGTACTG CAATCAGTGT GGTGCCGGTG ACGGCCTGAA ACTGGTTGAA AAGGTGTTTG GTGTCTCCCC GTCCGACGCG GCCGCAAAGG TGGCTGCCGT GACCGGCAGC CTGCCCCCGG CTGACCCGGC AGTGACGGCC GCCGCCGGTG CTGAAACAGA CGCGGCCCGG AAGAACGCCG CCGCACTGGC ACAAACCCTG ATGGCAAAAA CCCGTCCCGG AACCGGTAAC GCCTACCTGA CCCGCAAGGG CTTTCCCGGC CGGGAATGCC GGATGCTGAC CGGCACACAC AGAGCCGGTG GCGTGAGCTG GCGTGCCGGT GACCTTGTGG TGCCACTGTA TGACGACAGC GGCGAACTGG TTAACCTTCA GTTAATCAGT GCTGACGGCC GTAAGCGCAC CCTGAAAGGC GGACAGGTCA GGGGCACCTG TCACACCCTT GAAGGACAGA ATCAGGCCGG AAAACGTCTG TGGATAGCGG AGGGATACGC GACCGCACTT ACCGTACATC ACCTGACCGG TGAAACGGTG ATGGTGGCGC TTTCTTCAGT GAACCTCCTT TCTCTGGCCA GCCTTGCCCG GCAGAAGCAT CCGGCCTGTC AGATTGTCCT TGCCGCAGAC CGTGACCTCA GCGGTGACGG CCAGAAAAAA GCCGCCGCAG CCGCAGATGC GTGTGAGGGC GTTGTTGCCC TGCCGCCGGT CTTCGGTGAC TGGAATGATG CCTTCACGCA GTACGGCGGG GAGGCCACCC GTAAGGCCAT TTACGATGCC ATCCGGCCAC CGGCTGAAAG CCCGTTCGAC ACCATGAGCG AAGCGGAGTT TTCCGCCATG AGTACCAGCG AAAAGGCCAT GCGTATCTAT GAGCATTACG GCGAGGCGCT CGCGGTCGAT GCCAACGGCC AGCTTCTGTC CCGTTATGAA AATGGTGTCT GGAAGGTGCT GCCACCACAG GACTTTGCCC GGGATGTGGC CGGGCTGTTT CAGCGGCTGC GTGCGCCGTT CTCCTCCGGG AAGGTGGCCT CCGTGGTGGA CACCCTGAAG CTGATTATTC CGCAGCAGGA AGCCCCCTCC CGCCGCCTGA TTGGCTTTCG TAACGGCGTG CTCGACACGC AGAACGGCAC GTTCCACCCG CACAGTCCGT CACACTGGAT GCGTACCCTG TGCGATGTGG ATTTCACCCC GCCGGTGGAA GGGGAAACGC TGGAAACCCA CGCCCCCGCG TTCTGGCGCT GGCTTGACCG TGCCGCCGGT GGCCGTGCGG AAAAACGCGA CGTGATTCTG GCCGCACTGT TTATGGTGCT GGCAAACCGC TACGACTGGC AGCTCTTTCT GGAGGTGACC GGTCCCGGCG GCAGCGGCAA AAGTATCATG GCCGAAATAG CCACCCTGCT GGCCGGGGAG GATAACGCCA CGTCGGCCAC CATCGAGACG CTGGAATCCC CGCGTGAACG TGCCGCGTTA ACTGGCTTCT CACTGATACG CCTGCCGGAC CAGGAAAAAT GGAGCGGCGA CGGTGCCGGA CTCAAGGCCA TCACCGGCGG CGATGCGGTG TCCGTTGACC CGAAATACCG GGATGCGTAC TCCACGCACA TCCCGGCGGT GATTCTGGCC GTGAACAATA ACCCGATGCG CTTCACCGAC CGCAGCGGCG GCGTGTCACG CCGGCGGGTG ATTATTCACT TCCCGGAACA GATAGCCCCG CAGGAGCGCG ACCCGCAGCT TAAGGACAAA ATCACCCGCG AGCTGGCGGT CATCGTGCGT CACCTGATGC AGAAATTCAG CGACCCGATG CTCGCCCGGT CACTGCTTCA GTCCCAGCAA AACTCAGACG AGGCGCTGAA CATCAAACGG GATGCCGACC CGACGTTTGA TTTTATCGGC TATCTGGAAA CCCTGCCGCA GACCAGCGGC ATGTATATGG GGAACGCCAG TATCATCCCG CGTAATTACC GTAAATACCT CTATCACGCC TATCTGGCCT ACATGGAGGC AAACGGCTAC CGGAACGTAC TCAGTCTGAA AATGTTCGGG CTGGGGCTAC CGGTGATGCT GAAGGAATAC GGACTGAATT ACGAGAAGCG CCATACCAAA CAGGGGATAC AGACCAACCT GACACTGAAA GAGGAAAGCT ACGGCGACTG GCTGCCGAAA TGTGACGACC CTGCAACAGC CTGA
|
Protein sequence | MKMNVTATVS HALGHWPRIL PALGIQVLKN RHQPCPVCGG SDRFRFDDRE GRGTWYCNQC GAGDGLKLVE KVFGVSPSDA AAKVAAVTGS LPPADPAVTA AAGAETDAAR KNAAALAQTL MAKTRPGTGN AYLTRKGFPG RECRMLTGTH RAGGVSWRAG DLVVPLYDDS GELVNLQLIS ADGRKRTLKG GQVRGTCHTL EGQNQAGKRL WIAEGYATAL TVHHLTGETV MVALSSVNLL SLASLARQKH PACQIVLAAD RDLSGDGQKK AAAAADACEG VVALPPVFGD WNDAFTQYGG EATRKAIYDA IRPPAESPFD TMSEAEFSAM STSEKAMRIY EHYGEALAVD ANGQLLSRYE NGVWKVLPPQ DFARDVAGLF QRLRAPFSSG KVASVVDTLK LIIPQQEAPS RRLIGFRNGV LDTQNGTFHP HSPSHWMRTL CDVDFTPPVE GETLETHAPA FWRWLDRAAG GRAEKRDVIL AALFMVLANR YDWQLFLEVT GPGGSGKSIM AEIATLLAGE DNATSATIET LESPRERAAL TGFSLIRLPD QEKWSGDGAG LKAITGGDAV SVDPKYRDAY STHIPAVILA VNNNPMRFTD RSGGVSRRRV IIHFPEQIAP QERDPQLKDK ITRELAVIVR HLMQKFSDPM LARSLLQSQQ NSDEALNIKR DADPTFDFIG YLETLPQTSG MYMGNASIIP RNYRKYLYHA YLAYMEANGY RNVLSLKMFG LGLPVMLKEY GLNYEKRHTK QGIQTNLTLK EESYGDWLPK CDDPATA
|
| |