Gene EcolC_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0043 
Symbol 
ID6068457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp45542 
End bp47875 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content61% 
IMG OID641599446 
ProductP4 family phage/plasmid primase 
Protein accessionYP_001723056 
Protein GI170018102 
COG category[R] General function prediction only 
COG ID[COG3378] Predicted ATPase 
TIGRFAM ID[TIGR01613] phage/plasmid primase, P4 family, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA ACGTAACCGC CACCGTCAGC CATGCGCTCG GCCACTGGCC GCGTATTCTC 
CCGGCGCTGG GGATTCAGGT GCTGAAGAAC CGTCATCAGC CCTGTCCGGT CTGTGGCGGG
AGTGACCGCT TCCGTTTTGA TGACAGGGAG GGGCGCGGCA CATGGTACTG CAATCAGTGT
GGTGCCGGTG ACGGCCTGAA ACTGGTTGAA AAGGTGTTTG GTGTCTCCCC GTCCGACGCG
GCCGCAAAGG TGGCTGCCGT GACCGGCAGC CTGCCCCCGG CTGACCCGGC AGTGACGGCC
GCCGCCGGTG CTGAAACAGA CGCGGCCCGG AAGAACGCCG CCGCACTGGC ACAAACCCTG
ATGGCAAAAA CCCGTCCCGG AACCGGTAAC GCCTACCTGA CCCGCAAGGG CTTTCCCGGC
CGGGAATGCC GGATGCTGAC CGGCACACAC AGAGCCGGTG GCGTGAGCTG GCGTGCCGGT
GACCTTGTGG TGCCACTGTA TGACGACAGC GGCGAACTGG TTAACCTTCA GTTAATCAGT
GCTGACGGCC GTAAGCGCAC CCTGAAAGGC GGACAGGTCA GGGGCACCTG TCACACCCTT
GAAGGACAGA ATCAGGCCGG AAAACGTCTG TGGATAGCGG AGGGATACGC GACCGCACTT
ACCGTACATC ACCTGACCGG TGAAACGGTG ATGGTGGCGC TTTCTTCAGT GAACCTCCTT
TCTCTGGCCA GCCTTGCCCG GCAGAAGCAT CCGGCCTGTC AGATTGTCCT TGCCGCAGAC
CGTGACCTCA GCGGTGACGG CCAGAAAAAA GCCGCCGCAG CCGCAGATGC GTGTGAGGGC
GTTGTTGCCC TGCCGCCGGT CTTCGGTGAC TGGAATGATG CCTTCACGCA GTACGGCGGG
GAGGCCACCC GTAAGGCCAT TTACGATGCC ATCCGGCCAC CGGCTGAAAG CCCGTTCGAC
ACCATGAGCG AAGCGGAGTT TTCCGCCATG AGTACCAGCG AAAAGGCCAT GCGTATCTAT
GAGCATTACG GCGAGGCGCT CGCGGTCGAT GCCAACGGCC AGCTTCTGTC CCGTTATGAA
AATGGTGTCT GGAAGGTGCT GCCACCACAG GACTTTGCCC GGGATGTGGC CGGGCTGTTT
CAGCGGCTGC GTGCGCCGTT CTCCTCCGGG AAGGTGGCCT CCGTGGTGGA CACCCTGAAG
CTGATTATTC CGCAGCAGGA AGCCCCCTCC CGCCGCCTGA TTGGCTTTCG TAACGGCGTG
CTCGACACGC AGAACGGCAC GTTCCACCCG CACAGTCCGT CACACTGGAT GCGTACCCTG
TGCGATGTGG ATTTCACCCC GCCGGTGGAA GGGGAAACGC TGGAAACCCA CGCCCCCGCG
TTCTGGCGCT GGCTTGACCG TGCCGCCGGT GGCCGTGCGG AAAAACGCGA CGTGATTCTG
GCCGCACTGT TTATGGTGCT GGCAAACCGC TACGACTGGC AGCTCTTTCT GGAGGTGACC
GGTCCCGGCG GCAGCGGCAA AAGTATCATG GCCGAAATAG CCACCCTGCT GGCCGGGGAG
GATAACGCCA CGTCGGCCAC CATCGAGACG CTGGAATCCC CGCGTGAACG TGCCGCGTTA
ACTGGCTTCT CACTGATACG CCTGCCGGAC CAGGAAAAAT GGAGCGGCGA CGGTGCCGGA
CTCAAGGCCA TCACCGGCGG CGATGCGGTG TCCGTTGACC CGAAATACCG GGATGCGTAC
TCCACGCACA TCCCGGCGGT GATTCTGGCC GTGAACAATA ACCCGATGCG CTTCACCGAC
CGCAGCGGCG GCGTGTCACG CCGGCGGGTG ATTATTCACT TCCCGGAACA GATAGCCCCG
CAGGAGCGCG ACCCGCAGCT TAAGGACAAA ATCACCCGCG AGCTGGCGGT CATCGTGCGT
CACCTGATGC AGAAATTCAG CGACCCGATG CTCGCCCGGT CACTGCTTCA GTCCCAGCAA
AACTCAGACG AGGCGCTGAA CATCAAACGG GATGCCGACC CGACGTTTGA TTTTATCGGC
TATCTGGAAA CCCTGCCGCA GACCAGCGGC ATGTATATGG GGAACGCCAG TATCATCCCG
CGTAATTACC GTAAATACCT CTATCACGCC TATCTGGCCT ACATGGAGGC AAACGGCTAC
CGGAACGTAC TCAGTCTGAA AATGTTCGGG CTGGGGCTAC CGGTGATGCT GAAGGAATAC
GGACTGAATT ACGAGAAGCG CCATACCAAA CAGGGGATAC AGACCAACCT GACACTGAAA
GAGGAAAGCT ACGGCGACTG GCTGCCGAAA TGTGACGACC CTGCAACAGC CTGA
 
Protein sequence
MKMNVTATVS HALGHWPRIL PALGIQVLKN RHQPCPVCGG SDRFRFDDRE GRGTWYCNQC 
GAGDGLKLVE KVFGVSPSDA AAKVAAVTGS LPPADPAVTA AAGAETDAAR KNAAALAQTL
MAKTRPGTGN AYLTRKGFPG RECRMLTGTH RAGGVSWRAG DLVVPLYDDS GELVNLQLIS
ADGRKRTLKG GQVRGTCHTL EGQNQAGKRL WIAEGYATAL TVHHLTGETV MVALSSVNLL
SLASLARQKH PACQIVLAAD RDLSGDGQKK AAAAADACEG VVALPPVFGD WNDAFTQYGG
EATRKAIYDA IRPPAESPFD TMSEAEFSAM STSEKAMRIY EHYGEALAVD ANGQLLSRYE
NGVWKVLPPQ DFARDVAGLF QRLRAPFSSG KVASVVDTLK LIIPQQEAPS RRLIGFRNGV
LDTQNGTFHP HSPSHWMRTL CDVDFTPPVE GETLETHAPA FWRWLDRAAG GRAEKRDVIL
AALFMVLANR YDWQLFLEVT GPGGSGKSIM AEIATLLAGE DNATSATIET LESPRERAAL
TGFSLIRLPD QEKWSGDGAG LKAITGGDAV SVDPKYRDAY STHIPAVILA VNNNPMRFTD
RSGGVSRRRV IIHFPEQIAP QERDPQLKDK ITRELAVIVR HLMQKFSDPM LARSLLQSQQ
NSDEALNIKR DADPTFDFIG YLETLPQTSG MYMGNASIIP RNYRKYLYHA YLAYMEANGY
RNVLSLKMFG LGLPVMLKEY GLNYEKRHTK QGIQTNLTLK EESYGDWLPK CDDPATA