Gene Ent638_4102 details


Gene Information

Locus tag: Ent638_4102
Symbol:
ID: 5110669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 4459400
End bp: 4462192
Gene Length: 2793 bp
Protein Length: 930 aa
Translation table: 11
GC content: 51%
IMG OID: 640494326
Product: DNA polymerase I
Protein accession: YP_001178807
Protein GI: 146313733
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
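
The start, end, gene length, and protein length fields can be checked against each other. The short Python sketch below does that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4459400
end_bp = 4462192
reported_gene_length = 2793      # bp
reported_protein_length = 930    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2793 0 930
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.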


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000162392
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.404063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCAGA TCCCAGAAAA CCCTCTTATT CTCGTCGATG GCTCTTCTTA CCTTTATCGG 
GCGTATCATG CGTTTCCTCC TCTGACCAAT AGCGCGGGGT TGCCGACCGG CGCGATGTAC
GGCGTGCTGA ACATGCTGCG CAGCCTCATC CTTCAGTATC AGCCATCGCA TGCCGCGGTC
GTTTTTGACG CGAAGGGCAA AACGTTCCGC GATGAACTGT TCGAGCATTA CAAATCCCAT
CGTCCACCTA TGCCGGACGA TTTGCGTGCG CAAATTGAGC CTCTGCATGC CATGGTGAAA
GCAATGGGCT TGCCGTTGCT GGCCGTTTCT GGCGTTGAGG CTGATGACGT CATTGGCACG
CTGGCGCGTG AAGCTGAAAA GTTAGGCCGT CCGGTGCTAA TCAGCACGGG TGATAAAGAC
ATGGCTCAAC TCGTCACCCC TGGTATCACC CTAATCAATA CCATGACAAA TACCATTCTT
GGGCCAGATG AAGTCGCCAC CAAATATGGC GTGCCTCCAG AACTGATTAT CGATTTCCTG
GCCTTGATGG GCGACTCCTC AGATAACATC CCAGGCGTGC CGGGTGTAGG CGAAAAAACC
GCGCAAGCGC TGCTTCAAGG GTTAGGTGGG CTGGACACCC TGTATGCTGA ATCAGATAAA
ATCGCTGGCT TAACATTCCG TGGCGCGAAA ACGATAGCCG CGAAGCTGGA ACAGAACAAA
GAGGTGGCCT ACCTCTCCTA CAAACTGGCG ACCATCAAAA CCGACGTTGA ACTGGAACTG
ACCTGTGAAC AGCTTGAGGT GCAGCAGCCT ATTGCTGAAG AGCTGCTTGG CCTGTTTAAG
CAATATGAAT TCAAGCGCTG GATAACCGAT GTCGAAGCCG GCAAATGGAT GCAAGCTAAG
GGCAGTAAAC CTGCCGCTAA GCCAAAAGAC ACCATCGTCG TCGATGCTGA AGACGAAGTG
GAAGAAGAGG CGACGACGCT CTCTTATGAT AACTATGAAG TGGTGCTTGA AGAGTCTCAG
CTCATTGCCT GGGTAGAAAA ACTGAAAAAA GCGCCCGTTT TTGCTTTTGA TACCGAAACC
GACAGCCTTG ATAACATCTC TGCCAATATG GTGGGTCTGT CATTTGCGAC GGAGCCTGGC
ATGGCGGCTT ACGTTCCTGT CGCGCATGAC TATCTTGATG CCCCTAATCA GATCTCCCGT
GAGCGCGTAC TGGAATTACT GAAACCGCTC CTCGAGGATG AGAAAGCCAA AAAAGTTGGG
CAAAACCTCA AGTTTGACCG CGGCATTTTG CAAAATTACG GCATTGAGCT GCGCGGTATT
GCCTATGACA CCATGCTGGA ATCGTACATT CTGAACAGTG TCGCGGGCCG CCATGATATG
GATTCGCTTT CAGATCGCTG GCTTAAGCAT AAAACCGTCA CCTTTGAAGA GATCGCCGGT
AAAGGCAAGA ATCAACTGAC GTTTAATCAG ATCGCGCTCG AAGAAGCGGG CCGTTACGCG
GCGGAAGACG CCGACGTCAC GCTGCAACTG CATCTGAAAA TGTGGCCAAA ACTGCAAAAG
CAGGAAGGCC CGCTGAATGT ATTTGAGCAT ATTGAAATGC CGCTGGTGCC CGTTATTTCG
CGCATTGAGC GCAATGGCGT CAAAATTGAT CCGGCGGTGT TGCACAAACA TTCTGAGGAG
CTTGCCCTAC GATTGAGTGA GCTTGAGCAA AAAGCCCATG AGATTGCCGG AGAGCCGTTC
AACCTATCCT CCACCAAGCA GTTGCAGACT ATCTTGTTTG AAAAGCAAGG TATCAAGCCG
CTGAAGAAAA CGCCTGGTGG TGCACCATCA ACGTCTGAAG AGGTACTGGA AGAGCTGGCG
CTCGATTACC CATTACCAAA AGTGATTCTG CAATACCGTG GTCTCGCCAA GCTTAAATCG
ACTTATACCG ATAAGCTGCC GTTGATGATT AACCCAAAAA CCAGTCGCGT TCACACCTCT
TATCATCAGG CGGTTGCGGC GACAGGGCGT TTGTCCTCTA CTGAGCCAAA CCTGCAGAAC
ATCCCGGTTC GTAATGAAGA GGGGCGCAGA ATCCGTCAGG CATTTATCGC GCCGAAAGAT
TATCTGATTG TATCTGCCGA CTACTCGCAA ATTGAACTGC GCATCATGGC GCATTTGTCG
CGGGATAAAG GCTTACTGAC CGCATTTGCC GAAGGCAAAG ATATCCATCG CGCGACTGCA
GCAGAAGTTT TTGGTTTGCC ACTGGATAGC GTGACCAACG ACCAGCGCCG TAGTGCGAAA
GCGATCAATT TTGGTCTGAT TTACGGCATG AGCGCGTTTG GCCTTTCTCG CCAGCTAAAT
ATTCCGCGCA AAGAGTCGCA AAAATACATG GACCTCTACT TTGAGCGTTA TCCAGGCGTT
CTGGAATACA TGGAACGCAC GCGCGCGCAG GCGAAGGAAA AAGGTTACGT TGAAACGCTG
GATGGTCGTC GTCTTTATCT CCCTGACATC ACGTCAAGCA ATGCGGCTCG TCGGGCTGGG
GCAGAGCGAG CGGCGATCAA CGCACCGATG CAGGGTACGG CAGCCGACAT CATCAAACGT
GCAATGATTG CAGTAGATGC GTGGCTTGAA AAAGACAAAC CGCGCGTGCG CATGATCATG
CAGGTACACG ATGAACTGGT GTTCGAAGTT CATCATGAAG ATCTGGAAGC CGTTTCTAAA
AAGATCCATG AACTGATGGA AGGCAGCATG ACATTAGATG TGCCGTTGCT GGTGGAAGTA
GGAAGCGGTG AAAACTGGGA CTTGGCTCAC TAA
 
Protein sequence
MVQIPENPLI LVDGSSYLYR AYHAFPPLTN SAGLPTGAMY GVLNMLRSLI LQYQPSHAAV 
VFDAKGKTFR DELFEHYKSH RPPMPDDLRA QIEPLHAMVK AMGLPLLAVS GVEADDVIGT
LAREAEKLGR PVLISTGDKD MAQLVTPGIT LINTMTNTIL GPDEVATKYG VPPELIIDFL
ALMGDSSDNI PGVPGVGEKT AQALLQGLGG LDTLYAESDK IAGLTFRGAK TIAAKLEQNK
EVAYLSYKLA TIKTDVELEL TCEQLEVQQP IAEELLGLFK QYEFKRWITD VEAGKWMQAK
GSKPAAKPKD TIVVDAEDEV EEEATTLSYD NYEVVLEESQ LIAWVEKLKK APVFAFDTET
DSLDNISANM VGLSFATEPG MAAYVPVAHD YLDAPNQISR ERVLELLKPL LEDEKAKKVG
QNLKFDRGIL QNYGIELRGI AYDTMLESYI LNSVAGRHDM DSLSDRWLKH KTVTFEEIAG
KGKNQLTFNQ IALEEAGRYA AEDADVTLQL HLKMWPKLQK QEGPLNVFEH IEMPLVPVIS
RIERNGVKID PAVLHKHSEE LALRLSELEQ KAHEIAGEPF NLSSTKQLQT ILFEKQGIKP
LKKTPGGAPS TSEEVLEELA LDYPLPKVIL QYRGLAKLKS TYTDKLPLMI NPKTSRVHTS
YHQAVAATGR LSSTEPNLQN IPVRNEEGRR IRQAFIAPKD YLIVSADYSQ IELRIMAHLS
RDKGLLTAFA EGKDIHRATA AEVFGLPLDS VTNDQRRSAK AINFGLIYGM SAFGLSRQLN
IPRKESQKYM DLYFERYPGV LEYMERTRAQ AKEKGYVETL DGRRLYLPDI TSSNAARRAG
AERAAINAPM QGTAADIIKR AMIAVDAWLE KDKPRVRMIM QVHDELVFEV HHEDLEAVSK
KIHELMEGSM TLDVPLLVEV GSGENWDLAH
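
The gene and protein sequences above are printed in blocks of ten characters, so the spaces must be removed before any processing. The Python sketch below strips the spacing, translates the first line of the gene sequence, and includes a small GC-content helper. The function names `clean`, `translate`, and `gc_content` are hypothetical and not part of any database tool. The codon table is the standard one; translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may serve as starts, so this is assumed to be sufficient here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate whole codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTCAGA TCCCAGAAAA CCCTCTTATT CTCGTCGATG GCTCTTCTTA CCTTTATCGG"
print(translate(first_line))  # prints: MVQIPENPLILVDGSSYLYR
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the reported GC content.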