Gene Cphamn1_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0540 
Symbol 
ID6374204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp568792 
End bp571602 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content49% 
IMG OID642683057 
ProductDNA polymerase I 
Protein accessionYP_001958984 
Protein GI189499514 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.560108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTCGT TGCATAAGAG CGATCAGAGT CAACTGTCTT TTGAGATGAG CACGACAGCG 
ATCACAGAAA AACCGCAAGG AAGCAAAAAA CCCTCCCTTT TTCTGCTTGA CGGCATGGCG
CTGGTCTACC GGTCATTTTT CGCCCTCCAG CGCGCAGGAA TGTCAACAAA AGACGGAATT
CCGACAGGCG CCGCTTACGG ATTTCTCATG ACCCTGCTGA AAATATTTGA AACCGGTAAG
CCTGAATATC TTGCTGTTGC TTTTGACAGT AAGGAAAAAA CATTCCGGCA TGAGCTCTAT
GCTCCATACA AGGCGAACCG TCCGGAACCG CCGGAAGACC TTATTGCGCA ACTGGAGCTG
ATTTTCAAGC TTGTCGAAGC GTTCCGGATC CCGATTCTCA AGCAACCCGG TTATGAAGCC
GACGATCTTA TAGGAAGTGC CGTCAGGCAG TTCGAGAAAC AGTGCGAGAT AGTTATCGTC
ACACCGGACA AGGATCTCGC GCAACTTGTC AATGAAGGGG TTTCGATCCT TAAACCGGGC
AGAAACAGGA ACGAACTTGA TACCATGGGC TGTGATGAGG TCAAAAAACA GTTTGGTGTG
CCGCCTGAAT GTTTTATCGA TTTTCTGACC CTGACCGGGG ACAGTTCGGA CAACATACCC
GGCGCGAAAG GAATCGGACC GAAAACCGCC TCAAAGTTGC TCACGACCTA TGGTACTCTG
GAAAATGTCC TTGCCCATAT CGAGGAACTT GCCCCCAGGT CACGTAAAAG TCTTGAAGAG
TTCGAGACAA ACCGTAAGCT TATCCAGGCT CTTGTTACGA TAAGGACAGA TATCGAACTT
GACACTACCC TGCCTGAGCT TGCATCCGGA GAGCCTGACA GCGTAAAACT GTTCGGTCTG
CTGGAAAAAC TTGAAATGAA CAGCGTCGCG GAAAAGATAC CGCACATCTT TCCAGGATCA
ACTCCTCCCC GAACAGGCCT TCAGGAACAA AGCGAGCCTT CATTCTCTCC CCCGGAAGGA
GCCGCGTATC ATCTCATTGA TACCGAAGCC GCTCTCACAG AACTCACGAA TACACTTGAG
AAACAGAGCT CGTTTTCCAT AGATACTGAA ACCACCAGCC TGAACACTTT TGAAGCCGAA
CTTGTCGGCA TTTCGATCTG CTGGAAACCG GGCGAAGCGT ATTTCATTCA CTTTACGGAT
AAAGAGCTCA GCGCAAAGAC TTTTCCCGGA AAACTGCAGG ATGTACTTGA AAATCCTGAC
ATCAAAAAAA CAGGACAGAA TCTCAAGTAC GACATTCTCG TACTGAAAAA TCACCATGTA
CGGCTCGCGC CGGTCGGGTT CGATACCATG CTTGCAAGCT ATGTCATCAA TCCCGAGGAG
AAGCACAACC TTGACGATCT TGCAAAAAAA CATCTCAATC ACCGGACAAT CACCTACAGT
GAACTTACCG GTACAGGGAA AAAAGCGATC CCGATCCGTG AGGTTCCGAT CGATAGACTT
ACTGTCTATG CCTGCCAGGA TGCGGATGTC GCTTTGCAGC TCGAACAGAA ACAGAAAATA
CTGCTCGGAG AAAACAGCGA GCTTGAACAA CTCTGCGTGA ACATAGAGTT CCCGCTTGTC
GAAGTGCTTG CCGACATGGA GTACCTGGGA ATCGCTCTCG ATACAGCTCA GCTGGAAAGA
ACGGCTGAAA CCGTTAACCG TCAACTGCTG GAACTTACTG AGAGAATTTA TGATACCGCC
GGAACCATCT TTAACATCGA TTCCCCCAAA CAGCTTGGAA ACGTGCTTTT CAATGTCCTT
GGACTACCTG CAAAAAAAAC AACAAAAACC GGTTTCTCGA CCAATGTCCA GGTCCTTGAA
GATCTCTCTC TTATCCATCC GGTGGCAAAA GATCTTCTGG AATATCGCAG CCTCCAGAAA
CTCAAGACAA CCTATATTGA CGCCCTGCCG AAAATACTCA ATCCGAAAAC AGGGCGGGTT
CACACCTCCT TTAACCAGCA CATAACAGCG ACAGGCAGAC TTTCCTCATC AAACCCGAAC
TTGCAGAACA TCCCGATTCG CACTCCTCTT GGCAGAGAGA TCCGAAAAGC GTTTATCCCC
TCGACGAGTG ACAGGTACCT TCTTTCCGCC GATTACTCCC AGATCGAACT CCGTATCGCG
GCGGAAATCT CACAGGACAG TCATCTCATA GACGCATTCA GAAACCGGGA AGATATTCAC
ACCGCGACCG CGAAAACCAT CTTCGATACG GACGATATCA CCAAGGATAT GAGACGAAAA
GCCAAGGAGG TTAACTTCGG TGTTCTCTAC GGTATCCAGC CATATGGACT GGCTCAAAGG
CTGAACATAT CCCAAAAAGA GGCGAAAGCA ATCATTGACA CCTATATTTC AAAGTATCCG
GGCCTGTTCA GTGCCCTGCA GACGACCATC ACAGAAGCTG CAGAAAAAGG ATATGTCACG
ACGCTGACAG GACGCAGACG TTACATAGAG AATCTTCGCA GCAGAAACCG GAACATCAGG
ATGGCAGCGG AACGAGCGGC CATGAACACC CCTATCCAGG GAACTGCGGC AGATATCATC
AAGTGCGCTA TGGGCCTTGT TTCTGAAGCA ATAAAAAAGA AACGGATGCA ATCCGCAATG
CTCCTGCAGG TTCACGATGA ACTTGTTTTC GAAACGACAG AAGAGGAAAA AGCCGCTCTC
GCCGAAATCG CCGAGGGTTG CATGCAAAAA GCGGCAGAAC TCTGCGGACT TGAAACCGTT
CCTGTAGAAG TCGAAATCGG TACAGGAAAA AACTGGCTGG AAGCCCACTG A
 
Protein sequence
MVSLHKSDQS QLSFEMSTTA ITEKPQGSKK PSLFLLDGMA LVYRSFFALQ RAGMSTKDGI 
PTGAAYGFLM TLLKIFETGK PEYLAVAFDS KEKTFRHELY APYKANRPEP PEDLIAQLEL
IFKLVEAFRI PILKQPGYEA DDLIGSAVRQ FEKQCEIVIV TPDKDLAQLV NEGVSILKPG
RNRNELDTMG CDEVKKQFGV PPECFIDFLT LTGDSSDNIP GAKGIGPKTA SKLLTTYGTL
ENVLAHIEEL APRSRKSLEE FETNRKLIQA LVTIRTDIEL DTTLPELASG EPDSVKLFGL
LEKLEMNSVA EKIPHIFPGS TPPRTGLQEQ SEPSFSPPEG AAYHLIDTEA ALTELTNTLE
KQSSFSIDTE TTSLNTFEAE LVGISICWKP GEAYFIHFTD KELSAKTFPG KLQDVLENPD
IKKTGQNLKY DILVLKNHHV RLAPVGFDTM LASYVINPEE KHNLDDLAKK HLNHRTITYS
ELTGTGKKAI PIREVPIDRL TVYACQDADV ALQLEQKQKI LLGENSELEQ LCVNIEFPLV
EVLADMEYLG IALDTAQLER TAETVNRQLL ELTERIYDTA GTIFNIDSPK QLGNVLFNVL
GLPAKKTTKT GFSTNVQVLE DLSLIHPVAK DLLEYRSLQK LKTTYIDALP KILNPKTGRV
HTSFNQHITA TGRLSSSNPN LQNIPIRTPL GREIRKAFIP STSDRYLLSA DYSQIELRIA
AEISQDSHLI DAFRNREDIH TATAKTIFDT DDITKDMRRK AKEVNFGVLY GIQPYGLAQR
LNISQKEAKA IIDTYISKYP GLFSALQTTI TEAAEKGYVT TLTGRRRYIE NLRSRNRNIR
MAAERAAMNT PIQGTAADII KCAMGLVSEA IKKKRMQSAM LLQVHDELVF ETTEEEKAAL
AEIAEGCMQK AAELCGLETV PVEVEIGTGK NWLEAH