Gene Cpha266_0536 details


Gene Information

Locus tag: Cpha266_0536
Symbol:
ID: 4569073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 589218
End bp: 592058
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 46%
IMG OID: 639765135
Product: DNA polymerase I
Protein accession: YP_911017
Protein GI: 119356373
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
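
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no residue.
    return cds_length_bp // 3 - 1


gene_length = span_length(589218, 592058)
print(gene_length)                           # 2841
print(expected_protein_length(gene_length))  # 946
```

Under those assumptions both results agree with the Gene Length and Protein Length values listed in the record.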


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATTG ACAATCAATT TGATTTTTTC CAGGCAGGTT GCGGCCAACC GTCTGAAGCA 
ACAAAAAAAA ACATACCGGA AAAAAAACCC GCACTGTTTC TGATAGATGG CATGGCCATG
GTTTACCGGG CATATTATGC GTTGCAGTCT GCAAGAATGA AAACCCGCGA TGGCCTTCCA
TCAGGAGCTG TGTTTGGCTT CACCTCTGCG CTGCTCAAGA TTTTCGAAAC CTATAAACCC
GACTATCTTG CCGTTGCATT TGACAGCAGG GAAAAAACAT TTCGCCATGA CCTTTATGAC
CTTTACAAAG CAAACCGTCC CTCCCCACCA GAGGATCTCA TAAGTCAGCT CGATGCCATT
TATCAGCTTG TAACGCTCCT CGGCATACCG ATTATCAAAA CAGCCGGGTT TGAAGCCGAT
GACCTTATCG GTTCACTTGC GCGCAAATTC GAAAAGAGTT GCGCTATCTA TATCGTAACG
CCTGACAAAG ACCTTGCGCA ACTTGTCAAC GATGAAGTAT ACATACTTAA ACCGGGAAAA
ATCCAGAATG AACTTGAACT TCTTGGCATT AAGGAAATTA CAATACAGTT TGGAGTCCGT
CCTGAGCAGT TCACCGACTT TTTAGCACTC GCAGGAGATG CCTCGGACAA TATCCCCGGA
GCAAAAGGTA TTGGCCCGAA AACCGCTGCA AGCCTGATAG GAAAATATGG ATCCATTGCC
AGCATACTTC TTAATCTCAA TGAGATATCT CCAAAAAACC GACAGAGTCT CGAAGAATTT
CAACCTCGCC TCAAACTGAT CAGGCAACTG GTAACCATAC GAACCGATCT CGACCTTCAC
GTGAGCCTGC AAAACCTTGC AACAACGACT CCTGACGTCG AAAAAATACT TCCTTTCCTG
AAGCAATACG AGCTGAAATC CATAGCAGCA AGACTTCCCG CTATTTTTCC GGAAATGAAT
CTTCAGCCAG AATCACTACC CGCAGATAAC GAAAATAATG AGAAGAACGA CAGCCCGGAG
CTTACTGCTC CCGGCTCCGC GAAAGATGGC GCTGATTACC GAATAGTCAC AAGGAAAGAG
GAGCTGCAGG CACTCGTCGA ACAACTGAAC CATGCGGCAG CACTTGCTGT TGATACCGAG
ACCACCAGCC TGGATACCTT TCAGGCTGAA CTTGTCGGGA TATCGCTCTC CATAAAACCA
AAACAGGCAT GGTTCGTCTA TTTCGGCAAA GGCGGAGTGG ATAGACGAGT CGCTCTCGAC
ATGCTCAAGC CGGTTCTTGA AAACCCATCT CTGCGAAAAA CGGGACAAAA CCTCAAATAT
GACCTGCTCG TTCTGAAAAA ATATGGAATC GACATCAACC CTGTTGCATT TGACACCATG
CTTGCGAGTT ACGTGCTCGA TCCGGAAGCG AAACATAATC TTGACGACCT TGCGCTGCGC
CATCTCTCCA TAAAAACCAC AACGTACGAC GAACTGGTCG CTGACGGCAA GAAAAAAATG
TCCATCCTTG ACGTTCCACC CGGCGAACTG TCCGATTATG CATGTCAGGA TGCCGATCTT
GCGCTGCGCC TGCAGGAGGT TTTCAAAGAA AAACTCCTGC AGGAAAAAGA TCTGCTCTGG
TTATGTGAAA ATATTGAATT CCCTCTTGTG CCGGTTCTCG CAACAATGGA GTATCATGGC
ATTTCAATTG ATACCGATCA CCTGAAAAAA ACAGAAACAA CTGTTTCAAA GCAGATCGGC
CAACTCTCCG AAAAAATTTT CGAAGCATCC GGCAGAGTTT TCAACCTCGA TTCGCCAAAA
CAACTTGCTC ATATTCTGTT CGACATTCTC GGGCTTCCTT CCGGCAAAGC AACGAAAACC
GGCTTTTCCA CCAACGTTCA GGTCCTTGAA GATCTTGCTC CGATCCACCC TGTCGCACAG
GATCTCCTTG AATACAGAAG CCTGCAGAAA CTCAGAAACA CGTATATCGA AGCTTTACCA
AAAATGATCA ATCCTCTGAC GGGAAAACTG CATACATCAT TCAATCAACA TGTTACCGCA
ACAGGACGGC TTTCATCATC GCATCCTAAC CTGCAAAACA TCCCGATACG CACCCTGATT
GGAAAAGAAA TACGGCGAGC ATTTATACCG TCCAACCCTG AAAACCTGCT TCTTTCAGCA
GACTACTCCC AGATAGAACT CAGAGTGGCT GCCGAAATCA GCGGCGATGA AAAGCTTATG
GAGGCGTTCA GAAACAGGGA AGACATCCAT TCCGCTACAG CAAGAACCAT TTTCAACACA
ACCGAGATCA CCCCGGAGAT GCGACGCAAG GCAAAGGAGG TGAATTTCGG AGTATTGTAT
GGAATCATGC CTTTTGGTCT TTCACGACGC CTCAACATCT CTCGCAATGA AGCAAAAAAT
ATTATCGATA CGTATACCGA AAAGTACCCC GGCATATTCA ACGCTTTGCA ACAAATCATC
AGTGACGGCA AGGAACGCGG TTATGTTTCG ACCCTGCTTG GCAGAAGACG ATACATCCCT
GATCTGAACA GCAGAAACAA GAATATGCAG AAAGCTGCAG AAAGAGCAGC AATGAATACT
CCGATTCAGG GAACGGCTGC AGACATCATC AAATGCGCAA TGAATCTCTG CAGCACGCAA
CTTCGATTGC ATAAAATGAA ATCCGTTATG CTCCTTCAGG TTCATGACGA ACTTCTTTTT
GAAACACCGG AAAACGAAAA ACATAGGCTG AAAACGCTTG TAGAAGAGGT AATGATTGAT
GCGGCAAAGC GTTGCGGGTT ACATAACGTC CCTGTAGAAG TAGATACCGG TATCGGAAAA
AACTGGCTCG AAGCCCATTA A
 
Protein sequence
MSIDNQFDFF QAGCGQPSEA TKKNIPEKKP ALFLIDGMAM VYRAYYALQS ARMKTRDGLP 
SGAVFGFTSA LLKIFETYKP DYLAVAFDSR EKTFRHDLYD LYKANRPSPP EDLISQLDAI
YQLVTLLGIP IIKTAGFEAD DLIGSLARKF EKSCAIYIVT PDKDLAQLVN DEVYILKPGK
IQNELELLGI KEITIQFGVR PEQFTDFLAL AGDASDNIPG AKGIGPKTAA SLIGKYGSIA
SILLNLNEIS PKNRQSLEEF QPRLKLIRQL VTIRTDLDLH VSLQNLATTT PDVEKILPFL
KQYELKSIAA RLPAIFPEMN LQPESLPADN ENNEKNDSPE LTAPGSAKDG ADYRIVTRKE
ELQALVEQLN HAAALAVDTE TTSLDTFQAE LVGISLSIKP KQAWFVYFGK GGVDRRVALD
MLKPVLENPS LRKTGQNLKY DLLVLKKYGI DINPVAFDTM LASYVLDPEA KHNLDDLALR
HLSIKTTTYD ELVADGKKKM SILDVPPGEL SDYACQDADL ALRLQEVFKE KLLQEKDLLW
LCENIEFPLV PVLATMEYHG ISIDTDHLKK TETTVSKQIG QLSEKIFEAS GRVFNLDSPK
QLAHILFDIL GLPSGKATKT GFSTNVQVLE DLAPIHPVAQ DLLEYRSLQK LRNTYIEALP
KMINPLTGKL HTSFNQHVTA TGRLSSSHPN LQNIPIRTLI GKEIRRAFIP SNPENLLLSA
DYSQIELRVA AEISGDEKLM EAFRNREDIH SATARTIFNT TEITPEMRRK AKEVNFGVLY
GIMPFGLSRR LNISRNEAKN IIDTYTEKYP GIFNALQQII SDGKERGYVS TLLGRRRYIP
DLNSRNKNMQ KAAERAAMNT PIQGTAADII KCAMNLCSTQ LRLHKMKSVM LLQVHDELLF
ETPENEKHRL KTLVEEVMID AAKRCGLHNV PVEVDTGIGK NWLEAH
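
The gene and protein sequences above are printed in space-separated blocks of 10, so they have to be cleaned before any computation. The following is a minimal sketch, not part of the original record, of stripping that layout, computing a GC fraction, and translating the DNA. The helper names are hypothetical. The codon table is built from the standard NCBI amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G, with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 up to (not including) the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGAGCATTG ACAATCAATT TGATTTTTTC CAGGCAGGTT GCGGCCAACC GTCTGAAGCA"
print(translate(first_line))  # MSIDNQFDFFQAGCGQPSEA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein sequence listed in the record.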