Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0540 |
Symbol | |
ID | 6374204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 568792 |
End bp | 571602 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642683057 |
Product | DNA polymerase I |
Protein accession | YP_001958984 |
Protein GI | 189499514 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.560108 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTCGT TGCATAAGAG CGATCAGAGT CAACTGTCTT TTGAGATGAG CACGACAGCG ATCACAGAAA AACCGCAAGG AAGCAAAAAA CCCTCCCTTT TTCTGCTTGA CGGCATGGCG CTGGTCTACC GGTCATTTTT CGCCCTCCAG CGCGCAGGAA TGTCAACAAA AGACGGAATT CCGACAGGCG CCGCTTACGG ATTTCTCATG ACCCTGCTGA AAATATTTGA AACCGGTAAG CCTGAATATC TTGCTGTTGC TTTTGACAGT AAGGAAAAAA CATTCCGGCA TGAGCTCTAT GCTCCATACA AGGCGAACCG TCCGGAACCG CCGGAAGACC TTATTGCGCA ACTGGAGCTG ATTTTCAAGC TTGTCGAAGC GTTCCGGATC CCGATTCTCA AGCAACCCGG TTATGAAGCC GACGATCTTA TAGGAAGTGC CGTCAGGCAG TTCGAGAAAC AGTGCGAGAT AGTTATCGTC ACACCGGACA AGGATCTCGC GCAACTTGTC AATGAAGGGG TTTCGATCCT TAAACCGGGC AGAAACAGGA ACGAACTTGA TACCATGGGC TGTGATGAGG TCAAAAAACA GTTTGGTGTG CCGCCTGAAT GTTTTATCGA TTTTCTGACC CTGACCGGGG ACAGTTCGGA CAACATACCC GGCGCGAAAG GAATCGGACC GAAAACCGCC TCAAAGTTGC TCACGACCTA TGGTACTCTG GAAAATGTCC TTGCCCATAT CGAGGAACTT GCCCCCAGGT CACGTAAAAG TCTTGAAGAG TTCGAGACAA ACCGTAAGCT TATCCAGGCT CTTGTTACGA TAAGGACAGA TATCGAACTT GACACTACCC TGCCTGAGCT TGCATCCGGA GAGCCTGACA GCGTAAAACT GTTCGGTCTG CTGGAAAAAC TTGAAATGAA CAGCGTCGCG GAAAAGATAC CGCACATCTT TCCAGGATCA ACTCCTCCCC GAACAGGCCT TCAGGAACAA AGCGAGCCTT CATTCTCTCC CCCGGAAGGA GCCGCGTATC ATCTCATTGA TACCGAAGCC GCTCTCACAG AACTCACGAA TACACTTGAG AAACAGAGCT CGTTTTCCAT AGATACTGAA ACCACCAGCC TGAACACTTT TGAAGCCGAA CTTGTCGGCA TTTCGATCTG CTGGAAACCG GGCGAAGCGT ATTTCATTCA CTTTACGGAT AAAGAGCTCA GCGCAAAGAC TTTTCCCGGA AAACTGCAGG ATGTACTTGA AAATCCTGAC ATCAAAAAAA CAGGACAGAA TCTCAAGTAC GACATTCTCG TACTGAAAAA TCACCATGTA CGGCTCGCGC CGGTCGGGTT CGATACCATG CTTGCAAGCT ATGTCATCAA TCCCGAGGAG AAGCACAACC TTGACGATCT TGCAAAAAAA CATCTCAATC ACCGGACAAT CACCTACAGT GAACTTACCG GTACAGGGAA AAAAGCGATC CCGATCCGTG AGGTTCCGAT CGATAGACTT ACTGTCTATG CCTGCCAGGA TGCGGATGTC GCTTTGCAGC TCGAACAGAA ACAGAAAATA CTGCTCGGAG AAAACAGCGA GCTTGAACAA CTCTGCGTGA ACATAGAGTT CCCGCTTGTC GAAGTGCTTG CCGACATGGA GTACCTGGGA ATCGCTCTCG ATACAGCTCA GCTGGAAAGA ACGGCTGAAA CCGTTAACCG TCAACTGCTG GAACTTACTG AGAGAATTTA TGATACCGCC GGAACCATCT TTAACATCGA TTCCCCCAAA CAGCTTGGAA ACGTGCTTTT CAATGTCCTT GGACTACCTG CAAAAAAAAC AACAAAAACC GGTTTCTCGA CCAATGTCCA GGTCCTTGAA GATCTCTCTC TTATCCATCC GGTGGCAAAA GATCTTCTGG AATATCGCAG CCTCCAGAAA CTCAAGACAA CCTATATTGA CGCCCTGCCG AAAATACTCA ATCCGAAAAC AGGGCGGGTT CACACCTCCT TTAACCAGCA CATAACAGCG ACAGGCAGAC TTTCCTCATC AAACCCGAAC TTGCAGAACA TCCCGATTCG CACTCCTCTT GGCAGAGAGA TCCGAAAAGC GTTTATCCCC TCGACGAGTG ACAGGTACCT TCTTTCCGCC GATTACTCCC AGATCGAACT CCGTATCGCG GCGGAAATCT CACAGGACAG TCATCTCATA GACGCATTCA GAAACCGGGA AGATATTCAC ACCGCGACCG CGAAAACCAT CTTCGATACG GACGATATCA CCAAGGATAT GAGACGAAAA GCCAAGGAGG TTAACTTCGG TGTTCTCTAC GGTATCCAGC CATATGGACT GGCTCAAAGG CTGAACATAT CCCAAAAAGA GGCGAAAGCA ATCATTGACA CCTATATTTC AAAGTATCCG GGCCTGTTCA GTGCCCTGCA GACGACCATC ACAGAAGCTG CAGAAAAAGG ATATGTCACG ACGCTGACAG GACGCAGACG TTACATAGAG AATCTTCGCA GCAGAAACCG GAACATCAGG ATGGCAGCGG AACGAGCGGC CATGAACACC CCTATCCAGG GAACTGCGGC AGATATCATC AAGTGCGCTA TGGGCCTTGT TTCTGAAGCA ATAAAAAAGA AACGGATGCA ATCCGCAATG CTCCTGCAGG TTCACGATGA ACTTGTTTTC GAAACGACAG AAGAGGAAAA AGCCGCTCTC GCCGAAATCG CCGAGGGTTG CATGCAAAAA GCGGCAGAAC TCTGCGGACT TGAAACCGTT CCTGTAGAAG TCGAAATCGG TACAGGAAAA AACTGGCTGG AAGCCCACTG A
|
Protein sequence | MVSLHKSDQS QLSFEMSTTA ITEKPQGSKK PSLFLLDGMA LVYRSFFALQ RAGMSTKDGI PTGAAYGFLM TLLKIFETGK PEYLAVAFDS KEKTFRHELY APYKANRPEP PEDLIAQLEL IFKLVEAFRI PILKQPGYEA DDLIGSAVRQ FEKQCEIVIV TPDKDLAQLV NEGVSILKPG RNRNELDTMG CDEVKKQFGV PPECFIDFLT LTGDSSDNIP GAKGIGPKTA SKLLTTYGTL ENVLAHIEEL APRSRKSLEE FETNRKLIQA LVTIRTDIEL DTTLPELASG EPDSVKLFGL LEKLEMNSVA EKIPHIFPGS TPPRTGLQEQ SEPSFSPPEG AAYHLIDTEA ALTELTNTLE KQSSFSIDTE TTSLNTFEAE LVGISICWKP GEAYFIHFTD KELSAKTFPG KLQDVLENPD IKKTGQNLKY DILVLKNHHV RLAPVGFDTM LASYVINPEE KHNLDDLAKK HLNHRTITYS ELTGTGKKAI PIREVPIDRL TVYACQDADV ALQLEQKQKI LLGENSELEQ LCVNIEFPLV EVLADMEYLG IALDTAQLER TAETVNRQLL ELTERIYDTA GTIFNIDSPK QLGNVLFNVL GLPAKKTTKT GFSTNVQVLE DLSLIHPVAK DLLEYRSLQK LKTTYIDALP KILNPKTGRV HTSFNQHITA TGRLSSSNPN LQNIPIRTPL GREIRKAFIP STSDRYLLSA DYSQIELRIA AEISQDSHLI DAFRNREDIH TATAKTIFDT DDITKDMRRK AKEVNFGVLY GIQPYGLAQR LNISQKEAKA IIDTYISKYP GLFSALQTTI TEAAEKGYVT TLTGRRRYIE NLRSRNRNIR MAAERAAMNT PIQGTAADII KCAMGLVSEA IKKKRMQSAM LLQVHDELVF ETTEEEKAAL AEIAEGCMQK AAELCGLETV PVEVEIGTGK NWLEAH
|
| |