Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1645 |
Symbol | |
ID | 3745096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1845351 |
End bp | 1848188 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637769678 |
Product | DNA polymerase A |
Protein accession | YP_375542 |
Protein GI | 78187499 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGA GGCAGCTTGG CGTGTTCGAC GACGGCGCAG GAATGGAGCT GCACGACCAG CCGGCAGCGG CTGCCGGAGC AAAACCCGAT CTCTTCATCA TCGACGGCAT GGCCCTGCTC TACCGGGCCT ACTTCGCACT TCTCCGTGCC GGCATGAAAA CCCGGGACGG CAGGCCGACC GGAGCCCTGT ACGGGTTCAT GTCGACCCTG CTCAGGATAT TCGAATCCTA TCACCCGCGG TATCTGGCCG CCGCGTTTGA CAGCCGGGAA AAGACCTTCC GCCATGACAT TTTTCCCGAA TACAAAGCAA ACCGGACTGC ACCTCCCGAA GAGATGACCG GCCAGATCGA ACCGCTCTTC GAACTGCTCC GGGCGCTCGG CATTCCCATT CTCCGCACGC CGGGCTTTGA GGCCGATGAT CTCATCGGCA CGGCAGCAAA AGAGTTCGAA GATGCCTGCA GCATCTACAT CGTCACCCCG GACAAGGACC TCGCCCAGCT GGTACACGAT GGCGTCAAGC TCCTCAGGCC CTCAAAGAAC CAGAATGAGC TGCAGCTGAT GGGACCGCGG GAAGTAGAGG AGCAGTTCGG CGTCCCGCCG GAACAGTTCA CAGCCTTCCT CACCCTCACC GGCGACACCT CCGACAACAT CCCCGGAGCA GAGGGCATCG GCCCCAAAAC CGCGACCTCG CTGCTCGCAC GGTTCGGATC TCTGGAGGAA GTCTACCGCC ATATCGAAGA ACTCACGCCG AAAGTCAGGA AAAGCCTCGA AGCGTTCCGG CCCCGCCTCC AGCTCATTAC CGATCTCGTC ACCATCCGCA CCGATCTGGC ACTCCATGTC ACCCTTGAAG AGCTTCGCTG TACCACTCCC GACCGGCATG AACTGCTCGA ACTCCTCGGA AAGCTTGAGC TGAGGAGCAT CGCATCGCGT CTCCCGGCAG CCTTCCCGGG TCTGGCCGTC GAATCACCGG CCGGAGATCC GGCCAGAACG GCAGGAGCGG AAAAAACTGC AGCCGAAACG GAGGAGCTCG GCGACCCGCG CCTTGGTGCA GACTACGGAA TGATTGCGAC AGAGGACCTT CTCCGGGAAT TCGTCAGGGA GATGTTGCAG ACCGATAAAA TCGCCGTCGA CACGGAAACC ACCTCGCTCG ACACCTTCGA GGCCGAACTT GCAGGCATTT CCATCTCGGC TGAACCGGCC AAAGCACGTT TTATTTCATT TGCAGGCACC GGACTCGACC GGGGGCGCTC AATCGAGATC CTGCGGCCGC TGCTCGAGAA TCCGGCCATA CCAAAAACCG GCCAGAACCT CAAATACGAC ATCCTCGTCC TGAAGAACTA CGGCCTGGAA CTCGGGCCCG TCGGCTTCGA CACGATGCTG GCCAGCTACG TGCTCGACCC CGACGGCAAA CACAACCTCG ACGACATGGC CGCGCTGCAC CTCCGGCTCA AAACCACAAA ATATGACGAG CTCACCGGCA CAGGAAAGAA CAGGCTGCAC ATTTACGATG TCGAACCCGC AAAACTGACA GACTACGCCT GCCAGGACGC CGACCTCGCC CTGCAGCTTG AAGGGGCATT CACCAGAAAC CTTGAAGCCG AACCCCGCCT CATGGAGCTT TGCCAAACGA TCGAGTTCCC GCTCGTCAGT GTGCTCGCAA AGATGGAGCA TCAGGGCATA AGCATCGACA GCAAGCACTT GGAGGAAACG TCGATGGCTG TCGGTCACCA GCTTGAATCG CTGAGGGAAA CGATCCATGC CGCCGCAGGC ACCGACTTCA ACATCGATTC CCCGAAGCAG CTCTCCCACA TCCTCTTCAA CGTGCTTGGC CTTCCGACGA AAAAAACAAC CAAAACGGGT TTCTCGACCA ACGTGGAGGT GCTTGAGGAA CTCGCTCCGC TTCACCCCGT AGTCAGTGAT CTGCTTCTGT ACCGAAGCCT CCAGAAGCTC AAGACCACCT ACATCGATGC GCTCCCGAAG ATGGTGAACC CCCGCACCGG CAGGGTGCAC ACATCCTTCA ACCAGCACGT CACGGCCACC GGACGGCTCT CCTCGTCAAA CCCGAACCTG CAGAACATCC CCATCCGAAC CCCGCTGGGG CGCGAGATCC GCAAGGCGTT CATCCCCACA AACCCCGCGA ACTGGCTCCT CTCGGCCGAC TACTCCCAGA TCGAGCTCCG CATCGCTGCA GAGATCTCCG GGGACCCGAA GCTCATCGAG GCATTCCGTA ACGGCCTGGA CATCCACGCC GAAACGGCAA GGGTCATTTT CGACACGGAA GACATCACCG CCGACATGCG CCGCAAGGCT AAGGAGGTCA ACTTCGGCGT CCTCTACGGC ATCCAGCCAT TCGGACTCTC CAAGCGCCTG AACATTCCCA GAAACGAGGC CAGGGAGATC ATCGACACCT ACCGTGAGAA GTACCCCGGG CTCTTCAGCT CCCTTGAGGG AGTCATCGAA GAGGGCAGAA AGAACGGCTT CGTCACGACA CTGCTCGGCC GCCGGCGGTA CCTCAGAGAC CTCACGAGCC GGAACTCAAA TGTGCAGAAA GCCGCCGAGC GGGCTGCCAT GAACACCCCA ATCCAGGGCA CCGCGGCCGA CATCATCAAA TGCGCCATGA ATCTCTGCAG CCGACGGATC GATGCGGCTT CCATGCAGTC AGTCATGCTC CTTCAGGTAC ACGATGAACT GGTCTTTGAA ACCACCGAAG ATGAAAAAGA AGGCCTCAAG TCACTCGTCG AGGAAGCCAT GATCGAGGCG GCGGCACTCT GCGGCCTGAA AAATGTCCCG GTCGCTGTCG ATACCGGAAT TGGGAAGAAC TGGCTCGAAG CCCACTGA
|
Protein sequence | MNERQLGVFD DGAGMELHDQ PAAAAGAKPD LFIIDGMALL YRAYFALLRA GMKTRDGRPT GALYGFMSTL LRIFESYHPR YLAAAFDSRE KTFRHDIFPE YKANRTAPPE EMTGQIEPLF ELLRALGIPI LRTPGFEADD LIGTAAKEFE DACSIYIVTP DKDLAQLVHD GVKLLRPSKN QNELQLMGPR EVEEQFGVPP EQFTAFLTLT GDTSDNIPGA EGIGPKTATS LLARFGSLEE VYRHIEELTP KVRKSLEAFR PRLQLITDLV TIRTDLALHV TLEELRCTTP DRHELLELLG KLELRSIASR LPAAFPGLAV ESPAGDPART AGAEKTAAET EELGDPRLGA DYGMIATEDL LREFVREMLQ TDKIAVDTET TSLDTFEAEL AGISISAEPA KARFISFAGT GLDRGRSIEI LRPLLENPAI PKTGQNLKYD ILVLKNYGLE LGPVGFDTML ASYVLDPDGK HNLDDMAALH LRLKTTKYDE LTGTGKNRLH IYDVEPAKLT DYACQDADLA LQLEGAFTRN LEAEPRLMEL CQTIEFPLVS VLAKMEHQGI SIDSKHLEET SMAVGHQLES LRETIHAAAG TDFNIDSPKQ LSHILFNVLG LPTKKTTKTG FSTNVEVLEE LAPLHPVVSD LLLYRSLQKL KTTYIDALPK MVNPRTGRVH TSFNQHVTAT GRLSSSNPNL QNIPIRTPLG REIRKAFIPT NPANWLLSAD YSQIELRIAA EISGDPKLIE AFRNGLDIHA ETARVIFDTE DITADMRRKA KEVNFGVLYG IQPFGLSKRL NIPRNEAREI IDTYREKYPG LFSSLEGVIE EGRKNGFVTT LLGRRRYLRD LTSRNSNVQK AAERAAMNTP IQGTAADIIK CAMNLCSRRI DAASMQSVML LQVHDELVFE TTEDEKEGLK SLVEEAMIEA AALCGLKNVP VAVDTGIGKN WLEAH
|
| |