Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Paes_0494 |
Symbol | |
ID | 6459709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | - |
Start bp | 532238 |
End bp | 535090 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642724493 |
Product | DNA polymerase I |
Protein accession | YP_002015197 |
Protein GI | 194333337 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.958979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTGT CAAGTAATGA AAATCAGTTC CAGCTCTCCC TGGACGAAAC AACGCAAGGA GTTGCAGAAC CCTCACTGCC AGACACACCG CGTACTGAAA AACCCGAGCT TTTCCTTCTT GACGGTATGG CTCTTGTCTA CAGGGCGTTT TTCGCCCTTC AGCGGGCAGA TATGAAAACC AGAGACGGCG TTCCGACAGG GGCCGCTTTC GGATTTCTGA CAACCCTCCT TAAAATCTAT GAAACCTACG CTCCGGACTA TCTTGCCGTA GCATTCGACA GCAGGGAAAA AACATTCCGG CACGAACGCT ATGACGCATA CAAGGCAAAC CGCCCTCAAC CGCCTGAAGA TCTTATCACC CAGCTGGATT TCATTTTCCG TCTTATCGAA GCATTCAACA TCCCGATCCT CAAACAGCCC GGTTACGAAG CTGATGATCT TATCGGCACT GCAGCACGCG AGTTCGAACC GGAGTGCCGG ATCAATATCG TCAGTCCCGA CAAGGATATG ACTCAGCTGA TCCATGATGG CGTCACCCTT CTCAAACCGG GAAAACGCCA GAATGAACTC CTGCGCTTTG GAGCGGCTGA GCTCAAAGAG GAGCTCGGTA TTGCTCCAGA TCAGTTCATA GACCTGTTGA CCCTTACCGG AGATTCGTCA GACAATATAC CCGGAGCAAA AGGGATCGGT CCGAAAACTG CATCAAAACT GCTCCTGAGC TATGGCTCTC TTGATGAGAT CAGCAAAAAT CTCGAAAAGC TTCCTCCGAG AACACGCAAG AGCCTTGAAG AATTTCTACC CGAACGTGAG CTGATCCGTG ATCTGGTCAC CATCAAAACG GATATAGACC TTCAAACACC CCTTGCGTCA CTGCAATGCG GACAGCCTGA TCCCGAAAAG CTTTTTCCTC TGCTCGAAAG GCTCGAACTG CGTGCAATCG CCTCGAAAAT TCCCGATCTT TTTCCCGCTA TAACCCCACC AAAAGAAATC GCCCCGGATA ATGCGCCCAA AGTAAAAAGG GAAGCGCACC GTGCACCAAC CATAACAGAG CCGCCTGAAG ACGCCGTTTA CACGCTCATC GATACCGAAG AAGAGCTTAC CACGTTAACC GAAACACTCG CCGGGTTATC GTCGTTTGCT CTCGACACAG AAACAACCAG CCTCAACACC TTTGAAGCCG AGCTGGTCGG CATCTCCATA AGCGTCAAAC CGCAAGAAGC GGTCTTCATC TACTGCAGCC CGGAGGGTCT CAAGCCTGAA AAAGCCCTGT CGATCCTCAA ACCGGTTCTT GAAAATCCGG CGATAGAAAA ATGGGGACAG AACCTGAAAT ACGATATTCT CGTCCTGAAA AATTATGGAA TCCAGCTTGC GCCGACCGGT TTCGACACCA TGCTCGCCAG CTATGTCATC AACCCTGAGG AAACACATAA TCTCGATGAT CTGGCACAAC GCCACCTGCA ATACCGGACA ACGACATACA GCGAGCTGTG CGGAACAGGC AAAAAAGCTG TCCCACTCCG CCAGGTCCCT CTTGAGGCAC TCAAAAACTA CGCGTGTCAG GATGCAGACA TAGCCCTTCG TCTCCAAAAC ACGCTCGAAA AACAGCTGAA AGACAACCAG GAGCTCGAAT GGCTCTGCAG AAACATCGAG TTTCCGCTGG TTAACGTTCT GGTCGCCATG GAATACAGAG GAATTTCGCT TGACACAAAG CATCTGGAAA AAACTGCGAC AAAAGTCGCA TCGCTGACAG CAGACCTCAA AGAGCGAATC TACGCCACAG CAGGCTCGAC ATTCAACATC GACTCGCCCA AACAGCTTGG AGAAATTCTT TTCACCAGAT TAGGACTGCC GGCGAAGAAA ACGACAAAAA CCGGCTACTC AACCAACGTT CAGGTTCTCG AAGAGTTGGC GATGATACAC CCTATTGCGC AGGATATCCT GGAATACCGG AGTCTGCAGA AGCTCCGTTC TACCTATATA GAAGCACTGC CGAAAATCCT GAACCCTGCT ACTGGAAAGC TCCATACCTC CTTCAACCAG CATATCACAG CTACCGGAAG GCTCTCATCG TCAAACCCGA ACCTGCAGAA CATCCCTATC CGTACCGAAC TGGGCAAAGA GATCCGCAAA GCCTTTATTG CCTCAACAAA CAGGAATTTT CTGCTCTCGG CGGACTACTC CCAGATCGAA CTGCGAATCG CTGCGGAACT ATCAGGCGAT AAGAAGCTTA TCGAAGCGTT CCGCAACCAG GAAGATATTC ATGCGGCAAC AGCCAAAGCG ATTTTCGACA CAGAGGATGT CACCAGCGAT ATGCGAAGAA AGGCCAAAGA GGTAAACTTT GGCGTTCTTT ACGGTATTCA GCCCTATGGC TTGTCGATGA GGCTCAACAT CTCCCAGAAA GAAGCTAAAG CCATCATTGA CACCTATAAA TCGAAATATC CCGGCCTGTT CGAGGCTCTC GACGCTCTTG TCAGACAAGC CGCTGAAAAT GGCTTTGTAA CGACTCTTCT CGGACGACGG CGCTATATCG CAGCACTCAA CAGCCGCAAC GGTAATATCC GCAAAGCTGC AGAGCGGGCT GCCATGAACA CGCCTGTCCA GGGAACTGCG GCGGATATCA TCAAATACGC CATGTGCCTC ATAGAAAATA CCATGCAGGA GCACAAGATG CATTCGACCA TGCTCCTTCA GGTTCATGAC GAACTGGTCT TCGAAACCAA TGAAGAGGAA CACGAAAAAC TTGAAACAAT GGTGGTTGCA TCAATGAAAG AAGCCGCTCT GGTCTGCGGA CTGAAACAGG TACCTGTCGA AGTCGATACC GGTATCGGTA AAAACTGGCT CGATGCCCAT TAA
|
Protein sequence | MALSSNENQF QLSLDETTQG VAEPSLPDTP RTEKPELFLL DGMALVYRAF FALQRADMKT RDGVPTGAAF GFLTTLLKIY ETYAPDYLAV AFDSREKTFR HERYDAYKAN RPQPPEDLIT QLDFIFRLIE AFNIPILKQP GYEADDLIGT AAREFEPECR INIVSPDKDM TQLIHDGVTL LKPGKRQNEL LRFGAAELKE ELGIAPDQFI DLLTLTGDSS DNIPGAKGIG PKTASKLLLS YGSLDEISKN LEKLPPRTRK SLEEFLPERE LIRDLVTIKT DIDLQTPLAS LQCGQPDPEK LFPLLERLEL RAIASKIPDL FPAITPPKEI APDNAPKVKR EAHRAPTITE PPEDAVYTLI DTEEELTTLT ETLAGLSSFA LDTETTSLNT FEAELVGISI SVKPQEAVFI YCSPEGLKPE KALSILKPVL ENPAIEKWGQ NLKYDILVLK NYGIQLAPTG FDTMLASYVI NPEETHNLDD LAQRHLQYRT TTYSELCGTG KKAVPLRQVP LEALKNYACQ DADIALRLQN TLEKQLKDNQ ELEWLCRNIE FPLVNVLVAM EYRGISLDTK HLEKTATKVA SLTADLKERI YATAGSTFNI DSPKQLGEIL FTRLGLPAKK TTKTGYSTNV QVLEELAMIH PIAQDILEYR SLQKLRSTYI EALPKILNPA TGKLHTSFNQ HITATGRLSS SNPNLQNIPI RTELGKEIRK AFIASTNRNF LLSADYSQIE LRIAAELSGD KKLIEAFRNQ EDIHAATAKA IFDTEDVTSD MRRKAKEVNF GVLYGIQPYG LSMRLNISQK EAKAIIDTYK SKYPGLFEAL DALVRQAAEN GFVTTLLGRR RYIAALNSRN GNIRKAAERA AMNTPVQGTA ADIIKYAMCL IENTMQEHKM HSTMLLQVHD ELVFETNEEE HEKLETMVVA SMKEAALVCG LKQVPVEVDT GIGKNWLDAH
|
| |