Gene Paes_0494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0494 
Symbol 
ID6459709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp532238 
End bp535090 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content50% 
IMG OID642724493 
ProductDNA polymerase I 
Protein accessionYP_002015197 
Protein GI194333337 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.958979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTGT CAAGTAATGA AAATCAGTTC CAGCTCTCCC TGGACGAAAC AACGCAAGGA 
GTTGCAGAAC CCTCACTGCC AGACACACCG CGTACTGAAA AACCCGAGCT TTTCCTTCTT
GACGGTATGG CTCTTGTCTA CAGGGCGTTT TTCGCCCTTC AGCGGGCAGA TATGAAAACC
AGAGACGGCG TTCCGACAGG GGCCGCTTTC GGATTTCTGA CAACCCTCCT TAAAATCTAT
GAAACCTACG CTCCGGACTA TCTTGCCGTA GCATTCGACA GCAGGGAAAA AACATTCCGG
CACGAACGCT ATGACGCATA CAAGGCAAAC CGCCCTCAAC CGCCTGAAGA TCTTATCACC
CAGCTGGATT TCATTTTCCG TCTTATCGAA GCATTCAACA TCCCGATCCT CAAACAGCCC
GGTTACGAAG CTGATGATCT TATCGGCACT GCAGCACGCG AGTTCGAACC GGAGTGCCGG
ATCAATATCG TCAGTCCCGA CAAGGATATG ACTCAGCTGA TCCATGATGG CGTCACCCTT
CTCAAACCGG GAAAACGCCA GAATGAACTC CTGCGCTTTG GAGCGGCTGA GCTCAAAGAG
GAGCTCGGTA TTGCTCCAGA TCAGTTCATA GACCTGTTGA CCCTTACCGG AGATTCGTCA
GACAATATAC CCGGAGCAAA AGGGATCGGT CCGAAAACTG CATCAAAACT GCTCCTGAGC
TATGGCTCTC TTGATGAGAT CAGCAAAAAT CTCGAAAAGC TTCCTCCGAG AACACGCAAG
AGCCTTGAAG AATTTCTACC CGAACGTGAG CTGATCCGTG ATCTGGTCAC CATCAAAACG
GATATAGACC TTCAAACACC CCTTGCGTCA CTGCAATGCG GACAGCCTGA TCCCGAAAAG
CTTTTTCCTC TGCTCGAAAG GCTCGAACTG CGTGCAATCG CCTCGAAAAT TCCCGATCTT
TTTCCCGCTA TAACCCCACC AAAAGAAATC GCCCCGGATA ATGCGCCCAA AGTAAAAAGG
GAAGCGCACC GTGCACCAAC CATAACAGAG CCGCCTGAAG ACGCCGTTTA CACGCTCATC
GATACCGAAG AAGAGCTTAC CACGTTAACC GAAACACTCG CCGGGTTATC GTCGTTTGCT
CTCGACACAG AAACAACCAG CCTCAACACC TTTGAAGCCG AGCTGGTCGG CATCTCCATA
AGCGTCAAAC CGCAAGAAGC GGTCTTCATC TACTGCAGCC CGGAGGGTCT CAAGCCTGAA
AAAGCCCTGT CGATCCTCAA ACCGGTTCTT GAAAATCCGG CGATAGAAAA ATGGGGACAG
AACCTGAAAT ACGATATTCT CGTCCTGAAA AATTATGGAA TCCAGCTTGC GCCGACCGGT
TTCGACACCA TGCTCGCCAG CTATGTCATC AACCCTGAGG AAACACATAA TCTCGATGAT
CTGGCACAAC GCCACCTGCA ATACCGGACA ACGACATACA GCGAGCTGTG CGGAACAGGC
AAAAAAGCTG TCCCACTCCG CCAGGTCCCT CTTGAGGCAC TCAAAAACTA CGCGTGTCAG
GATGCAGACA TAGCCCTTCG TCTCCAAAAC ACGCTCGAAA AACAGCTGAA AGACAACCAG
GAGCTCGAAT GGCTCTGCAG AAACATCGAG TTTCCGCTGG TTAACGTTCT GGTCGCCATG
GAATACAGAG GAATTTCGCT TGACACAAAG CATCTGGAAA AAACTGCGAC AAAAGTCGCA
TCGCTGACAG CAGACCTCAA AGAGCGAATC TACGCCACAG CAGGCTCGAC ATTCAACATC
GACTCGCCCA AACAGCTTGG AGAAATTCTT TTCACCAGAT TAGGACTGCC GGCGAAGAAA
ACGACAAAAA CCGGCTACTC AACCAACGTT CAGGTTCTCG AAGAGTTGGC GATGATACAC
CCTATTGCGC AGGATATCCT GGAATACCGG AGTCTGCAGA AGCTCCGTTC TACCTATATA
GAAGCACTGC CGAAAATCCT GAACCCTGCT ACTGGAAAGC TCCATACCTC CTTCAACCAG
CATATCACAG CTACCGGAAG GCTCTCATCG TCAAACCCGA ACCTGCAGAA CATCCCTATC
CGTACCGAAC TGGGCAAAGA GATCCGCAAA GCCTTTATTG CCTCAACAAA CAGGAATTTT
CTGCTCTCGG CGGACTACTC CCAGATCGAA CTGCGAATCG CTGCGGAACT ATCAGGCGAT
AAGAAGCTTA TCGAAGCGTT CCGCAACCAG GAAGATATTC ATGCGGCAAC AGCCAAAGCG
ATTTTCGACA CAGAGGATGT CACCAGCGAT ATGCGAAGAA AGGCCAAAGA GGTAAACTTT
GGCGTTCTTT ACGGTATTCA GCCCTATGGC TTGTCGATGA GGCTCAACAT CTCCCAGAAA
GAAGCTAAAG CCATCATTGA CACCTATAAA TCGAAATATC CCGGCCTGTT CGAGGCTCTC
GACGCTCTTG TCAGACAAGC CGCTGAAAAT GGCTTTGTAA CGACTCTTCT CGGACGACGG
CGCTATATCG CAGCACTCAA CAGCCGCAAC GGTAATATCC GCAAAGCTGC AGAGCGGGCT
GCCATGAACA CGCCTGTCCA GGGAACTGCG GCGGATATCA TCAAATACGC CATGTGCCTC
ATAGAAAATA CCATGCAGGA GCACAAGATG CATTCGACCA TGCTCCTTCA GGTTCATGAC
GAACTGGTCT TCGAAACCAA TGAAGAGGAA CACGAAAAAC TTGAAACAAT GGTGGTTGCA
TCAATGAAAG AAGCCGCTCT GGTCTGCGGA CTGAAACAGG TACCTGTCGA AGTCGATACC
GGTATCGGTA AAAACTGGCT CGATGCCCAT TAA
 
Protein sequence
MALSSNENQF QLSLDETTQG VAEPSLPDTP RTEKPELFLL DGMALVYRAF FALQRADMKT 
RDGVPTGAAF GFLTTLLKIY ETYAPDYLAV AFDSREKTFR HERYDAYKAN RPQPPEDLIT
QLDFIFRLIE AFNIPILKQP GYEADDLIGT AAREFEPECR INIVSPDKDM TQLIHDGVTL
LKPGKRQNEL LRFGAAELKE ELGIAPDQFI DLLTLTGDSS DNIPGAKGIG PKTASKLLLS
YGSLDEISKN LEKLPPRTRK SLEEFLPERE LIRDLVTIKT DIDLQTPLAS LQCGQPDPEK
LFPLLERLEL RAIASKIPDL FPAITPPKEI APDNAPKVKR EAHRAPTITE PPEDAVYTLI
DTEEELTTLT ETLAGLSSFA LDTETTSLNT FEAELVGISI SVKPQEAVFI YCSPEGLKPE
KALSILKPVL ENPAIEKWGQ NLKYDILVLK NYGIQLAPTG FDTMLASYVI NPEETHNLDD
LAQRHLQYRT TTYSELCGTG KKAVPLRQVP LEALKNYACQ DADIALRLQN TLEKQLKDNQ
ELEWLCRNIE FPLVNVLVAM EYRGISLDTK HLEKTATKVA SLTADLKERI YATAGSTFNI
DSPKQLGEIL FTRLGLPAKK TTKTGYSTNV QVLEELAMIH PIAQDILEYR SLQKLRSTYI
EALPKILNPA TGKLHTSFNQ HITATGRLSS SNPNLQNIPI RTELGKEIRK AFIASTNRNF
LLSADYSQIE LRIAAELSGD KKLIEAFRNQ EDIHAATAKA IFDTEDVTSD MRRKAKEVNF
GVLYGIQPYG LSMRLNISQK EAKAIIDTYK SKYPGLFEAL DALVRQAAEN GFVTTLLGRR
RYIAALNSRN GNIRKAAERA AMNTPVQGTA ADIIKYAMCL IENTMQEHKM HSTMLLQVHD
ELVFETNEEE HEKLETMVVA SMKEAALVCG LKQVPVEVDT GIGKNWLDAH