Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3415 |
Symbol | |
ID | 5210392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4286078 |
End bp | 4288975 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597010 |
Product | DNA polymerase I |
Protein accession | YP_001277723 |
Protein GI | 148657518 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00208167 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0804526 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCAACA ACCGTCCGCT GCTGCTGCTG ATCGATGGCC ATGCGCTGGC GTACCGCGCG TTCCACGCCC TGGCGGAGGC TGGTTTACGC TCTTCGACCG GCGAGCCGAC GTATGCGGTC TTCGGTTTTA CGTCGGCGAT GCTCAATGCC ATCGAGGAGT ATCATCCCGA CTATGCAGCA GTGGCATTCG ATGTCGGGAA GACCTTCCGC GACGACCTGT ACGCCGAATA CAAGGCGAAC CGCGCTGAAA CTCCTGCCGA GTTCGAGCAG CAACTCGAGC GCATTAAGCA AGTGCTGGCG GCGTTCGACA TCCCGATCTA TACCGCCGAT GGGTATGAAG CCGACGACGT GATCGGCACG CTGGCGCGTC AGGCGACGGA GCGCGGCGTT GATGTGCTGA TCCTGACCGG CGATACCGAT ACGCTTCAAC TGGTCGATGA GCATGTGACA GTGCTGCTCA ATAATCCCTA TGTGCGCGGT TCCAAAAACA CAACGCGCTA TGGGGTCGCC GATGTGTGCG CGCGTTACAA AGGGTTGCGC CCCGATCAAC TCGCCGATCT GCGTGGGTTG AAGGGCGATC CGTCCGATAA TATCCCCGGC GTCAAAGGGA TCGGGGAAGC GGGCGCAATT GCGTTGCTCA ATCAGTTTGG CTCAATTGAG AACCTGTACG ATCATCTGGA CGAAGCGCCG AAACGCTACC AGAAGCACCT GGAAGGTCAG CGTGACGCGG CGTTGTTCAG CAAGAAACTG GCGACAATTG TACGTGATGC GCCGGTGACG CTCGACCTTC CTGCCGCAAC TCTTGCCGAC TATGATCGCA GTCGGGTGAT TGCTGTCTTC CAGGAACTTG AATTCGGTGC GTCACTGGTC AGGCGCCTCC CCCCGTCGCA GACCATCGCA GCGCCGCAGG CGCTGCCGCC GGTCGAGCCG CCTGCGCCGC TTCAGGTCGA TATGTTTGCC CCTGCAACAC CTGGACCGGA TGACGGACCG CAGCAGCTGA CCCTGTTCAA CGATATGCCG ACGCCTGTTG CGCCGGTGGT TGAGCCGCCA GCGCACGATG CGCCTGGTGA GTATCGCGCC GCGTGCAACG ATGCTGATCT GGAAGCAATT GTCACAGAAC TCAAGCATGC GTCGCTATTT GCATTCGATA CCGAGACGCG CGGCACCAAT CCCCTGCGCG ACGATCTGGT GGGCATCGCC CTGGCGACGA TCCCCGGCAG TGGATGGTAT GTTCCGCTGG GGCATACCAC CGGCGAGGCG CAACTGCCGC GTGAGCGAGT CATTGCTGCG CTGCGTCCGT TTTTCGCCGA TCCGGCGCGC TCCAGAATAG CGCACAATGC GAAGTTCGAC ATCGAGGTGC TGGAACGCGC TGGCATCCCG GTTGCCGGTG TGGCGTTCGA CACGATGCTG GCAGCGGCGC TGCTCGACAA ACGGCGCAAC CTGAAAGACC TGGCGTTCTA TGAACTGAAC CTCGCCGCTC CGCTCGAATC GATTGAAGCG TTGATCGGGA AGGGCAAAAA CCAGGTGACC TTCGCCGATG TGCCGATTGC GCGCGCCACG CCGTATGCCG CTGCCGACGC CGATATGACG CTGCGCCTGA AGCCAGCGCT TGAAGCGAAA CTGCGCGCAG CCGGCAGTGT CGCAGATGTG TTCTACCGCC TGGAGATGCC GCTTGTTCCG GTGCTGGTGC GCATGGAACA GGCAGGCATT CTGCTTGATG TTCCGTATAT GCGCGCTCTT GGTGAGCGCA TGGGGCGGGA ACTCGAACAG ATTGAGCAGC AAATCTACGC AATTGCCGGG CAGACATTCA ACATCAACTC CGGTGATCAG CTGAGCGAGG TGTTGTTCGG TCCCAAGATC AACCTGCCAA CTACCGGTCT TGATCGCACC CGAACGGGGC GCTACTCGCT GACCGCGCAG GCGCTCGAAG AACTGCAAGC CAGCGACACC ACCGGCATCA TCGAGTTGAT CCTGCGTCAC CGTCGCCTGA GCAAACTCAA ATCGACCTAC GTCGATGAAC TGCCGGCGCT GGTTAACCCG GAGACCGGCA GGGTTCATAC CGATTACAAC CAGCTTGGCG CCGCGACCGG TCGGTTGAGC AGCAACTCGC CCAACCTGCA AAATATTCCC ACGCGCACCG AAGAGGGGCG CGAGGTGCGG CGCGGTTTCA TCGCTGCGCC GGGTCACCTG CTGATCGCCG CCGACTATTC GCAGATCGAG TTGCGTGTGC TGGCGCATAT GACCGGCGAT CCGAACCTGA TCCAGACCTT TATCGAGGGG CGGGACATCC ACGCGGCAAC TGCCGCCCGG CTGTTTGGCG TCGGCTTCAG TGCGGTGGAC AAGAATCAGC GACGGATCGC AAAAACTGTG GTTTTTGGCG TCATCTATGG CATCAGCCCG TTTGGGCTGG CGCAGCGGTT GGGTATCTCG CGCGAACAGG CGCGTGGTCT GATCGATAGT CTGTTCGATC AGTTCCCGCG TATCCGCGAC TATATCGACC GCACCCTCGA CATCGGGCGG AGCGAAGGGT ATGTGCAGTC ACTCTTCGGT CGTCGCCGCC CCATGTTCGA CCTGCGCGTA TCCGGTCCAC GCCGCCAGGC AGCCGAACGT GAAGCGATCA ACCACCCGAT CCAGTCCACG GCTGCGGACA TTATGAAACT GGCAATGATC GCGGTCGATG CTGAACTCCA GCGTCGACAG ATGCGCACCC GGATGCTGCT TCAGGTTCAC GACGAACTGA TCTTCGAAGC GCCGGAAGCG GAGGTGGACG ATGTGGTGGC GCTGGTGCGC GAGCGGATGG AAGGCGTGTT GCACGGGATG GAACCGCCGT TCGCTGTGCC GTTGCGCGTC GAGATCGAAA CAGGACCGAA CTGGGAAGAA TTGACCCCGG CGGGATGA
|
Protein sequence | MSNNRPLLLL IDGHALAYRA FHALAEAGLR SSTGEPTYAV FGFTSAMLNA IEEYHPDYAA VAFDVGKTFR DDLYAEYKAN RAETPAEFEQ QLERIKQVLA AFDIPIYTAD GYEADDVIGT LARQATERGV DVLILTGDTD TLQLVDEHVT VLLNNPYVRG SKNTTRYGVA DVCARYKGLR PDQLADLRGL KGDPSDNIPG VKGIGEAGAI ALLNQFGSIE NLYDHLDEAP KRYQKHLEGQ RDAALFSKKL ATIVRDAPVT LDLPAATLAD YDRSRVIAVF QELEFGASLV RRLPPSQTIA APQALPPVEP PAPLQVDMFA PATPGPDDGP QQLTLFNDMP TPVAPVVEPP AHDAPGEYRA ACNDADLEAI VTELKHASLF AFDTETRGTN PLRDDLVGIA LATIPGSGWY VPLGHTTGEA QLPRERVIAA LRPFFADPAR SRIAHNAKFD IEVLERAGIP VAGVAFDTML AAALLDKRRN LKDLAFYELN LAAPLESIEA LIGKGKNQVT FADVPIARAT PYAAADADMT LRLKPALEAK LRAAGSVADV FYRLEMPLVP VLVRMEQAGI LLDVPYMRAL GERMGRELEQ IEQQIYAIAG QTFNINSGDQ LSEVLFGPKI NLPTTGLDRT RTGRYSLTAQ ALEELQASDT TGIIELILRH RRLSKLKSTY VDELPALVNP ETGRVHTDYN QLGAATGRLS SNSPNLQNIP TRTEEGREVR RGFIAAPGHL LIAADYSQIE LRVLAHMTGD PNLIQTFIEG RDIHAATAAR LFGVGFSAVD KNQRRIAKTV VFGVIYGISP FGLAQRLGIS REQARGLIDS LFDQFPRIRD YIDRTLDIGR SEGYVQSLFG RRRPMFDLRV SGPRRQAAER EAINHPIQST AADIMKLAMI AVDAELQRRQ MRTRMLLQVH DELIFEAPEA EVDDVVALVR ERMEGVLHGM EPPFAVPLRV EIETGPNWEE LTPAG
|
| |