Gene RoseRS_3415 details


Gene Information

Locus tag: RoseRS_3415
Symbol:
ID: 5210392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4286078
End bp: 4288975
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 61%
IMG OID: 640597010
Product: DNA polymerase I
Protein accession: YP_001277723
Protein GI: 148657518
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
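
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4286078
END_BP = 4288975


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the terminal stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2898 965
```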


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00208167
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0804526
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAACA ACCGTCCGCT GCTGCTGCTG ATCGATGGCC ATGCGCTGGC GTACCGCGCG 
TTCCACGCCC TGGCGGAGGC TGGTTTACGC TCTTCGACCG GCGAGCCGAC GTATGCGGTC
TTCGGTTTTA CGTCGGCGAT GCTCAATGCC ATCGAGGAGT ATCATCCCGA CTATGCAGCA
GTGGCATTCG ATGTCGGGAA GACCTTCCGC GACGACCTGT ACGCCGAATA CAAGGCGAAC
CGCGCTGAAA CTCCTGCCGA GTTCGAGCAG CAACTCGAGC GCATTAAGCA AGTGCTGGCG
GCGTTCGACA TCCCGATCTA TACCGCCGAT GGGTATGAAG CCGACGACGT GATCGGCACG
CTGGCGCGTC AGGCGACGGA GCGCGGCGTT GATGTGCTGA TCCTGACCGG CGATACCGAT
ACGCTTCAAC TGGTCGATGA GCATGTGACA GTGCTGCTCA ATAATCCCTA TGTGCGCGGT
TCCAAAAACA CAACGCGCTA TGGGGTCGCC GATGTGTGCG CGCGTTACAA AGGGTTGCGC
CCCGATCAAC TCGCCGATCT GCGTGGGTTG AAGGGCGATC CGTCCGATAA TATCCCCGGC
GTCAAAGGGA TCGGGGAAGC GGGCGCAATT GCGTTGCTCA ATCAGTTTGG CTCAATTGAG
AACCTGTACG ATCATCTGGA CGAAGCGCCG AAACGCTACC AGAAGCACCT GGAAGGTCAG
CGTGACGCGG CGTTGTTCAG CAAGAAACTG GCGACAATTG TACGTGATGC GCCGGTGACG
CTCGACCTTC CTGCCGCAAC TCTTGCCGAC TATGATCGCA GTCGGGTGAT TGCTGTCTTC
CAGGAACTTG AATTCGGTGC GTCACTGGTC AGGCGCCTCC CCCCGTCGCA GACCATCGCA
GCGCCGCAGG CGCTGCCGCC GGTCGAGCCG CCTGCGCCGC TTCAGGTCGA TATGTTTGCC
CCTGCAACAC CTGGACCGGA TGACGGACCG CAGCAGCTGA CCCTGTTCAA CGATATGCCG
ACGCCTGTTG CGCCGGTGGT TGAGCCGCCA GCGCACGATG CGCCTGGTGA GTATCGCGCC
GCGTGCAACG ATGCTGATCT GGAAGCAATT GTCACAGAAC TCAAGCATGC GTCGCTATTT
GCATTCGATA CCGAGACGCG CGGCACCAAT CCCCTGCGCG ACGATCTGGT GGGCATCGCC
CTGGCGACGA TCCCCGGCAG TGGATGGTAT GTTCCGCTGG GGCATACCAC CGGCGAGGCG
CAACTGCCGC GTGAGCGAGT CATTGCTGCG CTGCGTCCGT TTTTCGCCGA TCCGGCGCGC
TCCAGAATAG CGCACAATGC GAAGTTCGAC ATCGAGGTGC TGGAACGCGC TGGCATCCCG
GTTGCCGGTG TGGCGTTCGA CACGATGCTG GCAGCGGCGC TGCTCGACAA ACGGCGCAAC
CTGAAAGACC TGGCGTTCTA TGAACTGAAC CTCGCCGCTC CGCTCGAATC GATTGAAGCG
TTGATCGGGA AGGGCAAAAA CCAGGTGACC TTCGCCGATG TGCCGATTGC GCGCGCCACG
CCGTATGCCG CTGCCGACGC CGATATGACG CTGCGCCTGA AGCCAGCGCT TGAAGCGAAA
CTGCGCGCAG CCGGCAGTGT CGCAGATGTG TTCTACCGCC TGGAGATGCC GCTTGTTCCG
GTGCTGGTGC GCATGGAACA GGCAGGCATT CTGCTTGATG TTCCGTATAT GCGCGCTCTT
GGTGAGCGCA TGGGGCGGGA ACTCGAACAG ATTGAGCAGC AAATCTACGC AATTGCCGGG
CAGACATTCA ACATCAACTC CGGTGATCAG CTGAGCGAGG TGTTGTTCGG TCCCAAGATC
AACCTGCCAA CTACCGGTCT TGATCGCACC CGAACGGGGC GCTACTCGCT GACCGCGCAG
GCGCTCGAAG AACTGCAAGC CAGCGACACC ACCGGCATCA TCGAGTTGAT CCTGCGTCAC
CGTCGCCTGA GCAAACTCAA ATCGACCTAC GTCGATGAAC TGCCGGCGCT GGTTAACCCG
GAGACCGGCA GGGTTCATAC CGATTACAAC CAGCTTGGCG CCGCGACCGG TCGGTTGAGC
AGCAACTCGC CCAACCTGCA AAATATTCCC ACGCGCACCG AAGAGGGGCG CGAGGTGCGG
CGCGGTTTCA TCGCTGCGCC GGGTCACCTG CTGATCGCCG CCGACTATTC GCAGATCGAG
TTGCGTGTGC TGGCGCATAT GACCGGCGAT CCGAACCTGA TCCAGACCTT TATCGAGGGG
CGGGACATCC ACGCGGCAAC TGCCGCCCGG CTGTTTGGCG TCGGCTTCAG TGCGGTGGAC
AAGAATCAGC GACGGATCGC AAAAACTGTG GTTTTTGGCG TCATCTATGG CATCAGCCCG
TTTGGGCTGG CGCAGCGGTT GGGTATCTCG CGCGAACAGG CGCGTGGTCT GATCGATAGT
CTGTTCGATC AGTTCCCGCG TATCCGCGAC TATATCGACC GCACCCTCGA CATCGGGCGG
AGCGAAGGGT ATGTGCAGTC ACTCTTCGGT CGTCGCCGCC CCATGTTCGA CCTGCGCGTA
TCCGGTCCAC GCCGCCAGGC AGCCGAACGT GAAGCGATCA ACCACCCGAT CCAGTCCACG
GCTGCGGACA TTATGAAACT GGCAATGATC GCGGTCGATG CTGAACTCCA GCGTCGACAG
ATGCGCACCC GGATGCTGCT TCAGGTTCAC GACGAACTGA TCTTCGAAGC GCCGGAAGCG
GAGGTGGACG ATGTGGTGGC GCTGGTGCGC GAGCGGATGG AAGGCGTGTT GCACGGGATG
GAACCGCCGT TCGCTGTGCC GTTGCGCGTC GAGATCGAAA CAGGACCGAA CTGGGAAGAA
TTGACCCCGG CGGGATGA
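
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of that clean-up and of a GC percentage calculation; the helper names are hypothetical, and only the first line of the sequence is used as a sample, so the result is not expected to equal the 61% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a small sample.
first_line = "GTGTCCAACA ACCGTCCGCT GCTGCTGCTG ATCGATGGCC ATGCGCTGGC GTACCGCGCG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` gives the value to compare against the GC content field.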
 
Protein sequence
MSNNRPLLLL IDGHALAYRA FHALAEAGLR SSTGEPTYAV FGFTSAMLNA IEEYHPDYAA 
VAFDVGKTFR DDLYAEYKAN RAETPAEFEQ QLERIKQVLA AFDIPIYTAD GYEADDVIGT
LARQATERGV DVLILTGDTD TLQLVDEHVT VLLNNPYVRG SKNTTRYGVA DVCARYKGLR
PDQLADLRGL KGDPSDNIPG VKGIGEAGAI ALLNQFGSIE NLYDHLDEAP KRYQKHLEGQ
RDAALFSKKL ATIVRDAPVT LDLPAATLAD YDRSRVIAVF QELEFGASLV RRLPPSQTIA
APQALPPVEP PAPLQVDMFA PATPGPDDGP QQLTLFNDMP TPVAPVVEPP AHDAPGEYRA
ACNDADLEAI VTELKHASLF AFDTETRGTN PLRDDLVGIA LATIPGSGWY VPLGHTTGEA
QLPRERVIAA LRPFFADPAR SRIAHNAKFD IEVLERAGIP VAGVAFDTML AAALLDKRRN
LKDLAFYELN LAAPLESIEA LIGKGKNQVT FADVPIARAT PYAAADADMT LRLKPALEAK
LRAAGSVADV FYRLEMPLVP VLVRMEQAGI LLDVPYMRAL GERMGRELEQ IEQQIYAIAG
QTFNINSGDQ LSEVLFGPKI NLPTTGLDRT RTGRYSLTAQ ALEELQASDT TGIIELILRH
RRLSKLKSTY VDELPALVNP ETGRVHTDYN QLGAATGRLS SNSPNLQNIP TRTEEGREVR
RGFIAAPGHL LIAADYSQIE LRVLAHMTGD PNLIQTFIEG RDIHAATAAR LFGVGFSAVD
KNQRRIAKTV VFGVIYGISP FGLAQRLGIS REQARGLIDS LFDQFPRIRD YIDRTLDIGR
SEGYVQSLFG RRRPMFDLRV SGPRRQAAER EAINHPIQST AADIMKLAMI AVDAELQRRQ
MRTRMLLQVH DELIFEAPEA EVDDVVALVR ERMEGVLHGM EPPFAVPLRV EIETGPNWEE
LTPAG
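
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own procedure: it assumes the input starts at the annotated start codon (here GTG, read as methionine at the initiation position) and uses the codon assignments of translation table 11, which match the standard code for elongation codons. The function name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS that begins at its start codon; drop a final stop."""
    seq = "".join(seq.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be a non-empty multiple of 3 bases")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    residues[0] = "M"  # assumption: first codon is the initiator (e.g. GTG)
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCCAACA ACCGTCCGCT GCTGCTGCTG"))  # MSNNRPLLLL
```

The output matches the first ten residues of the protein sequence listed above.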