Gene RoseRS_1899 details

Gene Information

Locus tag: RoseRS_1899
Symbol:
ID: 5208860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2356599
End bp: 2359469
Gene Length: 2871 bp
Protein Length: 956 aa
Translation table: 11
GC content: 61%
IMG OID: 640595508
Product: DNA polymerase III, epsilon subunit
Protein accession: YP_001276238
Protein GI: 148656033
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases; [COG2176] DNA polymerase III, alpha subunit (gram-positive type)
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
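
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2356599
END_BP = 2359469
GENE_LENGTH_BP = 2871
PROTEIN_LENGTH_AA = 956


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2359469 - 2356599 + 1 gives 2871 bp, and 2871 / 3 - 1 gives 956 aa, so the listed fields agree with each other.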


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0589249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAGA TCTACATCGC AATCGATGTC GAAACGACCG GGCTTGAGGC GGGGGTTGAT 
GAGATTATCG AGATCGCTGC GGTCAAGTTC AACGCAGATG AGGTGCTCGA AACGTTCAGC
ACCCTGGTGC AACCGGTCCA TTCACTCCCC CTTAATTCCA GCCGGTTGAC AGGCATCACT
GCAGATATGC TCGCCAGCGC CCCGCGCTTT GCCGAAGTTG CGCCACGTTT TGCCGCGTTC
CTCAAGAATT ACCCGCTCGT CGGGCACAAT GTCGAGTTCG ACCTGCGTAT GCTGCGCGCT
CAGGGGATGC GACTGCCCCA ACCAGCATTC GATACGTTTG AACTGGCAAC GTTGCTGATG
CCGCGCACGC CAGCGTACCG CCTTAGCGCG CTCGCGGCAA CCCTGGGTAT TCATCACGAT
GAAGCTCACC GCGCGCTGAG CGACGCAGAT GTAACCCGTC AGATCTTCCT CTACCTGCTG
CGCCGGATTG AGGCGCTCCC CCTGGACGAT CTCAATGAGA TCATGCGCCT GACCAGCCAG
ATTGAGTGGG GTCTGCGGAG CCTGTTCGAG GAAGCGCAAC GCGCAAAAGC GCGACGCGTC
TTTGTGGACG CAATACCCAT CCAGGCAGAC CGTTACGACG ATCGTGATGA GAAGATCGTG
CCGCTGAAAC CGACCGGTGA TGAGCGCCCG ATCGATCTGG CGGACATCCG TCGGTTCTTC
AGTCCTGATG GCGCGCTCGG TCGCGCGTTT GCGGGGTATG AACAGCGCGA TCAGCAGGTG
CGTATGTCCG AAGCGGTCGC CGATACGTTC AACCAGGGAG GGGCGCTGAT CGCCGAAGCC
GGCACCGGCA CCGGCAAAGG GCTGGCGTAC CTGGTTCCTG CGGCGCTGCA CGCTGCGCGC
CGTGGCGAAC GGGTGGTGAT TTCGACCAAT ACGATCAATC TGCAAGATCA ACTCTTCTTC
AAAGACATCC CGGCGTTGCA ACAGGTGATG TCCAACGGCG CCGACGAAGC GCCGTTCACT
GCGGCGCTGC TCAAAGGGCG CAGCAACTAT CTCTGTCTCA AGCGCTATAA AGACCTGCGC
CGCGATCAGC GGTTGATGTC GGACGATGTG CGCGCGCTGC TCAAGGTGCA GTTGTGGCTG
CCGACGACCG AAAGCGGGGA TCGGGCGGAA CTGCCGCTTC ACGAGGGCGA GAATGCGTCC
TGGAACAGGA TGAGCGCCGC CTGGGAACAG TGCACCGGTC CACGGTGCAG CGAGTTTCAG
CGCTGCTTCT TCTTCAAGGC GCGCAAACAG GCGGAAGCGG CGCACCTGGT GATCGTCAAC
CACGCGCTGC TCATGGCGGA TCTGGCGGTC GAAAACGATG TCATCCCACC CTACGACTAC
CTCATCATCG ATGAGGCGCA CAATCTGGAA GACGTGGCTA CCGACCAGTT GAGTTTCAAT
GTTGATCGTG AAGGATTGCT TGCATTTCTC GACGATATTT TCACCGAAGG GCAGGCGCAG
GTGGTCGGCG GTCTGCTGAG CGAACTGCCC AACCACTTCC GCGAAAGTAT GGCAACCCGC
ACCGACATCG ACCGCGCCGA TGCGATTGCC GATACGTTGC GCCCGGCAGT GGCGCGCGCG
CGCGATGCGG TGTATGGGTG CTTCAATGTG CTGATGACAT TCATCAAGCG CGAAGCCGAA
CTGACCGCCT ATGACTCGCG GTTGCGCATC ACCGATTCGC TGCGACGCAA ACCGTCGTGG
GCAGAGGTCG AACGCGCCTG GGACGCGCTC AGCGTGGCGC TCCACGCTGT CGGCGAGGGG
CTGGGGCGTC TGGAGACGTT GTTGATCGAC CTTAAGGATG CAGAATTACT GGAGTACGAC
GCCCTGATGC TACGGGTGCA GGCGCTCAGG CGCTACGCCA CCGACGTGCG CGTCAATATC
GGGCATATCC TGACCGGCGG CGCGGAAGAA AAGGTCACCT GGCTGACTCA CGATCGTATG
CGCGATACGC TGACTCTTTC CGCAGCGCCG TTGTCGGTCG CTGACGTGCT GCGCACCAAC
CTGTTCGAAG CGAAGACTGC GACCATCCTG ACCTCGGCGA CGCTGTCGGT CGGCGGCAAC
TTTGCCTTCA TCCGTGAACG CATTGGTCTT GATCACGCCG AAGAGTTGAC GCTGGACTCG
CCGTTCGACT ATACCCGGCA GGCGTTGCTC TACATCCCGC AGGATATTCC CGAACCAAAC
CAGAATGGCT ACCAGCGGGC GCTGGAGCAG GCGATCATCG ACCTGGCGCG CGCGACCGGC
GGTCGAATGC TGGTGCTGTT TACCGCTACA AATGCGCTAC GCCAGACGTA CCGCGCCATC
CAGGAGCCGC TGGAAGATGC AGGGATCGCG GTGCTCGGTC AGGGGATCGA CGGTTCGCGC
CGCAGTCTGC TCGAACGTTT CAAGGAGTTT CCCGGTACCG TGCTGCTCGG CACGTCGAGT
TTCTGGGAAG GGGTTGATGT CGTCGGCGAT GCGCTCTCGG TGCTGGTGAT CGCCAAACTC
CCTTTCAGCG TGCCCACCGA CCCGATCTTT GCAGCGCGGT CGGAACAGTT CGACGATCCC
TTCAACCAGT ATGCAGTGCC ACAGTCGATC CTGCGCTTCA AGCAGGGGTT CGGGCGCCTG
ATCCGTTCCA GAGAGGATCG CGGCGTGGTG GCAGTCCTCG ACCGTCGCCT GCTGACGAAG
AAGTATGGAC AGATGTTCCT TGAGTCGCTG CCGCATACCA CCGTGCGCAG CGGACCGTTG
CAACGCCTGC CCGATCTTGC GAAACGCTTC CTGGCGGTCG GCAACGGTGC GATCAACGGC
GCTCCCGGCG CAACGGCAAC CGCTTCTGAA ATGAAACGCA CGCAGCGATA A
 
Protein sequence
MNQIYIAIDV ETTGLEAGVD EIIEIAAVKF NADEVLETFS TLVQPVHSLP LNSSRLTGIT 
ADMLASAPRF AEVAPRFAAF LKNYPLVGHN VEFDLRMLRA QGMRLPQPAF DTFELATLLM
PRTPAYRLSA LAATLGIHHD EAHRALSDAD VTRQIFLYLL RRIEALPLDD LNEIMRLTSQ
IEWGLRSLFE EAQRAKARRV FVDAIPIQAD RYDDRDEKIV PLKPTGDERP IDLADIRRFF
SPDGALGRAF AGYEQRDQQV RMSEAVADTF NQGGALIAEA GTGTGKGLAY LVPAALHAAR
RGERVVISTN TINLQDQLFF KDIPALQQVM SNGADEAPFT AALLKGRSNY LCLKRYKDLR
RDQRLMSDDV RALLKVQLWL PTTESGDRAE LPLHEGENAS WNRMSAAWEQ CTGPRCSEFQ
RCFFFKARKQ AEAAHLVIVN HALLMADLAV ENDVIPPYDY LIIDEAHNLE DVATDQLSFN
VDREGLLAFL DDIFTEGQAQ VVGGLLSELP NHFRESMATR TDIDRADAIA DTLRPAVARA
RDAVYGCFNV LMTFIKREAE LTAYDSRLRI TDSLRRKPSW AEVERAWDAL SVALHAVGEG
LGRLETLLID LKDAELLEYD ALMLRVQALR RYATDVRVNI GHILTGGAEE KVTWLTHDRM
RDTLTLSAAP LSVADVLRTN LFEAKTATIL TSATLSVGGN FAFIRERIGL DHAEELTLDS
PFDYTRQALL YIPQDIPEPN QNGYQRALEQ AIIDLARATG GRMLVLFTAT NALRQTYRAI
QEPLEDAGIA VLGQGIDGSR RSLLERFKEF PGTVLLGTSS FWEGVDVVGD ALSVLVIAKL
PFSVPTDPIF AARSEQFDDP FNQYAVPQSI LRFKQGFGRL IRSREDRGVV AVLDRRLLTK
KYGQMFLESL PHTTVRSGPL QRLPDLAKRF LAVGNGAING APGATATASE MKRTQR
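
The gene and protein sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before they are used. The following is a minimal sketch of how the gene sequence could be cleaned and translated to check it against the protein sequence. It builds the standard codon assignments by hand; this relies on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGAACCAGA TCTACATCGC AATCGATGTC"))  # MNQIYIAIDV
print(translate("ATGAAACGCA CGCAGCGATA A"))           # MKRTQR
```

The two printed fragments match the start (`MNQIYIAIDV`) and end (`MKRTQR`) of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.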