Gene Rsph17025_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4035 
Symbol 
ID5086208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp68719 
End bp71703 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content68% 
IMG OID640485598 
Productdeoxyuridine 5'-triphosphate nucleotidohydrolase Dut 
Protein accessionYP_001170192 
Protein GI146280035 
COG category 
COG ID 
TIGRFAM ID[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.565458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAAAGA CAAATCCGGC ATGGTGGGCA GCAGGTTTCT GTCTGTTCGC ACCGCTCGCA 
ACGCCTGCAG CGGCGGGTCA CCTGTTCACG CAATCCTCCT CGAACAAGCC GACCTGGATT
TCCGAGGGCA AGGGGATCCT CATCCAGCAG TATGGCACGG AGAGCGCCTG GAGCTCGGGC
GGCAACATCG AAATGTACTT CGAGCAGCCG CCCACGGTGC CCGAAGGTGC CGACACCAGC
AGTGCGAACT GGTGGCAATG GAGCGTCGGA GACGTGATGA AGATCACGAT CCCGACCGAT
GCCGACACTG TTATCCTCAC CATCGGCTAT GACACGGCCG GAACCGACGG CTGTGCCTAC
AGCTACTGCG GCATCACCGG CTCCAGCTTC TTCTCCTTCG GAGGGATCAC GGCCCTCCAG
AACCTGTCCC TCAAGGACCA CTCCGGCCAG AGGTATGACG GAAACTACAC CGAAAGCGAC
GTCTATTTCC CTTGGAGCAT CCAGTCCCTG GCCGGCGAAT TCTCCCTCGG CGGTTACCGC
ATCTATACTT CCGATGGCAC GATCAACGGC ACGGGCTCGG GTCCGCTCGA CCAGAACAGC
GTGATCGACG AGGACGACAT CGATGTCGGC GGCGGCGGGC CCAAACCCAT CACAGGCGAC
GACAATTACG ACACCGATCT CGGCACCGAT CTTGCTTACG TCTTCGATGG CGGCACGCTG
AACAGTTCGA CCGAAGTGGG CTCCGACTTC ACCCTTACCG GTAATGGCGG CACGATCCGC
GTGGCCGAGG GCGATCAGGC GAGCTTCACC GGTGTTATCG CTGACGACGA TCCGGCGGCC
CCCGGCCGTC TCACCAAGAC GGGCGACGGC CGGCTGGAAC TGACCGGGAC GAACAGCTAC
TCGGGCGGCA CCTCGGTGAC GGGCGGCACG CTGGCGATCG CCTCCGACAC GGCACTCGGC
GCGGCGGAGG GCGATCTCAC GCTCGACGGC GGCACGCTCG AGACGACGGC GGACGTGACC
TCGGGTCGCG ACATCCAGCT CGGCGCGGCG GGCGGCACGC TGGATGTGAC CGCGGGCCAT
GAGACGGTGC TTGCGGGAAC CGTGGCCGAC GCGGCCGACG GCACGCCGGG CGCGCTCACC
AAGACGGGCG ACGGCCGGCT GGAACTGACC GGGACGAACA GCTACTCGGG CGGCACCTCG
GTGACGGGCG GCACGCTGGC GATCGCCTCC GACGCGGCAC TCGGCGCGGC GGAGGGCGAT
CTCACGCTCG ACGGCGGCAC GCTCGAGACG ACGGCGGACG TGACCTCGGG TCGCGACATC
CAGCTCGGCG CGGCGGGCGG CACGCTGGAT GTGACCGCGG GCCATGAGAC GGTGCTTGCG
GGAACCGTGG CCGACGCGGC CGACGGCACG CCGGGCGCGC TCACCAAGAC GGGCGACGGC
CGGCTGGAAC TGACCGGGAC GAACAGCTAC TCGGGCGGCA CCTCGGTGAC GGGCGGCACG
CTGGCGATCG CCTCCGACGC GGCACTCGGC GCGGCGGAGG GCGATCTCAC GCTCGACGGC
GGCACGCTGG AAGCCACGGG CAACATGACG CTCGCGCGCA CCCTCTTGGT GGGGGAGGCG
GGCGGCACAC TCGAGGTGGG CGGCTCGCGC ACCGTGCGGG CCACGGGCAT TCTCGCGGGC
AGCGGCGATC TCGCCAAGAC CGGCAGCGGC AGCTTCCTCT TCTCGGGCAT GGGCCTCCAT
ACCGGCGCCC TCTCGATCCT CGAGGGCACC TTCGGCACGA GCGGCATTCT GTTGGCGGAC
TCGATCTCGG TGGCGTCTGG CGCGCGGCTC GATGCCTCGA ACAGGGTTGC AGCCGACATC
GAGGTGGCGG GAACGCTTGC GGTGAACGAA GCGGTCTCGA CCCTAACGCT CACCCGGAAC
ATGACGCTGC AGGAGAACGC GACGCTCGAG CTTGACATTG ATGGGCGCGC CTTCAGCACC
GAGGGTGGCG CCGGATCCTA CGACCGGATC GACGTCGTGG GGCTGGATGC GGTCTTCTCG
GCCGCGGGAC GGCTGGTGCC GATGCTGCGC AACATCGCGG GCGCGGCCAC CAACAGCTTC
ACCGCGCTGA TCGGCGACAG CTTCCGCATC GTGACCACCG CGAATGAGAA CGGTATCTCC
GGCGCCTTCT CCGAAATCGT GACCCCGGCC GAGGGTCTCG CCGCAAACAG CCGCTTCGCC
GTGGTCTACG GTTCGGACTA TATCGACCTC GTGGTAACAG CTGACAGCTT CGAGCTGCTG
GCAATGCCCT TCGGCAACCG CAACGCCACG GCGACCGGCG CGGCGGTGGA CATGATGCGC
GACGGCGACC CCGGCTTCGA CAGCACCGAG CTCCTCTACG GACTCTATGG ACTCGACACC
GCCCAGACCG CGCGGGCGCT GGCCCAGCTC TCAGGCGAGA TCCACGCCTT CGCCCTCTCC
GACCTGCGCA AGGCTGACCG CGTGGCGGCC GAGCGGCTGA CTGCAGGCGC ACAGGGCCTG
CGTCCCGACG GGCGCAATGC CTGGGTGGAC GTGAGCGGCC TGAGCTTCGA GGCGGACGGC
TCGGCGCAGG CCTCTGGCAA TTCGTCGGAC AGCACCCTCG TGTGGCTCGG GCTCGACCTG
CTGCGCACCG GGAACGCCAC GCTGGGCCTG GCCCTCGGCC AGTCGGAGGG CGACCTCGAC
GCGGGTCTCT CGGGCGATGC CTCTCGCACG ACAGACTCGC TCGCACTCTA CGGCTTCGGC
CTGGCGGGCC GGCTCTCCTA TGACGTGAGC CTGATGGCCA GTCGGTCCGA CATCGAAGGC
GACCGCACCG TGACCCTCGG CACGGGGGTC CAGTCGAACA GCTTCGACGC GGAGATGACC
TCGATCCAGG CATCCGGTCG CATCGGTTAC CGGATGGACT TCCCGGACCA GACTGCGGTC
ATGCCCTGGA TAGGTGCCGA GGTGAACTGG ATGCGCGCGG CTTGA
 
Protein sequence
MRKTNPAWWA AGFCLFAPLA TPAAAGHLFT QSSSNKPTWI SEGKGILIQQ YGTESAWSSG 
GNIEMYFEQP PTVPEGADTS SANWWQWSVG DVMKITIPTD ADTVILTIGY DTAGTDGCAY
SYCGITGSSF FSFGGITALQ NLSLKDHSGQ RYDGNYTESD VYFPWSIQSL AGEFSLGGYR
IYTSDGTING TGSGPLDQNS VIDEDDIDVG GGGPKPITGD DNYDTDLGTD LAYVFDGGTL
NSSTEVGSDF TLTGNGGTIR VAEGDQASFT GVIADDDPAA PGRLTKTGDG RLELTGTNSY
SGGTSVTGGT LAIASDTALG AAEGDLTLDG GTLETTADVT SGRDIQLGAA GGTLDVTAGH
ETVLAGTVAD AADGTPGALT KTGDGRLELT GTNSYSGGTS VTGGTLAIAS DAALGAAEGD
LTLDGGTLET TADVTSGRDI QLGAAGGTLD VTAGHETVLA GTVADAADGT PGALTKTGDG
RLELTGTNSY SGGTSVTGGT LAIASDAALG AAEGDLTLDG GTLEATGNMT LARTLLVGEA
GGTLEVGGSR TVRATGILAG SGDLAKTGSG SFLFSGMGLH TGALSILEGT FGTSGILLAD
SISVASGARL DASNRVAADI EVAGTLAVNE AVSTLTLTRN MTLQENATLE LDIDGRAFST
EGGAGSYDRI DVVGLDAVFS AAGRLVPMLR NIAGAATNSF TALIGDSFRI VTTANENGIS
GAFSEIVTPA EGLAANSRFA VVYGSDYIDL VVTADSFELL AMPFGNRNAT ATGAAVDMMR
DGDPGFDSTE LLYGLYGLDT AQTARALAQL SGEIHAFALS DLRKADRVAA ERLTAGAQGL
RPDGRNAWVD VSGLSFEADG SAQASGNSSD STLVWLGLDL LRTGNATLGL ALGQSEGDLD
AGLSGDASRT TDSLALYGFG LAGRLSYDVS LMASRSDIEG DRTVTLGTGV QSNSFDAEMT
SIQASGRIGY RMDFPDQTAV MPWIGAEVNW MRAA