Gene Information |
Locus tag | Rsph17025_4035 |
Symbol | |
ID | 5086208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | - |
Start bp | 68719 |
End bp | 71703 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640485598 |
Product | deoxyuridine 5'-triphosphate nucleotidohydrolase Dut |
Protein accession | YP_001170192 |
Protein GI | 146280035 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02601] autotransporter-associated beta strand repeat |
|
|
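The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 68719, End bp 71703.
print(cds_lengths(68719, 71703))  # (2985, 994)
```

The result matches the listed Gene Length (2985 bp) and Protein Length (994 aa).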
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.565458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAAAGA CAAATCCGGC ATGGTGGGCA GCAGGTTTCT GTCTGTTCGC ACCGCTCGCA ACGCCTGCAG CGGCGGGTCA CCTGTTCACG CAATCCTCCT CGAACAAGCC GACCTGGATT TCCGAGGGCA AGGGGATCCT CATCCAGCAG TATGGCACGG AGAGCGCCTG GAGCTCGGGC GGCAACATCG AAATGTACTT CGAGCAGCCG CCCACGGTGC CCGAAGGTGC CGACACCAGC AGTGCGAACT GGTGGCAATG GAGCGTCGGA GACGTGATGA AGATCACGAT CCCGACCGAT GCCGACACTG TTATCCTCAC CATCGGCTAT GACACGGCCG GAACCGACGG CTGTGCCTAC AGCTACTGCG GCATCACCGG CTCCAGCTTC TTCTCCTTCG GAGGGATCAC GGCCCTCCAG AACCTGTCCC TCAAGGACCA CTCCGGCCAG AGGTATGACG GAAACTACAC CGAAAGCGAC GTCTATTTCC CTTGGAGCAT CCAGTCCCTG GCCGGCGAAT TCTCCCTCGG CGGTTACCGC ATCTATACTT CCGATGGCAC GATCAACGGC ACGGGCTCGG GTCCGCTCGA CCAGAACAGC GTGATCGACG AGGACGACAT CGATGTCGGC GGCGGCGGGC CCAAACCCAT CACAGGCGAC GACAATTACG ACACCGATCT CGGCACCGAT CTTGCTTACG TCTTCGATGG CGGCACGCTG AACAGTTCGA CCGAAGTGGG CTCCGACTTC ACCCTTACCG GTAATGGCGG CACGATCCGC GTGGCCGAGG GCGATCAGGC GAGCTTCACC GGTGTTATCG CTGACGACGA TCCGGCGGCC CCCGGCCGTC TCACCAAGAC GGGCGACGGC CGGCTGGAAC TGACCGGGAC GAACAGCTAC TCGGGCGGCA CCTCGGTGAC GGGCGGCACG CTGGCGATCG CCTCCGACAC GGCACTCGGC GCGGCGGAGG GCGATCTCAC GCTCGACGGC GGCACGCTCG AGACGACGGC GGACGTGACC TCGGGTCGCG ACATCCAGCT CGGCGCGGCG GGCGGCACGC TGGATGTGAC CGCGGGCCAT GAGACGGTGC TTGCGGGAAC CGTGGCCGAC GCGGCCGACG GCACGCCGGG CGCGCTCACC AAGACGGGCG ACGGCCGGCT GGAACTGACC GGGACGAACA GCTACTCGGG CGGCACCTCG GTGACGGGCG GCACGCTGGC GATCGCCTCC GACGCGGCAC TCGGCGCGGC GGAGGGCGAT CTCACGCTCG ACGGCGGCAC GCTCGAGACG ACGGCGGACG TGACCTCGGG TCGCGACATC CAGCTCGGCG CGGCGGGCGG CACGCTGGAT GTGACCGCGG GCCATGAGAC GGTGCTTGCG GGAACCGTGG CCGACGCGGC CGACGGCACG CCGGGCGCGC TCACCAAGAC GGGCGACGGC CGGCTGGAAC TGACCGGGAC GAACAGCTAC TCGGGCGGCA CCTCGGTGAC GGGCGGCACG CTGGCGATCG CCTCCGACGC GGCACTCGGC GCGGCGGAGG GCGATCTCAC GCTCGACGGC GGCACGCTGG AAGCCACGGG CAACATGACG CTCGCGCGCA CCCTCTTGGT GGGGGAGGCG GGCGGCACAC TCGAGGTGGG CGGCTCGCGC ACCGTGCGGG CCACGGGCAT TCTCGCGGGC AGCGGCGATC TCGCCAAGAC CGGCAGCGGC AGCTTCCTCT TCTCGGGCAT GGGCCTCCAT ACCGGCGCCC TCTCGATCCT CGAGGGCACC TTCGGCACGA GCGGCATTCT GTTGGCGGAC 
TCGATCTCGG TGGCGTCTGG CGCGCGGCTC GATGCCTCGA ACAGGGTTGC AGCCGACATC GAGGTGGCGG GAACGCTTGC GGTGAACGAA GCGGTCTCGA CCCTAACGCT CACCCGGAAC ATGACGCTGC AGGAGAACGC GACGCTCGAG CTTGACATTG ATGGGCGCGC CTTCAGCACC GAGGGTGGCG CCGGATCCTA CGACCGGATC GACGTCGTGG GGCTGGATGC GGTCTTCTCG GCCGCGGGAC GGCTGGTGCC GATGCTGCGC AACATCGCGG GCGCGGCCAC CAACAGCTTC ACCGCGCTGA TCGGCGACAG CTTCCGCATC GTGACCACCG CGAATGAGAA CGGTATCTCC GGCGCCTTCT CCGAAATCGT GACCCCGGCC GAGGGTCTCG CCGCAAACAG CCGCTTCGCC GTGGTCTACG GTTCGGACTA TATCGACCTC GTGGTAACAG CTGACAGCTT CGAGCTGCTG GCAATGCCCT TCGGCAACCG CAACGCCACG GCGACCGGCG CGGCGGTGGA CATGATGCGC GACGGCGACC CCGGCTTCGA CAGCACCGAG CTCCTCTACG GACTCTATGG ACTCGACACC GCCCAGACCG CGCGGGCGCT GGCCCAGCTC TCAGGCGAGA TCCACGCCTT CGCCCTCTCC GACCTGCGCA AGGCTGACCG CGTGGCGGCC GAGCGGCTGA CTGCAGGCGC ACAGGGCCTG CGTCCCGACG GGCGCAATGC CTGGGTGGAC GTGAGCGGCC TGAGCTTCGA GGCGGACGGC TCGGCGCAGG CCTCTGGCAA TTCGTCGGAC AGCACCCTCG TGTGGCTCGG GCTCGACCTG CTGCGCACCG GGAACGCCAC GCTGGGCCTG GCCCTCGGCC AGTCGGAGGG CGACCTCGAC GCGGGTCTCT CGGGCGATGC CTCTCGCACG ACAGACTCGC TCGCACTCTA CGGCTTCGGC CTGGCGGGCC GGCTCTCCTA TGACGTGAGC CTGATGGCCA GTCGGTCCGA CATCGAAGGC GACCGCACCG TGACCCTCGG CACGGGGGTC CAGTCGAACA GCTTCGACGC GGAGATGACC TCGATCCAGG CATCCGGTCG CATCGGTTAC CGGATGGACT TCCCGGACCA GACTGCGGTC ATGCCCTGGA TAGGTGCCGA GGTGAACTGG ATGCGCGCGG CTTGA
|
Protein sequence | MRKTNPAWWA AGFCLFAPLA TPAAAGHLFT QSSSNKPTWI SEGKGILIQQ YGTESAWSSG GNIEMYFEQP PTVPEGADTS SANWWQWSVG DVMKITIPTD ADTVILTIGY DTAGTDGCAY SYCGITGSSF FSFGGITALQ NLSLKDHSGQ RYDGNYTESD VYFPWSIQSL AGEFSLGGYR IYTSDGTING TGSGPLDQNS VIDEDDIDVG GGGPKPITGD DNYDTDLGTD LAYVFDGGTL NSSTEVGSDF TLTGNGGTIR VAEGDQASFT GVIADDDPAA PGRLTKTGDG RLELTGTNSY SGGTSVTGGT LAIASDTALG AAEGDLTLDG GTLETTADVT SGRDIQLGAA GGTLDVTAGH ETVLAGTVAD AADGTPGALT KTGDGRLELT GTNSYSGGTS VTGGTLAIAS DAALGAAEGD LTLDGGTLET TADVTSGRDI QLGAAGGTLD VTAGHETVLA GTVADAADGT PGALTKTGDG RLELTGTNSY SGGTSVTGGT LAIASDAALG AAEGDLTLDG GTLEATGNMT LARTLLVGEA GGTLEVGGSR TVRATGILAG SGDLAKTGSG SFLFSGMGLH TGALSILEGT FGTSGILLAD SISVASGARL DASNRVAADI EVAGTLAVNE AVSTLTLTRN MTLQENATLE LDIDGRAFST EGGAGSYDRI DVVGLDAVFS AAGRLVPMLR NIAGAATNSF TALIGDSFRI VTTANENGIS GAFSEIVTPA EGLAANSRFA VVYGSDYIDL VVTADSFELL AMPFGNRNAT ATGAAVDMMR DGDPGFDSTE LLYGLYGLDT AQTARALAQL SGEIHAFALS DLRKADRVAA ERLTAGAQGL RPDGRNAWVD VSGLSFEADG SAQASGNSSD STLVWLGLDL LRTGNATLGL ALGQSEGDLD AGLSGDASRT TDSLALYGFG LAGRLSYDVS LMASRSDIEG DRTVTLGTGV QSNSFDAEMT SIQASGRIGY RMDFPDQTAV MPWIGAEVNW MRAA
|
| |
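The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first 30 nucleotides of the record. It is a simplification under stated assumptions: `translate` is a hypothetical helper, the codon assignments are those of the standard genetic code (which table 11 shares), and only three of the table 11 start codons (ATG, GTG, TTG) are handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle through T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the table 11 initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("GTGCGAAAGA CAAATCCGGC ATGGTGGGCA"))  # MRKTNPAWWA
```

The output agrees with the first ten residues of the listed protein sequence. The sequence is shown in coding orientation, so no reverse complement is applied here even though the gene lies on the minus strand.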