Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4003 |
Symbol | |
ID | 5086178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 32265 |
End bp | 34172 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640485562 |
Product | hypothetical protein |
Protein accession | YP_001170162 |
Protein GI | 146280005 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.738232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGATG GTGATGTGGC GCATTACATG ATCCAGCCTG CAATCTGGCC GGAACGCGGA ATTTCCACAG AGACGCAGCC TTACTTCAAG TTGTCGGGCC CCGCAGGAAC CTCGCTCACC AGGGGCGGAT TTACCTTTGC GCCGGGAGGA TCGGTTCTGG TTGATGGTTA CTTCAACCTG TTCAACCTCG GCAAATGGTG TGATCTGTGT GGACCCGATC CGGTCAGCCT CTGGCTGAGG GGCGACGGGC GGTTCCAGCT CACGGTCTGG CTGGCCGCCC CGGACTGCTC ATATGAACGC CTATTCGACG AATGTGTCTT TCTCGACGGT GAGATGAGCC TGCCGCTGGA GGGCGCGCAG ATCCAAGGCG CTGGCATCCT GTACTTCCAC CTGACAGCGC TCTCCGAGGG ACAGATCGAG GATTTCGGGT GGAGCACGAC GGTTGCTCCA CGCCATCGTC CTGATCTTGT CCTCTCGGTG ACGACGTTCA AGCGAGAGGA GGCCGTCACC TCCATGGTGA ACCGATTCCG CCGGTTTAGG GCCGCCTCCA CCCTGCGCGA CCATCTCCGC ATGCTCGTGG TCGACAACGG CCAGTCGGTG CCGATCGACC AGGGAGAAGG CGTGACAATC CTTCCCAACG CCAACCTGGG GGGAGCAGGC GGCTTCTCCC GCGGCCTGCT CGAGGTTCGC AAGGCCGGCG CCACACATTG TCTGTTCATG GATGATGACG CCTCGATGCA TATGGGCGCG ATCACCCGGG TCTGGATGCT GCTGGCCTAT GCGCGGGATC CGCGCACGAC CGTCGCCGGG GCCATGATCA ACGCGGATCA CCGCTGGAAG CTGTGGGAAA ACGGCGCGGT CTTCGACAGG GGCTGCAAGC CGCTCTATTT CGGCTGTGAC CTGCGGCAAC AGGAGGACGT GTTCAAGATG GAGTTCGAGA CCACGGCGCC CCCTCCGCCG GGTTTCTATG GCGGCTGGTG GTTCTTCGCC TTTCCGGTGG ACAGGGCGCG GCACATGCCC TTCCCCTTCT TCGTTCGAGG TGACGACGTC AGCTTCTCTC TGGTCAACGA CCTGCGCATC GTCACCCTGC CCGGGGTGGC GTCGGTCCAG GAAAGCTTTG TCGACAAGGC ATCGCCGCAG ACCTGGTATC TCGACATGCG CAGCCACCTT GTCCACCATC TGAGCCTGCC GCAGAAGAGT GTCAGCTGGG GCGGCCTGCA GCGCATGTTC TTCAGCTTCT ATCTGAGAAC GGTGCTGCGC TATCACTACG ACAGCCTGTC CGCCGTCAAT CTCGCGATCG AGGACGTCAT GCAGGGCCCA CGGTTCTTCG CCGAGAACGC CGACATGGCG CAGCGGCGCA AGGATCTGAA GGAGATGACG AGAACGGAAG TCTGGACACC GGTCGACTTC CCGCCCCGGT ACCGGCACGG CAAGGCGTCG CGTCCGCTGC GCGCCCTCCT GCTGATGACG CTGAACGGCC ATCTTCTGCC CTTCAGCAAC CTTTTCGGCA GCAATCTGGT CCTCAAGGCT TGGGCGCGCG AGGACTTCCG TCAGGTCTAC GGTGCCCGGC GGATCACCTA TGTGAACGCA TCTCGCAACG CCGTCTACAC GGTGCAGCGC AGTCGCCGCC GCTTCTGGTC CGAAAGCCTG CGCCTAGTGC GCAACAGCCT CAGGCTGCGC AGGGCCTACG GCCGGTTGCA GGCCGAATGG CAGGACGGCT ATCCGAAGTT GACGTCAGAC GAGTTCTGGC ACCGCAAGCT GGGCCTTGTC GACGAAGGCA AGGGCGGCGT GCCTGACAGA TCGATCGAGA TCAAGTCACC TGCGGATCAG AGCACGAGCC CCGGATCCTC TCTCATCTCG ATGCCGGGAA AGACACTGCG CTCCAGCAGC TCCCGGTCAA ACCCGTAG
|
Protein sequence | MRDGDVAHYM IQPAIWPERG ISTETQPYFK LSGPAGTSLT RGGFTFAPGG SVLVDGYFNL FNLGKWCDLC GPDPVSLWLR GDGRFQLTVW LAAPDCSYER LFDECVFLDG EMSLPLEGAQ IQGAGILYFH LTALSEGQIE DFGWSTTVAP RHRPDLVLSV TTFKREEAVT SMVNRFRRFR AASTLRDHLR MLVVDNGQSV PIDQGEGVTI LPNANLGGAG GFSRGLLEVR KAGATHCLFM DDDASMHMGA ITRVWMLLAY ARDPRTTVAG AMINADHRWK LWENGAVFDR GCKPLYFGCD LRQQEDVFKM EFETTAPPPP GFYGGWWFFA FPVDRARHMP FPFFVRGDDV SFSLVNDLRI VTLPGVASVQ ESFVDKASPQ TWYLDMRSHL VHHLSLPQKS VSWGGLQRMF FSFYLRTVLR YHYDSLSAVN LAIEDVMQGP RFFAENADMA QRRKDLKEMT RTEVWTPVDF PPRYRHGKAS RPLRALLLMT LNGHLLPFSN LFGSNLVLKA WAREDFRQVY GARRITYVNA SRNAVYTVQR SRRRFWSESL RLVRNSLRLR RAYGRLQAEW QDGYPKLTSD EFWHRKLGLV DEGKGGVPDR SIEIKSPADQ STSPGSSLIS MPGKTLRSSS SRSNP
|
| |