Gene Rsph17025_4003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4003 
Symbol 
ID5086178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp32265 
End bp34172 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content63% 
IMG OID640485562 
Producthypothetical protein 
Protein accessionYP_001170162 
Protein GI146280005 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.738232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGATG GTGATGTGGC GCATTACATG ATCCAGCCTG CAATCTGGCC GGAACGCGGA 
ATTTCCACAG AGACGCAGCC TTACTTCAAG TTGTCGGGCC CCGCAGGAAC CTCGCTCACC
AGGGGCGGAT TTACCTTTGC GCCGGGAGGA TCGGTTCTGG TTGATGGTTA CTTCAACCTG
TTCAACCTCG GCAAATGGTG TGATCTGTGT GGACCCGATC CGGTCAGCCT CTGGCTGAGG
GGCGACGGGC GGTTCCAGCT CACGGTCTGG CTGGCCGCCC CGGACTGCTC ATATGAACGC
CTATTCGACG AATGTGTCTT TCTCGACGGT GAGATGAGCC TGCCGCTGGA GGGCGCGCAG
ATCCAAGGCG CTGGCATCCT GTACTTCCAC CTGACAGCGC TCTCCGAGGG ACAGATCGAG
GATTTCGGGT GGAGCACGAC GGTTGCTCCA CGCCATCGTC CTGATCTTGT CCTCTCGGTG
ACGACGTTCA AGCGAGAGGA GGCCGTCACC TCCATGGTGA ACCGATTCCG CCGGTTTAGG
GCCGCCTCCA CCCTGCGCGA CCATCTCCGC ATGCTCGTGG TCGACAACGG CCAGTCGGTG
CCGATCGACC AGGGAGAAGG CGTGACAATC CTTCCCAACG CCAACCTGGG GGGAGCAGGC
GGCTTCTCCC GCGGCCTGCT CGAGGTTCGC AAGGCCGGCG CCACACATTG TCTGTTCATG
GATGATGACG CCTCGATGCA TATGGGCGCG ATCACCCGGG TCTGGATGCT GCTGGCCTAT
GCGCGGGATC CGCGCACGAC CGTCGCCGGG GCCATGATCA ACGCGGATCA CCGCTGGAAG
CTGTGGGAAA ACGGCGCGGT CTTCGACAGG GGCTGCAAGC CGCTCTATTT CGGCTGTGAC
CTGCGGCAAC AGGAGGACGT GTTCAAGATG GAGTTCGAGA CCACGGCGCC CCCTCCGCCG
GGTTTCTATG GCGGCTGGTG GTTCTTCGCC TTTCCGGTGG ACAGGGCGCG GCACATGCCC
TTCCCCTTCT TCGTTCGAGG TGACGACGTC AGCTTCTCTC TGGTCAACGA CCTGCGCATC
GTCACCCTGC CCGGGGTGGC GTCGGTCCAG GAAAGCTTTG TCGACAAGGC ATCGCCGCAG
ACCTGGTATC TCGACATGCG CAGCCACCTT GTCCACCATC TGAGCCTGCC GCAGAAGAGT
GTCAGCTGGG GCGGCCTGCA GCGCATGTTC TTCAGCTTCT ATCTGAGAAC GGTGCTGCGC
TATCACTACG ACAGCCTGTC CGCCGTCAAT CTCGCGATCG AGGACGTCAT GCAGGGCCCA
CGGTTCTTCG CCGAGAACGC CGACATGGCG CAGCGGCGCA AGGATCTGAA GGAGATGACG
AGAACGGAAG TCTGGACACC GGTCGACTTC CCGCCCCGGT ACCGGCACGG CAAGGCGTCG
CGTCCGCTGC GCGCCCTCCT GCTGATGACG CTGAACGGCC ATCTTCTGCC CTTCAGCAAC
CTTTTCGGCA GCAATCTGGT CCTCAAGGCT TGGGCGCGCG AGGACTTCCG TCAGGTCTAC
GGTGCCCGGC GGATCACCTA TGTGAACGCA TCTCGCAACG CCGTCTACAC GGTGCAGCGC
AGTCGCCGCC GCTTCTGGTC CGAAAGCCTG CGCCTAGTGC GCAACAGCCT CAGGCTGCGC
AGGGCCTACG GCCGGTTGCA GGCCGAATGG CAGGACGGCT ATCCGAAGTT GACGTCAGAC
GAGTTCTGGC ACCGCAAGCT GGGCCTTGTC GACGAAGGCA AGGGCGGCGT GCCTGACAGA
TCGATCGAGA TCAAGTCACC TGCGGATCAG AGCACGAGCC CCGGATCCTC TCTCATCTCG
ATGCCGGGAA AGACACTGCG CTCCAGCAGC TCCCGGTCAA ACCCGTAG
 
Protein sequence
MRDGDVAHYM IQPAIWPERG ISTETQPYFK LSGPAGTSLT RGGFTFAPGG SVLVDGYFNL 
FNLGKWCDLC GPDPVSLWLR GDGRFQLTVW LAAPDCSYER LFDECVFLDG EMSLPLEGAQ
IQGAGILYFH LTALSEGQIE DFGWSTTVAP RHRPDLVLSV TTFKREEAVT SMVNRFRRFR
AASTLRDHLR MLVVDNGQSV PIDQGEGVTI LPNANLGGAG GFSRGLLEVR KAGATHCLFM
DDDASMHMGA ITRVWMLLAY ARDPRTTVAG AMINADHRWK LWENGAVFDR GCKPLYFGCD
LRQQEDVFKM EFETTAPPPP GFYGGWWFFA FPVDRARHMP FPFFVRGDDV SFSLVNDLRI
VTLPGVASVQ ESFVDKASPQ TWYLDMRSHL VHHLSLPQKS VSWGGLQRMF FSFYLRTVLR
YHYDSLSAVN LAIEDVMQGP RFFAENADMA QRRKDLKEMT RTEVWTPVDF PPRYRHGKAS
RPLRALLLMT LNGHLLPFSN LFGSNLVLKA WAREDFRQVY GARRITYVNA SRNAVYTVQR
SRRRFWSESL RLVRNSLRLR RAYGRLQAEW QDGYPKLTSD EFWHRKLGLV DEGKGGVPDR
SIEIKSPADQ STSPGSSLIS MPGKTLRSSS SRSNP