Gene Rsph17025_2969 details


Gene Information

Locus tag: Rsph17025_2969
Symbol:
ID: 5085172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 3032536
End bp: 3034221
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 71%
IMG OID: 640484540
Product: protein of unknown function DUF894, DitE
Protein accession: YP_001169160
Protein GI: 146279001
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
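
The coordinate and length fields above can be cross-checked with a little arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which is not translated.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3032536, 3034221)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the two results agree with the listed Gene Length (1686 bp) and Protein Length (561 aa).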


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.207712
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCAGCCG CGCCCGCGCG CCCTACCCAT TGGGCTGCCG GCCCGACAGA AGGTTGTCCC 
TTGCCCGACC GTGTCTCACC CCTCGCGCCC TTCCGCCTTC CGACCTACCG CAACCTGTGG
ATGGCGAGCA TCGTCTCGAA CTTCGGCGGG CTGGTGCAGG CCGTGGGCGC AGGCTGGATG
ATGACGGAAC TCACCCATTC CGCGGGAATG GTGGCGCTGG TGCAGGCCTC GACCACGCTG
CCGATCATGC TCTTCTCGCT GCCGTCGGGC GCCCTGGCCG ACAGCCTGAA CCGGCGGCGC
CTCATGCTGA CGGCGCAGCT GTTCATGCTG GCCGTCTCGG CGGCGCTGGC ATTTGCGGCC
TTTGCCGGGC TGCTCACGCC CTGGCTGCTG CTGACCTTCA CCTTCCTGAT CGGCGTGGGG
GTGGCGCTGC ACAATCCCTC ATGGCAGGCC TCGGTCGGAG ATATCGTCCC GCGTGCGGAC
CTGCCGTCGG CGGTCGCGCT CAACAGCATG GGCTTCAACC TGATGCGCAG CGTCGGGCCG
GCGCTGGGCG GGCTGATCGT CGCCGCCGGC GGCGCGGCCG CAGCCTTTGC AATCAATGCG
GCGAGCTACC TGCCGCTGGT CCTCGCGCTG TTCTTCTGGC GGCCGGACTA TGCGCCGCGC
CGCCTTCCGC GCGAGGCGCT GGGATCGGCC GTCGCGGCGG GTCTGCGCTA TGTCTCGATG
TCGCCGGTGC TGCTGAAGGT GCTGTTCCGG GGCTTTCTCT TCGGACTGGC CGCGGTCAGC
CTGCTTGCGC TTCTGCCCAT CGTGGCCCGC GACCTGGTGG CCGGGGGCGC CTTCACCTAC
GGGGCGCTGC TGGGCTGCTT CGGGGTGGGG GCGATCGGCG GGGCCTTTGC CGGTGCCCGC
CTGCGCGAAC GGTTCCAGAA CGAGACCATC GTGCGCGCGG GCTTCCTGCT CTTTGCGGCG
GCGCTGGTGG GGCTAGGCCT TTCGCGCGAC CTCTGGCTGT CGGGGCTGAT GCTGTTGCCG
GCCGGTGCGG CCTGGGTGCT GGCGCTGTCG CTCTTCAATG TGAGCGTGCA GTTGGCCACT
CCGCGGTGGG TCGTCGGGCG GGCCCTTGCG CTCTATCAGA CGGCGACCTT CGGCGGGATG
GCAGCGGGCA GCTTCCTCTG GGGTCAGGCC GCCGAGGCGG GCGGCGTGGC CCAGGCGCTG
TTCGGCGCCG CGGCCGTGCT GGTGGTGGGG GCGATCGTTG GCCTGCGGAT GCCGCTGCCC
GCCTTCGGGA TGGAGGATCT CGATCCGCTC GGCCGCTTTG TCGAGCCGAA GCTGCCGGTC
GATCTGAGGA CGCGGTCGGG GCCGATCATG GTGACGGTGG AATATGATGT CGCGCCGGAG
AATGTGGAGC CGTTCCTCGC CGCGATGGCC GACCGGCGCC GGATCCGCAT CCGGGACGGG
GCCCGGCAGT GGGTGCTGCT GCGGGATCTG GAACGGCCCG GGATCTGGGC CGAAAGCTAT
CATGTGGCCA CCTGGGCCGA GTATCTGCGC CACCACGAGC GGCGGACGAA GGCGGATGCC
GAGGTGACCG ACCGGCTGCT GGCGCTGCAT GAGGGGCCGG GCAAGCCGCG CGTCCGCCGG
ATGATCGAGC GCCAGACAGT GCCGCTGCAC GACGATCTGC CGCTGAAGCC CGAGGAGCTG
ACCTGA
 
Protein sequence
MPAAPARPTH WAAGPTEGCP LPDRVSPLAP FRLPTYRNLW MASIVSNFGG LVQAVGAGWM 
MTELTHSAGM VALVQASTTL PIMLFSLPSG ALADSLNRRR LMLTAQLFML AVSAALAFAA
FAGLLTPWLL LTFTFLIGVG VALHNPSWQA SVGDIVPRAD LPSAVALNSM GFNLMRSVGP
ALGGLIVAAG GAAAAFAINA ASYLPLVLAL FFWRPDYAPR RLPREALGSA VAAGLRYVSM
SPVLLKVLFR GFLFGLAAVS LLALLPIVAR DLVAGGAFTY GALLGCFGVG AIGGAFAGAR
LRERFQNETI VRAGFLLFAA ALVGLGLSRD LWLSGLMLLP AGAAWVLALS LFNVSVQLAT
PRWVVGRALA LYQTATFGGM AAGSFLWGQA AEAGGVAQAL FGAAAVLVVG AIVGLRMPLP
AFGMEDLDPL GRFVEPKLPV DLRTRSGPIM VTVEYDVAPE NVEPFLAAMA DRRRIRIRDG
ARQWVLLRDL ERPGIWAESY HVATWAEYLR HHERRTKADA EVTDRLLALH EGPGKPRVRR
MIERQTVPLH DDLPLKPEEL T
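
The sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The Python sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. The helper names are hypothetical, and the short strings in the usage lines are fragments copied from the first and last gene-sequence lines, not the full gene. Note that the gene starts with TTG while the protein starts with M: under translation table 11, TTG is an accepted start codon, and an initiator codon is translated as methionine.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def normalize(blocked: str) -> str:
    """Join a space- and newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Fragments from the record above (first two blocks and the final block).
fragment = normalize("TTGCCAGCCG CGCCCGCGCG\nACCTGA ")
print(fragment)
print(round(gc_fraction(fragment), 2))
print(ends_with_stop(fragment))
```

Running the same functions on the complete gene sequence, pasted as one string, would give the value to compare against the listed GC content of 71%.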