Gene Rsph17029_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2604 
Symbol 
ID4897137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2743426 
End bp2745051 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content71% 
IMG OID640113204 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_001044478 
Protein GI126463364 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCGACC GTGTCTCTCC CCTCGCGCCC TTCCGCCTGC CCACCTACCG CAACCTCTGG 
ATGGCGAGCA TCGTCTCGAA CTTCGGCGGG CTGGTGCAGG CGGTGGGCGC AGGCTGGATG
ATGACCGAGC TCACCCATTC GGCCGGGATG GTGGCGCTGG TGCAGGCCTC GACGACCCTG
CCGATCATGC TCTTCTCGCT GCCCTCGGGC GCGCTGGCCG ACAGCCTCAA CCGGCGGCGC
CTGATGCTGA CGGCGCAGCT CTTCATGCTG ACGGCCTCGG CGGCGCTGGC GCTGGCGGCC
TTCGCGGATC TGCTGACGCC GTGGCTCTTG CTGACCTTCA CCTTCCTGAT CGGCGCGGGC
GTGGCGCTGC ACAACCCTTC GTGGCAGGCG TCTGTCGGCG ACATCGTGCC CCGCAAGGAC
CTGCCGTCGG CGGTCGCGCT GAACAGCATG GGCTTCAACC TGATGCGGAG CGTGGGCCCC
GCGCTGGGCG GGATCATCGT GGCGGCGGGC GGCGCGGCGG CGGCCTTCGC CATCAATGCG
GTGAGCTATC TGCCGCTCGT CCTCACGCTC TTTCTCTGGC GGCCCGATTA TGCACCGCGG
CGGCTGCCGC GCGAGACGCT GGGATCGGCG GTGGCCGCGG GCCTGCGCTA TGTCTCGATG
TCGCCCATCC TGCTCAAGGT GCTCTTCCGC GGCTTCCTCT TCGGCCTTGC GGCAGTGAGC
CTGCTTGCGC TGCTGCCCGT CGTCGCGCGC GACCTCGTGG GCGGGGGCGC CTTCACCTAC
GGCGTGCTTC TGGGCTGTTT CGGGGTGGGC GCCATCGGCG GCGCCTTCGC CGGCGCGCGC
CTGCGCGAGC GGTTCCAGAA CGAGACCATC GTGCGGGCGG GCTTCGTGAT CTTCGCGCTG
GCGCTGACCG GACTCGGCCT GTCGCGCGCG CTGTGGCTGT CGGGGCTCAT GCTCCTGCCG
GCGGGGGCGG CCTGGGTGCT GGCCTTGTCG CTGTTCAACG TGAGCGTCCA GCTTGCCACG
CCGCGCTGGG TCGTGGGGCG GGCGCTGGCC CTCTATCAGA CCGCCACTTT CGGCGGGATG
GCGGCGGGCA GCTTCCTCTG GGGGCAGGCG GCCGAGGCGG GCGGCGTGGA CGGGGCCCTG
TTCGGGGCGG CCGGTGTCCT CGTCGTGGGG GCGCTGGTGG GGCTGCGGAT GCCGCTGCCG
GCCTTCGGGA TGCAGGATCT CGATCCGCTG GGGCGGTTCG TGGAGCCGAG CCTGCCCGTG
GATCTGCGCA TGCGCTCGGG TCCGATCATG GTGACGGTCG AATATGACGT GGTGCCCGAG
AATGTGGACG AGTTTCTCGC CGCCATGGCG GACCGGCGCC GGATCCGGAT CCGGGACGGG
GCGCGGCAGT GGGTGCTGTT GCGCGATCTG GAGCGGCCGG GGATCTGGGC CGAGAGCTAC
CATGTCGCGA CCTGGGCGGA ATATCTGCGC CACCACGAGC GGCGGACGAA GGCGGATGCC
GAGGTGACGG ACCGGCTGCT CGCGCTGCAC GCAGGGCCGG GCAAGCCGCG GGTGCGCCGG
ATGATCGAGC GCCAGACGGT GCCCTTGCAC GACGATCTGC CGCTGAAGCC CGAGGATCTG
ACCTGA
 
Protein sequence
MPDRVSPLAP FRLPTYRNLW MASIVSNFGG LVQAVGAGWM MTELTHSAGM VALVQASTTL 
PIMLFSLPSG ALADSLNRRR LMLTAQLFML TASAALALAA FADLLTPWLL LTFTFLIGAG
VALHNPSWQA SVGDIVPRKD LPSAVALNSM GFNLMRSVGP ALGGIIVAAG GAAAAFAINA
VSYLPLVLTL FLWRPDYAPR RLPRETLGSA VAAGLRYVSM SPILLKVLFR GFLFGLAAVS
LLALLPVVAR DLVGGGAFTY GVLLGCFGVG AIGGAFAGAR LRERFQNETI VRAGFVIFAL
ALTGLGLSRA LWLSGLMLLP AGAAWVLALS LFNVSVQLAT PRWVVGRALA LYQTATFGGM
AAGSFLWGQA AEAGGVDGAL FGAAGVLVVG ALVGLRMPLP AFGMQDLDPL GRFVEPSLPV
DLRMRSGPIM VTVEYDVVPE NVDEFLAAMA DRRRIRIRDG ARQWVLLRDL ERPGIWAESY
HVATWAEYLR HHERRTKADA EVTDRLLALH AGPGKPRVRR MIERQTVPLH DDLPLKPEDL
T