Gene Rsph17029_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3620 
Symbol 
ID4898664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp712331 
End bp713422 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content65% 
IMG OID640114228 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001045482 
Protein GI126464369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.39212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTTCCC GCCCCCGTGA CGGGGACGGG CTACAGCTCA AAAGAGCATT CAGAAAGGGA 
GGAGACCTGA TGAATTACCT GACTTCCACC GCCGTGGCGC TGATCGCCGC GCTTACCGCC
GGCTCGGCCG CGATGGCGCA GGAACACCAT TTCCGCTTCC AGTCCTCGGA CCCGGCGGGC
AACCCGAACT TCGAGCTGCA GCATGTCTTC GCCGACAAGG TGAAGGAGCT GACCAACGGT
GAGGTCACGA TCGAGCTCAT GCCGGTCGGC ACCATCGTCG ACTACAAGGA GACGCCCGAC
GCGATCCAGG CCGGGCTGAT CGACGGCCAT ATCACCGACA CCTCCTATTT CGCCGGCCGT
GACCCGGCCT TCGGCCTGAT CGCGAACCCG GTCGGCGCCT GGGCGGACCC CGCGCAGATG
ATCGACTTCG TCGAGAACGG CGGCGGCAAG GAGCTGATGA ACGAGCTCAT CAATCCCTAC
GGGCTCCAGT TCATCGGCGT CTCGACCCCG GGCCTCGAGG CTTTCGTCTC GAAGGTGCCG
CTCGACACGG TGGAGGATCT GAAGGGCGTG AAGGTCCGCT CGCCGGAGGG GCTGATCGCC
AACGTCTTCG CCGCCGCGGG CGCGAACCCG GTCAACCTGC CCTCGTCCGA GGTCTATACC
TCGCTCGACA AGGGCGTGAT CGACGCGGCC GACTATTCGG TCTTTTCGGT GAACCAGGAC
ACCGGGATGA ACGATATCGC GCCGCATCCG GTCTATCCGG GCTTCCACTC GCTGCCGCTC
GTCGAAGTGT CGATGAACAA GCAGAAGTGG GACGCGCTGA CGCCCGAGCT GCAGGCCAAG
ATCACCGAGG CGCAGAAGAT CTTCCAGCAG ACCCAGATCG ACACGCTGCA CCAGCGCGAT
CTCGAGGCCG TCGAGGCCGC CAAGGCCGGC GGCAAGATCA CGGTCCACGA CTGGTCGGAC
GAGGAACGCG CCAAGTTCAG GGGCATCGCC CGCGGCGAAT GGGAGAAGGT CGCCGGCCAG
TCCGAGATGG CGCAGAAGGT CTATGACACG CTCGTGACCT ATCTGAAGGA CAAGGGCCTG
ATGGCCGAGT GA
 
Protein sequence
MPSRPRDGDG LQLKRAFRKG GDLMNYLTST AVALIAALTA GSAAMAQEHH FRFQSSDPAG 
NPNFELQHVF ADKVKELTNG EVTIELMPVG TIVDYKETPD AIQAGLIDGH ITDTSYFAGR
DPAFGLIANP VGAWADPAQM IDFVENGGGK ELMNELINPY GLQFIGVSTP GLEAFVSKVP
LDTVEDLKGV KVRSPEGLIA NVFAAAGANP VNLPSSEVYT SLDKGVIDAA DYSVFSVNQD
TGMNDIAPHP VYPGFHSLPL VEVSMNKQKW DALTPELQAK ITEAQKIFQQ TQIDTLHQRD
LEAVEAAKAG GKITVHDWSD EERAKFRGIA RGEWEKVAGQ SEMAQKVYDT LVTYLKDKGL
MAE