Gene Rsph17029_3571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3571 
Symbol 
ID4898378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp661683 
End bp662663 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID640114180 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001045434 
Protein GI126464321 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.303866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCGA AAACAAAGGC CGGTCTCGGC CTGACGGTCG GACTCATGCT CATGGCCGGC 
ACGGCGACCG CGCGGACCTT GACGCTGGGC ACCGTCTACG GAGCCCGCGA CGTCAGCACC
CAGGCCATGG AACATTGGAA CGAGGCGCTG AGCGAGGCCA CGGAGGGGCG GTGGTCGCTG
TCGATCGTGC CGGGCGGCAC CCTCGGCGGA GACCGCGAGA TGCTTCAGCA GCTCTCGACC
GGCGAGATCG ACATCAATCT CTCCTCGCCC GTGGTCATGC AGTATGTTGC GCCGCAATAT
CAGTGCCTTG AGGCGGAATA TATCTACGAT TCCGAAGAGC AGGGCTTCGC TGTGTGGCGC
GGGGACATCG GCAAGGCCGC CTCGCAGGCC ATGAAGGACG CCCATGGCAT CGAGATCGCC
GCCGTCGGCC GCCGGGGCGC GCGCCTCGTG ACGGCGAACA AGCCGATCCT GAAGCCGGAA
GATCTGGCGG GCCTGAAGTT CCGCGTCACC AACAACCTCA GGTCCGAGGT CTTCGCCGCC
TATGGCGCAC AGCCCGCGCC GCTTCCCCTG TCGGAGCTCT ATGGCGCGCT GCGTCAGGGC
GTGTTCGATG CGCAGGAGAA CCCGCTCTCC ACGATCTTCA GCCTGCGCTT CCACGAGGTT
CAGAGCCACA TCAGCGAGAC CAACCACATC TGGACCTACA ATCTGGTGCT GACCAACAGC
GCCCTGATGG ACGAACTGGG CGAGGATCGC GCCGCGTTCG AAAGCACGCT GGCCCAGTCG
CTGGAGTGGC TCTACACGGC CATCGACGAA GAGAATGCCC GGATCCGGGC CGAGATCGAG
GCCTCGGGCT CGGCCGTCTT CGACAAGCCC GACACGCAGG CCTTCCGCGA CGCCGCCCGC
CCCATCCTCG CGGCCTATGC CGAGGAAAGC TGCGCGCCGG GGCTGCTCGA TGCGGTCGAC
GCCGTGGCTG CATCGAACTG A
 
Protein sequence
MNAKTKAGLG LTVGLMLMAG TATARTLTLG TVYGARDVST QAMEHWNEAL SEATEGRWSL 
SIVPGGTLGG DREMLQQLST GEIDINLSSP VVMQYVAPQY QCLEAEYIYD SEEQGFAVWR
GDIGKAASQA MKDAHGIEIA AVGRRGARLV TANKPILKPE DLAGLKFRVT NNLRSEVFAA
YGAQPAPLPL SELYGALRQG VFDAQENPLS TIFSLRFHEV QSHISETNHI WTYNLVLTNS
ALMDELGEDR AAFESTLAQS LEWLYTAIDE ENARIRAEIE ASGSAVFDKP DTQAFRDAAR
PILAAYAEES CAPGLLDAVD AVAASN