Gene Rsph17029_3644 details


Gene Information

Locus tag: Rsph17029_3644
Symbol:
ID: 4898756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 742286
End bp: 744148
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 67%
IMG OID: 640114252
Product: TRAP C4-dicarboxylate transport system permease DctM subunit
Protein accession: YP_001045506
Protein GI: 126464393
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component
        [COG3090] TRAP-type C4-dicarboxylate transport system, small permease component
TIGRFAM ID: [TIGR00786] TRAP transporter, DctM subunit
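
The start, end, gene length and protein length fields above are linked by simple arithmetic. The following is a minimal sketch of that relationship. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the record does not state either convention, though both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon contributes one residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(742286, 744148)
print(length_bp, protein_length(length_bp))  # prints: 1863 620
```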


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.173742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACAG CAGAAGAATT CGACGTGGGA GACCTCAGCG AGAACGAGTC GGCGCGGCCG 
GGCTCGCGCT TCATGCGGCC GGTCGAGGCG ATCGCGGCCC TGCTTCTGGT GGCGGTGATC
GGCCTGCTGC TGACCGGGGT CGTCTCGCGC TATCTCGTCA ACCTCCCCAT CATCTGGATC
GAGGAAGCGG CCTCGATCTG CTTCATCTGG CTCGCCATGC TGGGCGCGGC GATCTCGCTC
GACCGCAACG AGCATCTGCG GCTCACGCTC TTCGTCCGGA TGCTGCCGCG GCGCGCCGAG
GAATTCGTCG AGACCTTCGC CATGGTCATC GTCGGCACCT TCCTCGTGGC GCTGATGCGC
GACAGCTTCG AATATGCCTA CGAGGAATGG TTCATCACCA CCGCGGCGCT GCATATTCCG
AACACCTACC GCGTCGCCGC GATCCCCGTG GGCATGCTGG CCATGCTGGG GCTGCTCCTC
ATGCATCTCT TCCGCTCGAG CCGGCTCGTC GATGTGGCGA TTTCGGTGGG GGCGGTCGTG
CTGATCTCGC TCGGCCTCTG GGCCCTCTCG CCCCAGTTCA TCGCGATGGG CGCAACCGCG
CTGCCGCTCT TCCTCGTGGT GATCGTGGGC GCCTGCCTCT TTGCCGGCGT GCCCATCGCC
TTCTGCTTCG GGCTCGGCAC GCTGAGCTAT CTGCTCTTCG CCACCCACAT TCCGACCTAT
GTCATGGTCG GGCGCATGGA CGAGGGCATG TCGGGGGTGA TCCTCCTGTC GGTGCCGGTC
TTCGTGCTTC TGGGCTGCGT GCTCGATGCC ACCGGCATGG GCAAGGCCAT CGTGGGCTTC
CTCTCCTCGC TACTCGGCCA TGTGAAGGCG GGCCTGTCGC ATGTGCTCCT CGTGTCGCTC
TTCATCGTGT CGGGGATCTC GGGCTCGAAG GTCTCGGACA TGGCGGCCGT TGCGCCCGCG
CTCTTCCCCG AGATGAAGCG CCGCGGCAAC AAGCCGAACG AGATGGTGGC GCAGCTTGCC
ACCGGCGCCG CGATGGCCGA CACCGTGCCG CCCTCGATCG TGCTGATCGT GCTCGGATCG
GTCGCGGGCA TCTCGATCGC CGCGCTCTTC TCGGCGGGCT TCGTGGTGGC CATCGTGCTC
CTCTTGTCGC TGATGGCGCT GGCGCGCTGG CGGGCGCGGC ACGAGAACAT GGAGGGCGTG
CGCCGCGCCC GCTTCGGCGA CGTGATCCGC CTGCTGGTCT TCGCCGCCCC CGCGCTGGTG
CTGCCCTTCC TGATCCGTGC CGCCGTGGCC GAGGGCGCCG CCACCGCGAC CGAGGTCTCG
ACCATCGCCG TGGTCTATTC GCTGGTGATC GGCGGCCTGA TGTATGGCGG CTTCAGCCTC
CGCGCGCTCT ACCGGATGCT GGTCGAGACC GCGGCCATGA GCGGGGCGAT CCTCCTCATC
CTCGGCACCG CCCTCTCGAT GGCCTGGGCC ATCACCCAGG CGGGCGTGGG CCAGACGCTT
GCGGATTTCG CCATCAGCCT GCCGGGCGGG GCGCCCACCT TCCTCGTCCT CTCCATCGCG
ATCTTCCTCG TGCTGGGCTG CGTGCTCGAG GGGCTGCCCG CGCTCGTGCT GATGTCGCCG
CTCATGTTCC CCATCGCCCA CCAGATGGGG ATCAACGAGG TCCACTACGC CATGGTGATC
GTGGTCGCGA TGAACATCGG CCTCATGACC CCGCCCATCG GCATCGGCTT CTACCTCGCC
TGCCGCATCG GCAATGTCTC GCCCGACAGC GCGATCCACG CGGTCTGGCC CTATGTCGCG
GCCCTCATGG CCGGCCTGAT CGTCATTGCC GCGGTGCCGG CCATCTCCAC GGGCTTCCTC
TGA
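
The gene sequence above is laid out in space-separated blocks of ten bases. The sketch below shows one way to strip that layout and compute GC content as the percentage of G and C bases. The helper names are hypothetical, and the example input is only the first line of the sequence, so its result is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence only (illustrative input, not the full gene).
first_line = "ATGACAACAG CAGAAGAATT CGACGTGGGA GACCTCAGCG AGAACGAGTC GGCGCGGCCG"
dna = clean_sequence(first_line)
print(len(dna), round(gc_content(dna), 1))
```

To check the record's GC content field, the full gene sequence would be passed through `clean_sequence` in the same way.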
 
Protein sequence
MTTAEEFDVG DLSENESARP GSRFMRPVEA IAALLLVAVI GLLLTGVVSR YLVNLPIIWI 
EEAASICFIW LAMLGAAISL DRNEHLRLTL FVRMLPRRAE EFVETFAMVI VGTFLVALMR
DSFEYAYEEW FITTAALHIP NTYRVAAIPV GMLAMLGLLL MHLFRSSRLV DVAISVGAVV
LISLGLWALS PQFIAMGATA LPLFLVVIVG ACLFAGVPIA FCFGLGTLSY LLFATHIPTY
VMVGRMDEGM SGVILLSVPV FVLLGCVLDA TGMGKAIVGF LSSLLGHVKA GLSHVLLVSL
FIVSGISGSK VSDMAAVAPA LFPEMKRRGN KPNEMVAQLA TGAAMADTVP PSIVLIVLGS
VAGISIAALF SAGFVVAIVL LLSLMALARW RARHENMEGV RRARFGDVIR LLVFAAPALV
LPFLIRAAVA EGAATATEVS TIAVVYSLVI GGLMYGGFSL RALYRMLVET AAMSGAILLI
LGTALSMAWA ITQAGVGQTL ADFAISLPGG APTFLVLSIA IFLVLGCVLE GLPALVLMSP
LMFPIAHQMG INEVHYAMVI VVAMNIGLMT PPIGIGFYLA CRIGNVSPDS AIHAVWPYVA
ALMAGLIVIA AVPAISTGFL