Gene Rsph17029_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3018 
Symbol 
ID4898651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp26408 
End bp27547 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content65% 
IMG OID640113620 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001044890 
Protein GI126463777 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTT TCGAGAAGAA AGCGGCGTAT TCGCGCCGCT CGTTCCTGCG CACCGGTGCC 
TTGGCCGGGG GTGCGGCGGC GGGGTCTGTC CTGGCGGCTC CGGCCGTTCT GGCCCAAGCG
CCGCTGGTGA TGAAGATGCA GACATCCTGG CCCGCCTCGG ACATCTGGAT GGACTTCGCC
CGCGAATATG TCACGCGGGT CGAGGAGATG TCGGGCGGCA GGCTCAAGGT GGACCTGCTG
CCGGCCGGAG CCGTGGTCGG CGCCTTCCAG GTGATGGATG CCGTGCATGA CGGCGTGATC
GACGCCTCGC ATTCCGTGTC GGCCTACTGG TACGGCAAGT CGAAGGCGGC CTCGTTCTTC
GGCACGGGCC CGGTCTTCGG CGGTTCGGCG ACCACGATGC TCGGCTGGTT CTATCAGGGC
GGCGGTCAGG ATCTCTACCG CGAGCTGACC CAGGACATTC TCGGAATGAA CATCGTGGGC
TTCTACGGCT TCCCGATGCC GGCCCAGCCC TTCGGCTGGT TCAAGACCGA GGTGAACGGC
GTCGCCGACA TCCAGGGCTT CAAATACCGG ACCGTGGGGC TGGCGGCCGA CCTGCTGCAG
GCGATGGGCA TGTCGGTGGC GCAGCTGCCC GGCGGCGAGA TCGTGCCGGC GATGGAGCGG
GGCGTGATCG ACGCGTTCGA GTTCAACAAC CCCTCGTCGG ACATGCGCTT CGGCGCGCAG
GACGTGGCGA AGAACTACTA TCTCTCCTCC TACCATCAGG CGTCCGAGAG CTTCGAATAT
ACGTTCAACC GCGATTTCTA CGAGGATCTG GATCCCGACC TGCAGGCGAT CCTGAAATAT
GCAGTGGAGG CGGCCTCGAC CTCGAACACG GCGCTGGCGC TGCGCCAGTA TTCGGCCGAT
CTCGCGACGC TCGCGGCCGA AAACGGGGTC GCGGTGCATC GGACCCCGAA GGATATCCTT
TCGGGCCAGC TCGAGGCCTG GGACAAGCTG ATCGTGGATC TCGAGGCCGA CGAGTTCTTC
AAGAAGGTCC TCGACAGCCA GCGCGCCTGG GTGGAGCAGG TGAGCTATTA CGAGCTGATG
AACGCGGCCG ACCTCGGGCT GGCCTACGAA CATCACTTCC CCGGCAAGCT CAAGCTCTGA
 
Protein sequence
MTAFEKKAAY SRRSFLRTGA LAGGAAAGSV LAAPAVLAQA PLVMKMQTSW PASDIWMDFA 
REYVTRVEEM SGGRLKVDLL PAGAVVGAFQ VMDAVHDGVI DASHSVSAYW YGKSKAASFF
GTGPVFGGSA TTMLGWFYQG GGQDLYRELT QDILGMNIVG FYGFPMPAQP FGWFKTEVNG
VADIQGFKYR TVGLAADLLQ AMGMSVAQLP GGEIVPAMER GVIDAFEFNN PSSDMRFGAQ
DVAKNYYLSS YHQASESFEY TFNRDFYEDL DPDLQAILKY AVEAASTSNT ALALRQYSAD
LATLAAENGV AVHRTPKDIL SGQLEAWDKL IVDLEADEFF KKVLDSQRAW VEQVSYYELM
NAADLGLAYE HHFPGKLKL