Gene Rsph17025_3867 details


Gene Information

Locus tag: Rsph17025_3867
Symbol:
ID: 5085415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009429
Strand:
Start bp: 765892
End bp: 767031
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 66%
IMG OID: 640485426
Product: hypothetical protein
Protein accession: YP_001170027
Protein GI: 146279869
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
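
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 765892
END_BP = 767031
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Amino-acid codon count, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```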


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0795964
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCTA TCGAGAAGAA GGCGGCCTAT TCCCGCCGCT CGTTCCTGCG CACCGGCGCC 
CTGGCCGGAG GCGCCGCTGC GGGATCCATG CTCGCCGCGC CGGCGGTGCT GGCGCAGTCT
CCGATCGTGA TGAAGATGCA GACATCCTGG CCCGCTTCGG ACATCTGGAT GGATTTCGCC
CGCGAATATG TCACGCGGGT GGAAGAGATG TCGGGCGGTC GGCTCAGGGT GGACCTGCTG
CCGGCCGGGG CCGTGGTCGG CGCCTTTCAG GTGATGGACG CCGTGCATGA CGGGGTGATC
GACGCATCGC ACTCCGTTTC GGCCTACTGG TATGGCAAGT CCAAGGCGGC CTCGTTCTTC
GGCACCGGCC CCGTCTTCGG CGGCTCGGCC ACGACCATGC TCGGCTGGTT CTACCAGGGC
GGCGGGCAGG ATCTTTACCG CGAGCTGACG CAGGACATCC TGGGCATGAA CATCGTGGGC
TTCTACGGCT TTCCGATGCC CGCCCAGCCG TTTGGCTGGT TCAAGTCCGA GGTCAACGGC
GTGGCCGACA TCCAGGGATT CAAGTATCGC ACCGTCGGCC TTGCCGCCGA CCTTCTGCAG
GCGATGGGCA TGTCGGTGGC CCAGCTTCCG GGCGGCGAGA TCGTGCCCGC GATGGAACGC
GGCGTGATCG ACGCCTTCGA GTTCAACAAC CCCTCGTCCG ACCGCCGCTT CGGCGCGCAG
GACGTGGCGA AGAACTACTA TCTCTCCTCG TACCACCAGG CGTCCGAGAG CTTTGAGTAC
ACCTTCAACC GCGACTTCTA CGAGGATCTG GAGCCCGACC TGCAGGCGAT CCTGAAATAT
GCGGTCGAGG CGGCCTCGAC CTCGAACACC GCGCTGGCGC TGCGCCAGTA CTCGGCCGAC
CTCGCGGCGC TTGCCTCGGA GAACGGGGTG CAGGTGCGCC GGACACCGAC CGAGATCCTC
TCGGGCCAGC TCGAGGCCTG GGACAGGCTG ATCACCGATC TGGAGGGCGA TGCCTTCTTC
AAGAAGGTCC TCGACAGCCA GCGCGCCTGG GTCGAGCAGG TCAGCTATTA CGAGCTGATG
AACGAGGCCG ACCTGCGGCT GGCCTACGAG CACTACTTCC CCGGCAAGCT CCAGCTCTGA
 
Protein sequence
MTAIEKKAAY SRRSFLRTGA LAGGAAAGSM LAAPAVLAQS PIVMKMQTSW PASDIWMDFA 
REYVTRVEEM SGGRLRVDLL PAGAVVGAFQ VMDAVHDGVI DASHSVSAYW YGKSKAASFF
GTGPVFGGSA TTMLGWFYQG GGQDLYRELT QDILGMNIVG FYGFPMPAQP FGWFKSEVNG
VADIQGFKYR TVGLAADLLQ AMGMSVAQLP GGEIVPAMER GVIDAFEFNN PSSDRRFGAQ
DVAKNYYLSS YHQASESFEY TFNRDFYEDL EPDLQAILKY AVEAASTSNT ALALRQYSAD
LAALASENGV QVRRTPTEIL SGQLEAWDRL ITDLEGDAFF KKVLDSQRAW VEQVSYYELM
NEADLRLAYE HYFPGKLQL
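
The sequences above are printed in blocks of ten characters, so the whitespace must be stripped before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the standard codon assignments (which translation table 11 shares with table 1 for elongation codons), and the function names are hypothetical. The sample is the first line of the gene sequence.

```python
# Codon table built in the conventional TCAG base order. The amino-acid
# assignments are the standard ones; translation table 11 is assumed to
# differ from table 1 only in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display blocking."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence from this record.
SAMPLE = "ATGACGGCTA TCGAGAAGAA GGCGGCCTAT TCCCGCCGCT CGTTCCTGCG CACCGGCGCC"

dna = clean_sequence(SAMPLE)
print(len(dna))
print(translate(dna))
print(round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, the same functions would allow the listed gene length, GC content, and protein sequence to be checked against one another.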