Gene Rsph17029_1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1557 
Symbol 
ID4896490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1637646 
End bp1638656 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content67% 
IMG OID640112147 
Productextracellular solute-binding protein 
Protein accessionYP_001043439 
Protein GI126462325 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.456805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCACG CTGCTCTTCC GCTGGTAGCC CTTGCCCTTG GCACCACCGC CCTGCCCGCT 
CTGGCCGACG AGGTGAACAT CTATTCGCAC CGTCAGCCCG AGCTGATCCA GCCGCTGGTG
GATGCCTTCA CCGCCGAGAC CGGCATCGAC GTCAATGTGG CCTTCGTCGA CAAGGGCATG
GCGGAACGGC TCGTGGCCGA AGGCAACCGC TCGCCGGCCG ATCTGGTGCT GACGGTCGAT
ATCGCGCGGC TGATGCAGGT CGTCGAGGCG GGCGTCACGC AGCCGGTCGA GTCCGACGTG
CTCTCCTCGA ACATCCCGGC CGAGTTCCGC GATCCGGCGG GCCACTGGTT CGGACTGACC
AGCCGGGCCC GCATCGTCTA TGCCTCGAAG GAGCGGGTGA AGGACGGCGA GGTCACGACC
TACGAGGATC TCGCCTCGGA CAAGTGGAAG GGCCGGATCT GCACCCGCTC CTTCACCAGC
GACTACAACG TGGCGCTGAC CGGCGCCGTT ATCGCGCATC ACGGCACCGA GGGCGCGAAG
ACCTGGCTCG AAGGGGTGAA GGCGAACCTC GCCCGCAAGC CCGAAGGCAA CGACCGCGAT
CAGGTGAAGT CGATCTGGGC CGGCGAATGC GACATCAGCC TCGGCAACAC CTACTACATG
GGGCAGATGC TGGCCGATCC CGAGCAGAAA GAATGGGCGG ACTCGGTCCG CATCGTCTTC
CCGACCTTCG AGGGCGGCGG CACCCACATG AACATCTCGG GCGTCGCCAT GACGAAGGCC
GCGCCGAACC GCGAGGCCGC GCTGAAGCTG ATGGAGTGGC TTGCCTCCGA CGAGGCGCAG
CGGATCTATG CCGAGACGAA CCACGAGTTC CCGGTCGAGC CCGGTGTCGC GCGCTCGGAG
CTGGTGCAGA GCTGGGGCGA GTTCACGCCC GACGCGGTCA GCCTCGCCGA GGTGGCCTCG
CATCGCGGCG AGGCGCTGAA GCTGATCGAG ACCGTGGATT TCGACGGCTG A
 
Protein sequence
MRHAALPLVA LALGTTALPA LADEVNIYSH RQPELIQPLV DAFTAETGID VNVAFVDKGM 
AERLVAEGNR SPADLVLTVD IARLMQVVEA GVTQPVESDV LSSNIPAEFR DPAGHWFGLT
SRARIVYASK ERVKDGEVTT YEDLASDKWK GRICTRSFTS DYNVALTGAV IAHHGTEGAK
TWLEGVKANL ARKPEGNDRD QVKSIWAGEC DISLGNTYYM GQMLADPEQK EWADSVRIVF
PTFEGGGTHM NISGVAMTKA APNREAALKL MEWLASDEAQ RIYAETNHEF PVEPGVARSE
LVQSWGEFTP DAVSLAEVAS HRGEALKLIE TVDFDG