Gene Rsph17029_1990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1990 
Symbol 
ID4895142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2109928 
End bp2111028 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content68% 
IMG OID640112584 
Productextracellular solute-binding protein 
Protein accessionYP_001043866 
Protein GI126462752 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA AACTTGCACT GGCCGCCTCG GTCAGCCTCG CCGCGATGGG CGCGGCCTCG 
GGCGCCTTCG CCCAGAGCGC CGAGCTCGTC GAGGCCGCCA AGGCCGAAGG GATGCTGACC
ACCATCGCCC TGCCCCACAA CTGGTGCGGC TACGGCGACG TGATCGCGGG CTTCAAGGCA
AAATATCCCG AAATCACCGT GAACGAGCTG AACCCCGACG CGGGCTCGGC CGACGAGATC
GAGGCGATCC GGGCGAACAA GGACAACAAG GGCCCGCAGG CGCCCGACGT GATCGACGTG
GGCCTCGCCT TCGGGCCGCA GGCCAAGGAC GAGGGCCTGA TCGCCCCCTA CAAGGTCGAG
ACCTGGGACG AGATCCCCGC CGAGATCAAG GATGCCGACG GCTACTGGTA TGGCGACTAC
TACGGCGTGA TGTCCTTCGG GGTGAACACC GATCTCGTGC AGGAAGTGCC GAAGAGCTGG
GAGGCGCTGC TCGACAGCCA GTATGCCAAT GCCTTCGCGC TGGCGGGCGA CCCGCGCGCC
TCGAACCAGG CGATCCTGGC CGTGATGTCG GCCGGCATCG CCGACGGCAC TGAGCCCGGC
GAGGCCTCGG GCAAGAAGGG GCTCGAGTTC TTCGGCAAGC TGAACAAGGC CGGCGGCTTC
GTGCCGGTGA TCGGCAAGGC CGGCACCATC GCCCAGGGTC AGACCCCGAT CGTCGCCGCT
TGGGACTACA ACCTGCTGTC CTGGCGCGAC GAGCTGAAGG GCAACCCGCC CATGGAAGTG
GTGATCCCGG AGGGCCCGAG CCTCGCCGGC GTCTATGTGC AGGCGATCTC GGCCTTCGCG
CCGCACCCGA ACGCGGCGAA GCTCTGGATG GAATATCTCT ATTCGGACGA AGGTCAGCTC
GGCTGGCTCA AGGGCTACTG CCACCCGGCG CGCTTCAACG CGATGGTCGA GGCCGGCAAG
ATCCCGCAGG AGCTGCTCGA CGCCCTGCCG CCCGCCGAAG GCTATGCCCG CGCGGTCTTC
CCGACCGTCG AGCAGCAGGA GGCCAACAAG GCGGCCGTGA CGGCCGGCTG GGATGGTGTG
GTCGGCGCCA ACGTGCAATG A
 
Protein sequence
MTMKLALAAS VSLAAMGAAS GAFAQSAELV EAAKAEGMLT TIALPHNWCG YGDVIAGFKA 
KYPEITVNEL NPDAGSADEI EAIRANKDNK GPQAPDVIDV GLAFGPQAKD EGLIAPYKVE
TWDEIPAEIK DADGYWYGDY YGVMSFGVNT DLVQEVPKSW EALLDSQYAN AFALAGDPRA
SNQAILAVMS AGIADGTEPG EASGKKGLEF FGKLNKAGGF VPVIGKAGTI AQGQTPIVAA
WDYNLLSWRD ELKGNPPMEV VIPEGPSLAG VYVQAISAFA PHPNAAKLWM EYLYSDEGQL
GWLKGYCHPA RFNAMVEAGK IPQELLDALP PAEGYARAVF PTVEQQEANK AAVTAGWDGV
VGANVQ