Gene Rru_A0092 details


Gene Information

Locus tag: Rru_A0092
Symbol:
ID: 3834285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 105873
End bp: 107150
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 66%
IMG OID: 637824162
Product: extracellular solute-binding protein
Protein accession: YP_425184
Protein GI: 83591432
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
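
The gene and protein lengths listed above are consistent with the start and end coordinates. A minimal sketch in Python, assuming Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length (the function names are illustrative, not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(105873, 107150)
print(length, protein_length(length))  # 1278 425
```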


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAAGA CCACGGGTGG CGGGTTCGGT GTCTCGTGTT TGAGTGCTTT GGCGCTGGTC 
GCCGGGCTGG CGATCGCCGC GCCCAAACCG GCGGCGGCGG GCGGTTCGGT CGAGGTGTTG
CATTGGTGGA CGGCGGGCGG CGAAGCCAAG GCGGTTTCGG CGCTGAAGGA TCAGTTCGAG
GCCGAGGGCG GCACCTGGAT CGATTCGCCG GTGGCCGGCG GTGGCGGCGA CGCGGCGATG
ACCGCTTTGC GCTCGCGGGT GATCGCCGGC AATCCGCCCT CGGCCGTGCA GCTCAAGGGG
CCGTCGATCC AGGAATGGGC GGCCGAGGGC GTGGTCGCCA ATCTCGATGA CATCGCCAAG
GCCGAAAACT GGGACAAGCT GCTGCCCGCC CTGCTGAAGT CGGTGGTGAC CTACGAGGGG
CATTACGTCG CCGTTCCGGT CAATATCCAC CGTGTCGATT GGCTGTGGGC CAATCCGGCG
GTTCTGGCCA AGGCCGGCGT CGCCGTGCCG ACCACCTGGG ACGAGTTCAA TACCGCCGCC
GAGGCCCTGA AGGCCAAGGG GATCATTCCG CTGGCCCATG GCGGCCAGCC CTGGCAGGAC
GCCACCTTGT TTGAAGTGGT GGTTCTGGGC CTGGGCGGGC CGGCCTTCTA CCACAAGGCC
CTGGTCGAAT TGGACGACGC CGCCCTGCGC GGCGATACCA TGGTCAAGGT GTTCGACCAG
ATGCGCCGCC TGCGCGGCTT CGTCGATCCC AACTTCTCGG GCCGCGACTG GAACCTTGCC
ACCGCCATGG TGATCAACGG CGAGGCCGGC TTCCAGATCA TGGGCGACTG GGCCAAGGGC
GAATTCCTGG GCGCGGGCAA GGTCCCGGGC AAGGATTTCC TGTGCATCGC CGCGCCGGGC
AAGGGTTTCT TGCTCAATTC CGACAGTCTG GTGATGTTCG ACGTCAAGGG CGCCGATAAG
ATCGAGGGTC AGAAGACCCT GGCCCGCTTG GTGCTGGGCG AAACGTTCCA ACGCACCTTC
AACACCCTGA AGGGCTCGAT CCCCGCCCGT CAGGGCATGG ATCTGGCCGA TTTCGACGCT
TGCGCCCAGA AATCCCAAGC CGACCTGACC AAAGCCATCG CCGCCGATAG CCTGGAGCCA
AGCATGGCCC ATGAAATGGC CGTTCCACGC TCGGTGCGCG GGGCGATCAT GGATGTGGTC
ACCGCGCATT TCAATTCAAG CGAATCCTCG GCCGAGGCCG TGGCCCATCT CGCCGACTCC
ATCGCCCAGG CCCGTTAG
 
Protein sequence
MRKTTGGGFG VSCLSALALV AGLAIAAPKP AAAGGSVEVL HWWTAGGEAK AVSALKDQFE 
AEGGTWIDSP VAGGGGDAAM TALRSRVIAG NPPSAVQLKG PSIQEWAAEG VVANLDDIAK
AENWDKLLPA LLKSVVTYEG HYVAVPVNIH RVDWLWANPA VLAKAGVAVP TTWDEFNTAA
EALKAKGIIP LAHGGQPWQD ATLFEVVVLG LGGPAFYHKA LVELDDAALR GDTMVKVFDQ
MRRLRGFVDP NFSGRDWNLA TAMVINGEAG FQIMGDWAKG EFLGAGKVPG KDFLCIAAPG
KGFLLNSDSL VMFDVKGADK IEGQKTLARL VLGETFQRTF NTLKGSIPAR QGMDLADFDA
CAQKSQADLT KAIAADSLEP SMAHEMAVPR SVRGAIMDVV TAHFNSSESS AEAVAHLADS
IAQAR
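
The protein sequence should correspond to a translation of the gene sequence under translation table 11. A minimal sketch in Python, assuming the sequence is pasted with the 10-base spacing shown above. Table 11 uses the same codon-to-amino-acid assignments as the standard code, so a single 64-codon table is enough here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# The amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGCGCAAGA CCACGGGTGG CGGGTTCGGT"))  # MRKTTGGGFG
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces and line breaks) gives a quick consistency check on the record.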