Gene Rru_A2302 details

Gene Information

Locus tag: Rru_A2302
Symbol:
ID: 3835730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2663888
End bp: 2664937
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 62%
IMG OID: 637826404
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_427389
Protein GI: 83593637
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
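
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS of the given length, assuming it ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2663888, 2664937)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

With the coordinates listed for Rru_A2302, this reproduces the 1050 bp gene length and the 349 aa protein length given in the record.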


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0869305
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGT TCCTATCGTC CCTGGTGCTC GCGGCGTCGC TTGTCGCCGG TCCGGCCCTG 
GCCGCCGATG CGCCGGTCGA TGTCTCGCAG GTGCCCAAGG GCTTTTCCGC CAAAGACGTT
GGCAAAAGCT ATTCCATCGC CACCGTGGTC AAGGTCGATG GCATCGCCTG GTTTGACCGC
ATGCGCGAAG GCGCCAAGCA GTTCGGCGCC GATACCGGCC ATGACACCTG GATGGTCGGG
CCCAGTCAGG CCGACGCCGC CGCCCAGGTG CAACTGGTCG AGAACCTGAT CGCCCAGGGC
GTCGACGCCA TCTGCGTCGT GCCCTTCTCG GTCGAGGCCC TGGAGCCCGT GCTCAAGAAG
GCGCGCGATC GCGGCATCGT CGTCATCGCC CACGAGGCCT CGAACATCAC CAACGCGGAT
TTCGTGCTCG AAGCCTTCGA CAACCTCGCC TATGGCGCCA AGCTGATGGA AGTGCTGGGC
ACCTATATGA AGGGCGAAGG CAAGTATGTG ACGACGGTCG GCAGCCTGAC CTCGAAGTCT
CAGAACGAAT GGATCGACGG CGCCATCGCC TATCAGAAGG CCCATTTCCC CAAGATGGAG
CAGGCGACCG GCCGGCTCGA GACCTATGAC GACGCCAATA CCGACTACAA CAAGCTCAAG
GAAGTGCTGA CCACCTATCC CGATATCAAG GGCATCCTTG GTGGTCCGAT GCCGACCTCG
GCCGGCGCCG GTCGCCTGAT TTCGGAACGC GGCCTGAAGG ACAAGCTGTT CTTCGCTGGC
ACCGGTCTGG TTTCGGTCGC GGGCGAATAT TTGTCCAAGG GCGATATCCA GTACATCCAG
TTCTGGGATC CGGCGGTGGC CGCCTATGCG ATGAACATCG TCGCGGTGAT GGCCCTTGAC
GGCAAGGCCG ATCAGATCAA GGCCGGCCTC AATCTGGGCC TGCCCGGCTA CACCAGCCTG
ACCGCCCCGG TGGCGGGCAA GGACAAGCTG CTCTATGGCG CGGGCTGGGT CGGCGTGACC
AAGGACAACA TGGAAGACTA CAACTTCTAA
 
Protein sequence
MKKFLSSLVL AASLVAGPAL AADAPVDVSQ VPKGFSAKDV GKSYSIATVV KVDGIAWFDR 
MREGAKQFGA DTGHDTWMVG PSQADAAAQV QLVENLIAQG VDAICVVPFS VEALEPVLKK
ARDRGIVVIA HEASNITNAD FVLEAFDNLA YGAKLMEVLG TYMKGEGKYV TTVGSLTSKS
QNEWIDGAIA YQKAHFPKME QATGRLETYD DANTDYNKLK EVLTTYPDIK GILGGPMPTS
AGAGRLISER GLKDKLFFAG TGLVSVAGEY LSKGDIQYIQ FWDPAVAAYA MNIVAVMALD
GKADQIKAGL NLGLPGYTSL TAPVAGKDKL LYGAGWVGVT KDNMEDYNF
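
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, assuming GC content is defined as the percentage of G and C bases over the full gene sequence. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition of GC content)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above, used only as a short demo input.
    demo = "ATGAAGAAGT TCCTATCGTC"
    print(len(clean_sequence(demo)), "bases")
    print(round(gc_percent(demo), 1), "% GC")
```

Pasting the full gene sequence into `clean_sequence` and comparing the resulting length and `gc_percent` value with the Gene Length and GC content fields is one way to check the record for internal consistency.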