Gene Rru_A2356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2356 
Symbol 
ID3835790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2742295 
End bp2743884 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content60% 
IMG OID637826464 
Productextracellular solute-binding protein 
Protein accessionYP_427443 
Protein GI83593691 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.458822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAAAA TAGTGATTGG CGCGGCTTCG GCCGTTATCC TCGCCATGGC GGCGAGCGGG 
GCCCAGGCCA AGACGCTGGT CTATTGCTCG GAAGGCAGCC CCGAGGGCTT CAATCCGGCT
TTTTACACCA CCGGCACGAC CTTCGACGCC ACCAGCAAGA ACATTTTCGA CAAGCTCGTT
CTTTTCAAGC GCGGCACCAC GGAGATCGAA CCCGGTCTGG CCGAGAGCTG GGAGGTTTCG
CCCGACGGCA AGACCTATAC CTTCCACCTG CGCAAGGGCG TGACCTTCCA CGACAGCGAC
ATCTTCAAGC CGACGCGGCA ATTCAACGCC GATGACGTGA TCTGGAGCTT CGAGCGTCAG
TTGAAGAAGG ATCACCCCTA TCACGCGGTT TCCGGCGGCA CCTACGACTA CTTCGAAGGC
ATGTCGATGA ACACCCTTCT CGAAAAGATC GAGAAGGTCG ACGATTATAC GGTGGTCTTC
CACCTGAGCC GCCCCGAAGC GCCGATGCTG GCCAATCTGG CCATGGACTT CGCCTCGATC
TTCTCGGCCG AATACGCCGA TAAGATGATG AAGGCCGGAA CCCCGGAAGT CGTTGACCAG
AAGCCGATCG GCACCGGTCC CTTCATGTTC CGCGGTTACC AGAAGGACGC CCAGATCCGC
TACGAGGCCA ATCCGACCTA TTGGCAGGGC AAGGCCGCCA TCGACCGCCT GGTTTTCGTC
ATCACCCCCG ACGCCAGCGT GCGCTACGCC AAGCTGAAGG CCGGCGAATG CCATGTGATG
CCCTATCCCA ATCCGGCCGA CCTGGAAGCC ATGAAGACCG ACAAGGCGGT CAACCTGATG
CACCAGGAAG GCCTGAACGT CGGCTATCTG GCCTATAACG TCGAGAAGAA GCCCTTCGAC
GACGTGCGCG TGCGCAAGGC CCTCAATCTG GCGATCGACA AGAAGGCGAT CATCGACGCC
GTTTATCAGG GCGCCGGCAC CGCCGCCACC AACCCGATCC CGCCGACGAT CTGGTCCTAC
AACAAGGCCG TCAAGGACGA CGCCTTCGAT CCGGCCGCCG CCAAGAAGCT GCTGGCCGAA
GCCGGGGTGA AGGATCTCAA GACCACCATC TGGGCAATGC CCGTCCAGCG CCCCTACAAC
CCCAATGCCC GCCGCATGGC CGAAATCCTT CAGGCCAACT GGAAGGCCGT GGGCGTGGAT
GCCGAAATCA CCTCCTACGA ATGGGGCGAA TACCTCAAGC GCGCCAAGGC CGGCGAGCAT
GAGACGGCGC TGTTTGGCTG GACCGGCGAC AATGGCGATC CCGATAATTT CCTGGCGGTT
CTGCTGGGCT GCGACGCCAT CCCCGGCAAC AACTATGCGC GCTGGTGCGA CAAGTCCTTT
GAAAACCTGA TCCAGAAGGC CAAGATCGCC ACCAGCCAGG AAGAGCGGGT GAAGCTCTAC
GAAGAGGCTC AGGTCATCTT CAAGGAGCAG GCCCCCTGGG CGACGATCGC GCATTCGGTG
GTCTACGAGC CGATTCGCAA GGAAGTTATC GACTATAAGA TAGATCCGCT TGGCGGACAT
ATCTTCTACG GCGTCGACCT CAAGAAATAG
 
Protein sequence
MRKIVIGAAS AVILAMAASG AQAKTLVYCS EGSPEGFNPA FYTTGTTFDA TSKNIFDKLV 
LFKRGTTEIE PGLAESWEVS PDGKTYTFHL RKGVTFHDSD IFKPTRQFNA DDVIWSFERQ
LKKDHPYHAV SGGTYDYFEG MSMNTLLEKI EKVDDYTVVF HLSRPEAPML ANLAMDFASI
FSAEYADKMM KAGTPEVVDQ KPIGTGPFMF RGYQKDAQIR YEANPTYWQG KAAIDRLVFV
ITPDASVRYA KLKAGECHVM PYPNPADLEA MKTDKAVNLM HQEGLNVGYL AYNVEKKPFD
DVRVRKALNL AIDKKAIIDA VYQGAGTAAT NPIPPTIWSY NKAVKDDAFD PAAAKKLLAE
AGVKDLKTTI WAMPVQRPYN PNARRMAEIL QANWKAVGVD AEITSYEWGE YLKRAKAGEH
ETALFGWTGD NGDPDNFLAV LLGCDAIPGN NYARWCDKSF ENLIQKAKIA TSQEERVKLY
EEAQVIFKEQ APWATIAHSV VYEPIRKEVI DYKIDPLGGH IFYGVDLKK