Gene Rru_A2007 details

Gene Information

Locus tag: Rru_A2007
Symbol:
ID: 3835432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2318681
End bp: 2319718
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 69%
IMG OID: 637826107
Product: hypothetical protein
Protein accession: YP_427094
Protein GI: 83593342
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
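
The start and end coordinates, gene length, and protein length listed in this record agree with one another. A minimal sketch of that arithmetic follows, assuming inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of residues for an unspliced CDS: codons minus the stop codon.

    Assumes the gene length is a whole number of codons.
    """
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(2318681, 2319718)
print(length)                  # 1038
print(protein_length(length))  # 345
```

Both values match the Gene Length (1038 bp) and Protein Length (345 aa) fields in the record.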


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.21006
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGTTT CTTCCGCTGC TCCGCGTCCG GGCGATCGTG CCGCCGGCGC TCCCGACGGC 
GCCCCTGACA GTGCCCCCGA TGGCGCCCCC GATGGCGTCC ATGATACGGC GCGCGGCGTC
GCCTGGGTTC TGGTCGCCTG CGTCATCTTC ACCCTGGTTT ATGCTTCGGG CCGTCTGGCC
GGCGGCGGTG TCGGCGGCCT GCAGATCATG GCCATCCGCT ATGCCTCGGC CCTGGTTTGC
ATCGTCGGCT TCGCCTTGGC CCGGCGCGGC GGGGTGAGCG CCTGCCGCAG CCCGCGCCCC
TTTCGCCATT TCAGCCGGGC GATGATGGGC GGCCTGGGCG GCGCCTGCCT GATCCAGGGG
GTGACCTTGC TGCCGATGGC CGAGGCCTCG GCCATCGGTC TGCTGGACGG CGTCTTCTCG
GTGATCCTGG GCATCGTGCT GCTGGGCGAG CGGGTGAGGC CCCTGCGCTG GCTTGGCGTG
GCGCTGTCGC TGGTGGGGGG CGTGCTGGTG ATCGCCGGAC GGGCCGACCT CAGCGGCTTG
CTTGATCACC TGCTGGCCGG CGGGGCGGTG TTCTATCCCC TGGCTGGCGC CGCCTTCGTC
GCCATGGAAC GCGTTCTCAT GCGTCAATTG GCGCTGCGCG AGGGCAAGAT GGCGATCTTG
TTCCACGTCA ATTTGTTTGG CACGCTGATC TTGATGCCGG TCGCCTTGAT GACCTGGGTG
CCCCTTGAGG GGCCGACGCT GGCCTTGCTG ATCGCCTTCG GGCCGCTGGC CCTGCTGGGC
CAGTTCTGCA ATATCCGGGG GTTCGCCCTG GCCGAGGTCT CGATCACCGG ACCGGTGTGG
TATTCCTGGC TGATCTTCGC CGCCGCCCTC GGCTGGGTGA TGTTTGACGA GGTTCCCGGG
CCGGGCGTCA TCCTGGGTGG CGCCGTTATC GCCCTCGGCG GAGTTTTCCT CTGCGCTTCG
GGGCGCAAGC GGACCCCGGC CCTACCGGCC GATACCCTGG GCCCTTCGGA AACCCCCGGC
AAGGCAGGGC GCGAATGA
 
Protein sequence
MAVSSAAPRP GDRAAGAPDG APDSAPDGAP DGVHDTARGV AWVLVACVIF TLVYASGRLA 
GGGVGGLQIM AIRYASALVC IVGFALARRG GVSACRSPRP FRHFSRAMMG GLGGACLIQG
VTLLPMAEAS AIGLLDGVFS VILGIVLLGE RVRPLRWLGV ALSLVGGVLV IAGRADLSGL
LDHLLAGGAV FYPLAGAAFV AMERVLMRQL ALREGKMAIL FHVNLFGTLI LMPVALMTWV
PLEGPTLALL IAFGPLALLG QFCNIRGFAL AEVSITGPVW YSWLIFAAAL GWVMFDEVPG
PGVILGGAVI ALGGVFLCAS GRKRTPALPA DTLGPSETPG KAGRE
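
The sequences in this record are printed in space-separated blocks of ten characters. The sketch below shows one way to join such blocks and compute a GC fraction of the kind reported in the GC content field. It is illustrative only: the helper names are hypothetical, and the stop-codon set is the usual TAA/TAG/TGA set, assumed here to apply under translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for this check


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGGCCGTTT CTTCCGCTGC TCCGCGTCCG GGCGATCGTG CCGCCGGCGC TCCCGACGGC"
last_line = "AAGGCAGGGC GCGAATGA"

head = clean_sequence(first_line)
print(len(head))                                  # 60
print(head.startswith("ATG"))                     # True
print(ends_with_stop(clean_sequence(last_line)))  # True
print(f"{gc_content(head):.0%}")                  # GC fraction of the first line only
```

Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value that can be compared with the GC content field.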