Gene Rru_A3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3120 
Symbol 
ID3836566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3598607 
End bp3600178 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content69% 
IMG OID637827235 
Producthypothetical protein 
Protein accessionYP_428202 
Protein GI83594450 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.548288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGGG GAAGAGAACC GCTGGTCGCA ACCGCGCCTT TGGCCCCCCG CTGGGCGATC 
GGCGGCGCGG ACGCTTCGGC TTTGGCCCTG GGAACGGCGA TCTGCCTGGG GATCTTTCTG
CTGTTTTTCG AGACCTTCGG CTCGATGATC GGGCTGTGGG CGACCATCGC CGATTACCAT
CATTCCTTCG CCGTCATCCC CTGCGCCCTG TGGCTGGGGT GGGAGCGGCG CGACCTGGTG
CGCGGCGTGG CCTTCGCCCC CTGGCCTTTG GCCTTCCTGG CGGTCGTCGG CGCCGCCCTG
CTCTGGCTGG CCGGACGCCT GGGCAGCGTG CAGTTCGTCG AGCATCTCGG CGTGATCTTG
CTGCTGCAAA GCGCCCTGAT CGCCGCCCTT GGCCAAGGGT TCGCCAAGGC GATGGCCCTG
CCTTTGGCCT TTCTGATCTT CCTTCTGCCC ATCGGCGACG CCCTGATCCC CTCGCTCCAG
GCCATCACCG CCTCGCTGGC GGCCAACGGT CTGGCGCTGT TTGACGTGCC GGTCTTTCAC
GACGGGGTTT TCCTGATCAC GCCCTTTGGC AATTTCGAGG TCGCCCAGGA ATGCTCGGGG
TTGCGCTTCC TGGTCGCCAC CTTGACCTTC GGGGCGGTGA TCAGCGCCGT CGTCCACCGC
TCGGCGTGGC GACGGGCGCT GTTTTTGCTC TCTTGCGCTA TCGTGCCCGT TCTGGCCAAT
GGCCTGAGGG TGATCGGCAT CATTTTACTG GCCCGACACG CCGGCCACGA AACCGCCGCC
GCTTTCGATC ATGTCATCTA TGGCTGGGTT TTCCTGAGCC TGGTCAGCGT CTTCATCTTC
GCCCTTGGCT GGCTGACGCG CGAGCCGCCG CGACCGCTCC GGCGCAAGGG CGCGCACCCG
CCGTCTTCGG CGCGCCCCTT GCGCCCGCTG CGGACGGGGA TCGCGGTTCT CGCCCTGGCC
GCCACCCCGG TCGCGGCCAA TGCGGCGATC GAGACCTCGC TGGCCGGCCG CCTCGCCCCA
GCGGCGCTGA TCCTGCCGGC GCCGGGCCCA GCCAGCACCG CCGATCCGCT GTGGACTCCC
CAGATCCCCG GCGCCGATCT CGTTGATCAT CGGGCCTTCC TGATCGGCGG CCGGGCGGTG
GACCGGGTGA TCGCCTATTT CACCCACCAG CGCCAAGGCG CCGAGGCCGT TTCGGCCACC
ATCGATCCGG CCGGGCTGGA GATGGTCAGC CAAGGCGAGC GCGTCGTCGC CCTTACCGAC
CCCGTCGCCG GCTTGGCGGC GGTTCGCGAA AGAGTGATCG CCGGACCGGG CCGCCAGCGC
CTTGTCTGGT CGTGGCTTTG GGTCGGTGGC CATTTCACCA CCGATGCCAA GGCGGCCAAG
ATCTTCCAGG TGCTGGGCGT CTTGCCCGGC GGCCAGCCCG CCGCCGCCCT GGTGATGATC
TCGGTCCCGC TCGCCGCCGC CCCGGCATCC CCCCAGGCTC TCGACGACGC CCGGGCCCTG
CTTGGCCGCT TCCTCGCCGG CCAAGGGGAT CTCGGCCCGG GCCTCGCCCT CGCCCAAGAG
AAGGACCGCT GA
 
Protein sequence
MNRGREPLVA TAPLAPRWAI GGADASALAL GTAICLGIFL LFFETFGSMI GLWATIADYH 
HSFAVIPCAL WLGWERRDLV RGVAFAPWPL AFLAVVGAAL LWLAGRLGSV QFVEHLGVIL
LLQSALIAAL GQGFAKAMAL PLAFLIFLLP IGDALIPSLQ AITASLAANG LALFDVPVFH
DGVFLITPFG NFEVAQECSG LRFLVATLTF GAVISAVVHR SAWRRALFLL SCAIVPVLAN
GLRVIGIILL ARHAGHETAA AFDHVIYGWV FLSLVSVFIF ALGWLTREPP RPLRRKGAHP
PSSARPLRPL RTGIAVLALA ATPVAANAAI ETSLAGRLAP AALILPAPGP ASTADPLWTP
QIPGADLVDH RAFLIGGRAV DRVIAYFTHQ RQGAEAVSAT IDPAGLEMVS QGERVVALTD
PVAGLAAVRE RVIAGPGRQR LVWSWLWVGG HFTTDAKAAK IFQVLGVLPG GQPAAALVMI
SVPLAAAPAS PQALDDARAL LGRFLAGQGD LGPGLALAQE KDR