Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3120 |
Symbol | |
ID | 3836566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3598607 |
End bp | 3600178 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827235 |
Product | hypothetical protein |
Protein accession | YP_428202 |
Protein GI | 83594450 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase) [TIGR02914] EpsI family protein [TIGR03109] exosortase 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.548288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGGG GAAGAGAACC GCTGGTCGCA ACCGCGCCTT TGGCCCCCCG CTGGGCGATC GGCGGCGCGG ACGCTTCGGC TTTGGCCCTG GGAACGGCGA TCTGCCTGGG GATCTTTCTG CTGTTTTTCG AGACCTTCGG CTCGATGATC GGGCTGTGGG CGACCATCGC CGATTACCAT CATTCCTTCG CCGTCATCCC CTGCGCCCTG TGGCTGGGGT GGGAGCGGCG CGACCTGGTG CGCGGCGTGG CCTTCGCCCC CTGGCCTTTG GCCTTCCTGG CGGTCGTCGG CGCCGCCCTG CTCTGGCTGG CCGGACGCCT GGGCAGCGTG CAGTTCGTCG AGCATCTCGG CGTGATCTTG CTGCTGCAAA GCGCCCTGAT CGCCGCCCTT GGCCAAGGGT TCGCCAAGGC GATGGCCCTG CCTTTGGCCT TTCTGATCTT CCTTCTGCCC ATCGGCGACG CCCTGATCCC CTCGCTCCAG GCCATCACCG CCTCGCTGGC GGCCAACGGT CTGGCGCTGT TTGACGTGCC GGTCTTTCAC GACGGGGTTT TCCTGATCAC GCCCTTTGGC AATTTCGAGG TCGCCCAGGA ATGCTCGGGG TTGCGCTTCC TGGTCGCCAC CTTGACCTTC GGGGCGGTGA TCAGCGCCGT CGTCCACCGC TCGGCGTGGC GACGGGCGCT GTTTTTGCTC TCTTGCGCTA TCGTGCCCGT TCTGGCCAAT GGCCTGAGGG TGATCGGCAT CATTTTACTG GCCCGACACG CCGGCCACGA AACCGCCGCC GCTTTCGATC ATGTCATCTA TGGCTGGGTT TTCCTGAGCC TGGTCAGCGT CTTCATCTTC GCCCTTGGCT GGCTGACGCG CGAGCCGCCG CGACCGCTCC GGCGCAAGGG CGCGCACCCG CCGTCTTCGG CGCGCCCCTT GCGCCCGCTG CGGACGGGGA TCGCGGTTCT CGCCCTGGCC GCCACCCCGG TCGCGGCCAA TGCGGCGATC GAGACCTCGC TGGCCGGCCG CCTCGCCCCA GCGGCGCTGA TCCTGCCGGC GCCGGGCCCA GCCAGCACCG CCGATCCGCT GTGGACTCCC CAGATCCCCG GCGCCGATCT CGTTGATCAT CGGGCCTTCC TGATCGGCGG CCGGGCGGTG GACCGGGTGA TCGCCTATTT CACCCACCAG CGCCAAGGCG CCGAGGCCGT TTCGGCCACC ATCGATCCGG CCGGGCTGGA GATGGTCAGC CAAGGCGAGC GCGTCGTCGC CCTTACCGAC CCCGTCGCCG GCTTGGCGGC GGTTCGCGAA AGAGTGATCG CCGGACCGGG CCGCCAGCGC CTTGTCTGGT CGTGGCTTTG GGTCGGTGGC CATTTCACCA CCGATGCCAA GGCGGCCAAG ATCTTCCAGG TGCTGGGCGT CTTGCCCGGC GGCCAGCCCG CCGCCGCCCT GGTGATGATC TCGGTCCCGC TCGCCGCCGC CCCGGCATCC CCCCAGGCTC TCGACGACGC CCGGGCCCTG CTTGGCCGCT TCCTCGCCGG CCAAGGGGAT CTCGGCCCGG GCCTCGCCCT CGCCCAAGAG AAGGACCGCT GA
|
Protein sequence | MNRGREPLVA TAPLAPRWAI GGADASALAL GTAICLGIFL LFFETFGSMI GLWATIADYH HSFAVIPCAL WLGWERRDLV RGVAFAPWPL AFLAVVGAAL LWLAGRLGSV QFVEHLGVIL LLQSALIAAL GQGFAKAMAL PLAFLIFLLP IGDALIPSLQ AITASLAANG LALFDVPVFH DGVFLITPFG NFEVAQECSG LRFLVATLTF GAVISAVVHR SAWRRALFLL SCAIVPVLAN GLRVIGIILL ARHAGHETAA AFDHVIYGWV FLSLVSVFIF ALGWLTREPP RPLRRKGAHP PSSARPLRPL RTGIAVLALA ATPVAANAAI ETSLAGRLAP AALILPAPGP ASTADPLWTP QIPGADLVDH RAFLIGGRAV DRVIAYFTHQ RQGAEAVSAT IDPAGLEMVS QGERVVALTD PVAGLAAVRE RVIAGPGRQR LVWSWLWVGG HFTTDAKAAK IFQVLGVLPG GQPAAALVMI SVPLAAAPAS PQALDDARAL LGRFLAGQGD LGPGLALAQE KDR
|
| |