Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3126 |
Symbol | |
ID | 3836572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3605249 |
End bp | 3606325 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827241 |
Product | hypothetical protein |
Protein accession | YP_428208 |
Protein GI | 83594456 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.764656 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGA TCAAGCTTTC GTCCTTCGCC GCCTTCAACG AGCGCTTCGG CGCCGCATTG AACGTCGGCA CGATCGGCGG CTTTTTTCAT TCCCCCGAAT GGTTCGCCTG CCTGTTCGCC CATGGCCTGG AACCCGCCGA CGCCGGGGCG ACCCCGGCGA TCTTCGTCCT CGATCAGGAG GGTCCCCGGG CCGCGCTATT TTGCCTGCGC CGCCAGAACG GCATCTTGCG CAGCCTGACC AGCTATTACA CCACCGACTA CACCGCCAGT TGCCCGGCCG GCGGCGGCGA GCGGGCGGCG ATCATCGAAG ATCTTGGCCG TCATCTGTCC CAAAAGCTGG GGCGACGGGG CGCGCCATCG CTGGAACTGC GCTATCTGCG GGCGGACTCG CCCGATCTGG CGGCTTTGGA GCGCGGGTTG CGCCGACCGC CGCTGGTCCA CGGCCGCTTC GCCCAATGGC AAAACTGGTA TGCGCCGGTC GAGGCTGGGG CCTATGGCGC TTACGCCGCC GCCCGTCCCT CGCGCCTGCG CCACACCCTG GCCCGCAAAA AGGCGAGGCT GGAGCGCCAG GGCACGATGA CCCTGCGCCT CCACCGCACC GCCGATGCCG GGGCTTTGGC CGCCTATGAG ACCGTCTATG GCGCAAGCTG GAAGCCGGCC GAAGCGTCAC CGGACTTTAT ACGCGCCCTG GCCGTCACGG CGGCCAAGGC CGGGGCCCTG CGCCTGGGGG TTCTCCACAT CGACGGCGCC CCGGCGGCGG CCCAGATCTG GCTGGTCTCG GCCGGGCGGG CGACCATCTA CAAGCTGGCC CACCATCCGC GTTTCGATGA CCTGTCGGTC GGCTCGCTGC TCACGCAAGC CCTGGCCGAG GAGGTGATCA ACCGCGACGG CGTGAGCGAG ATCGACTTCG GCCTGGGCGA CGAGCCCTAT AAGCGCGACT GGATGACCGC GGAGCGCCCG GTGGTCGGGC TGGAGATCCA GTCTCCGACC ACGGCGCGCG GCCTGATCGG CTGCGCCCGC CTCGGCCTGG GCCGCCTGCG CCGATCCCTT CGCCCGGTCA ATACAACTCG GTCCTGA
|
Protein sequence | MSLIKLSSFA AFNERFGAAL NVGTIGGFFH SPEWFACLFA HGLEPADAGA TPAIFVLDQE GPRAALFCLR RQNGILRSLT SYYTTDYTAS CPAGGGERAA IIEDLGRHLS QKLGRRGAPS LELRYLRADS PDLAALERGL RRPPLVHGRF AQWQNWYAPV EAGAYGAYAA ARPSRLRHTL ARKKARLERQ GTMTLRLHRT ADAGALAAYE TVYGASWKPA EASPDFIRAL AVTAAKAGAL RLGVLHIDGA PAAAQIWLVS AGRATIYKLA HHPRFDDLSV GSLLTQALAE EVINRDGVSE IDFGLGDEPY KRDWMTAERP VVGLEIQSPT TARGLIGCAR LGLGRLRRSL RPVNTTRS
|
| |