Gene Information |
Locus tag | Rru_A3721 |
Symbol | |
ID | 3837178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4271848 |
End bp | 4272936 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637827846 |
Product | sugar transferase |
Protein accession | YP_428802 |
Protein GI | 83595050 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
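The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the record for Rru_A3721.
length = gene_length_bp(4271848, 4272936)
print(length, protein_length_aa(length))  # prints: 1089 362
```

Under these assumptions the computed values agree with the listed Gene Length (1089 bp) and Protein Length (362 aa).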
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.243024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCCG CAGAGACATC CTTGCACGCC TTGGCCGAAG ACAAGCGCGG CACGAGGCGC CCGTCCTCGC CGTCCCGCCT TCCTAGGCCG ATGCGCGTCG CCCTGGTCGG CGCCGGTCTT CCCGCCGCCG CCCTGATCGA ATTCCTCCAA TCCGACGACG CCCGCGACGA TTACACCGTC GTGGCGATCT TCGATGAACG CGGCGATCGC CGGCCACCGG TGTTGGGTGC CAAACCGGTG GAAAAGGGCC TGAGCGGCCT GCGCGACCTT GCCGAGGCCG GCAAGATCGA CGCCATTCTT CTGACGCTCT ACGGCGCCTC CTCGGCGCGC ATGTTCGAGA TCATCGAGCG CATCGGCACC ACCGCCGTCG ATATCTATCT GCCGCGCGAG CACAAAGATA AGCACTTCTC GTGGACCAGC TATCACCTGA TCGGCGGCCT GCCGTTCCTG TGCATCAAGG GGCGGCCCTT CCAGGGGTTC GCCGGCGTTT TCAAAAGGAT CGAGGATTAT ACCCTGGCGG TTCTGGCGCT GACGCTGATC GGCCCGATCC TGCTGCTGGC CATGCTGGCG ATCCGCCTGG AAAGCCCCGG CCCGGCCCTG ATCCATCAGC GGCGCATCGG CCTTGGCGGG AAGCTGTTTT CCATGCTCAA GTTGCGCTCC ATGCGCTTCG ATCCCAATGA CGACGGGCGG AACGGCGCCA TCGCCAATGA TCCGCGCATC ACCCGGGTCG GCGCCTTCCT GCGCGCCACC AGCATCGACG AGCTGCCCCA GGTCCTCAAT GTCCTGCGCG GCGACATGTC GATGATCGGC CCGCGTCCCC ATGTTCCCAA CATGCTGGTC GAAAACACCG TTTACGGGGT GAGTGTCAGC GAATACGTCG CCCGCCACCG GGTGCGGCCG GGGATCACCG GTTGGGCCCA GGTGAATGGC ATGCGCGGCG GCATTCATGA TATCGAAAAG GCCCGGCGCG GCGCGCAACT GGATATCTAT TACATCGAGA ACTGGACACC ATGGCTCGAC ATCAAGATCC TCTGGCGGAC GATTTTCGGC GGCCTGCGCG ACCCCTCCGC TCTGCGCTCG CCGGGCTGA
|
Protein sequence | MASAETSLHA LAEDKRGTRR PSSPSRLPRP MRVALVGAGL PAAALIEFLQ SDDARDDYTV VAIFDERGDR RPPVLGAKPV EKGLSGLRDL AEAGKIDAIL LTLYGASSAR MFEIIERIGT TAVDIYLPRE HKDKHFSWTS YHLIGGLPFL CIKGRPFQGF AGVFKRIEDY TLAVLALTLI GPILLLAMLA IRLESPGPAL IHQRRIGLGG KLFSMLKLRS MRFDPNDDGR NGAIANDPRI TRVGAFLRAT SIDELPQVLN VLRGDMSMIG PRPHVPNMLV ENTVYGVSVS EYVARHRVRP GITGWAQVNG MRGGIHDIEK ARRGAQLDIY YIENWTPWLD IKILWRTIFG GLRDPSALRS PG
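The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: the helper names are hypothetical, the codon table covers only the elongation assignments of translation table 11 (which match the standard code), and alternative start codons are not handled (this gene begins with ATG, so that does not matter here). Whitespace is stripped because the sequences are displayed in blocks of ten characters.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in T, C, A, G order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
prefix = "ATGGCTTCCG CAGAGACATC CTTGCACGCC"
print(translate(prefix))    # prints: MASAETSLHA
print(gc_fraction(prefix))  # prints: 0.6
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.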