Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1007 |
Symbol | |
ID | 3909131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1152771 |
End bp | 1154021 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637882900 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_484628 |
Protein GI | 86748132 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.643015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCATGA CGATCACCAC AGACACCCGG ACCGGGCACG ATCGCATCGG CGAGACCGAT GCAGCGTCAC CACAGCCGAC GCCGGCGCGC GACGCCGGGG CCGCGCCGGC GCGGCGCACG CGCGTGGTCG TGATCCAGAC CCAGGCGGAG AACGCCGGCG CGCAGGAGAT TTCGCGGCTG GTCGGCGCCG GGCTCGCCGC GCGCGGCTAC GACGTCCACA ATCTGTTCTT CTTCCGGCAG TCGCGCTCGT TCGACGAGCC GGCGCAGACG ACGTATTGCG CGGCCCGCCG GCCGGGCGAT CCGCTGTCGT TCCTGCGGTT TCTCGGCGCG CTCTACGCGC GCATCCGGAC GCTTCGGCCC GACGTGGTGC TGACCTTCCA GCATTACGGC AATGCGATCG GCGGGATCGC CGCGCGGCTG GCGAGCCCGG CGCCGGTGAT CGCCAACCAG GTGTCGGCGC GATTGACGAT GCCGGCCTGG CTGCGCGGCG TCGATCGGAT CATGGGCCAG CTCGGCGTGT TCGAGACCAT CACGGTCAAC TCGCACGACA TGCTGCGCGA CTATTCGCGC TATCCCGACG GCTATCGCAG GCGGCTGCAG CACGTGCCGC ACGGCTTCGA CCAGAAGCAC GCGACCATGT CGAAGGCGGA CGCGCGCCGG CAATTCGGGC TCAGGCCGGA TGCGGTCATT CTCGGCTCCG CCGCGCGGCT GCATCCGCTG AAGCAGCTCG ACGCCGCCAT CCGCGTGCTG GCGCAGCGGC CGGACTGGCG CCTCGCGCTG GCGGGCCAGG GCCCCGACGA GGCGCGCCTG CGCGAACTCG CCGACGGCCT CGGCGTGTCC GACCGCATCA CCTTCATCGG CGAGATCTCG CCCGAGCAGG TCGCGAACTT CCTGGCCTGC CTCGACGTGT TCGTGTTTCC CTCGCTGGCC GAGACCTTCG GCCTCGCCGC GGTCGAGGCC GCCCATGCCG GCGTGCCGGT GGTCGCCAAC GATCTGCCGG TGCTGCGCGA AGTGCTGTCG GCGCAAGGCG AACCGGCGGC ATTGTTCGTC GATGCGGCGG ACCCCGCCGC GATGGCGAAC GCGATCGCCC GGGCGCTCGA CGACGACGCG CTCCGCGCGC AGCTCCGCCG CGCCGGCGAC GGGCTGAAGT CGCGCTACGC GGTCGACGCC ATGGTCGACG AGTATGTCCG CATCATCGAG GGCGCAACGC AGCCGGCAGC GCGACGGCAA GGAGCCGGCC GTGATCGTTG A
|
Protein sequence | MTMTITTDTR TGHDRIGETD AASPQPTPAR DAGAAPARRT RVVVIQTQAE NAGAQEISRL VGAGLAARGY DVHNLFFFRQ SRSFDEPAQT TYCAARRPGD PLSFLRFLGA LYARIRTLRP DVVLTFQHYG NAIGGIAARL ASPAPVIANQ VSARLTMPAW LRGVDRIMGQ LGVFETITVN SHDMLRDYSR YPDGYRRRLQ HVPHGFDQKH ATMSKADARR QFGLRPDAVI LGSAARLHPL KQLDAAIRVL AQRPDWRLAL AGQGPDEARL RELADGLGVS DRITFIGEIS PEQVANFLAC LDVFVFPSLA ETFGLAAVEA AHAGVPVVAN DLPVLREVLS AQGEPAALFV DAADPAAMAN AIARALDDDA LRAQLRRAGD GLKSRYAVDA MVDEYVRIIE GATQPAARRQ GAGRDR
|
| |