Gene Information |
Locus tag | RPB_4101 |
Symbol | |
ID | 3911908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4669760 |
End bp | 4670932 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637886005 |
Product | glycosyl transferase family protein |
Protein accession | YP_487705 |
Protein GI | 86751209 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
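
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one residue per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4669760, 4670932)
print(length_bp, protein_length(length_bp))  # prints: 1173 390
```

Under these assumptions the result agrees with the Gene Length (1173 bp) and Protein Length (390 aa) fields.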
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0116147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.150084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTGAAC AGTCATCGAC CAGGATCTCC GCCGCGACTT ACGGCGCTGT GGTCGCCATT CCTGCTCACA ATGAGGCCGA CACGATCCGT CGGTGCCTGG CGGCGCTGGC GATGCAGCGC GACGAGTCCG GTTGCCCGGT GCGCGCCGGG GCATTCGAAA TTCTGATCTA CGCCAACAAT TGCAGCGACA GCACCGTCGA GGTGGTCCGT CACTTTGCCT GCTGCATTCC GCATCCGATC ATCGTGATCG AAGCACAATT GCCGCCGTCG CAACTCTCGG CCGGCGCGGC ACGCAAGACG GCTATGGATC TCGCTGCCGC GAGGCTCGCC GAGCGTGGCG CAGCCGACGG GGTGATCCTC ACGACCGATG CAGACAGCTG CGTAGCGCCG ACCTGGTTCT CGACGACGAT GCGGGAATTG AGCGGCGGTG TGGATTGCGT TGCGGGATAC ATCGATGCCG AACCGCTCGA ACTGGTCGGC CTCGGGCCGG CGTTTCTGGC GCGCGGTCGG CTCGAAGACG CGTATCTGAG ATTGATCGCC GAAATCGACG CCCGTTGCGA TCCCCGCCGC CATGATCCCT GGCCGAACCA CCGTGTCGCG TCCGGCGCGA GCCTGGCCGT GGTGTTGAAG GCCTATCTGG CCATCGGCGG GTTGCCGCTG CGCGCGGTGG GCGAGGATGC CGCCCTCACC GCTGCGCTCG ACCGCGGGGG GTTCAAGGTG CGACATTCCA TGGCCGTCTC GGTGACGACG TCGTGCCGGC TCGACGGTCG TGCGCAGGGC GGCGCCGCCG ATACGATGCG GCTGCGCCAC GCGATGCCGG ACGCGCCCTG CGACGACGAT CTCGAGCCGG CGTTGCAGGC GACCCGCCGC GCCATCTATC GCGGACGTCT GCGCCGGCTG CTGGACGAAC AAAGGTATCG CGCCCGGCAG GTTCAGGATA TTCCGGCTCA GCAGCCACCG CGCCCAGGCG CTACGTTCGA CGAGGCGTGG CAGCAGCTTT GTCGCGACAA TCCGGTTCTT CGCCGCGGCG GGTCGTTGCG ACCGTCCGAT CTGCCGCGGC AGATCGCCGT TGCGACCATG GTGCTACGGC ATCTGCGGCT GCCGCTCAGT GCGACGACAG TCGTTCCAGC CGATATGTCG CGTCGCGAAC GATGGCTCGA GCCGGCAGCC TGA
|
Protein sequence | MFEQSSTRIS AATYGAVVAI PAHNEADTIR RCLAALAMQR DESGCPVRAG AFEILIYANN CSDSTVEVVR HFACCIPHPI IVIEAQLPPS QLSAGAARKT AMDLAAARLA ERGAADGVIL TTDADSCVAP TWFSTTMREL SGGVDCVAGY IDAEPLELVG LGPAFLARGR LEDAYLRLIA EIDARCDPRR HDPWPNHRVA SGASLAVVLK AYLAIGGLPL RAVGEDAALT AALDRGGFKV RHSMAVSVTT SCRLDGRAQG GAADTMRLRH AMPDAPCDDD LEPALQATRR AIYRGRLRRL LDEQRYRARQ VQDIPAQQPP RPGATFDEAW QQLCRDNPVL RRGGSLRPSD LPRQIAVATM VLRHLRLPLS ATTVVPADMS RRERWLEPAA
|
| |
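
The gene sequence begins with GTG, while the protein sequence begins with M. Under translation table 11, GTG is an accepted start codon, and the initiating residue is methionine regardless of what the codon would encode internally (valine for GTG). The sketch below illustrates this on the first 30 bases of the gene sequence. It is a hand-rolled illustration, not the database's own method: `translate_cds` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only a subset of the table 11 start codons.

```python
# Amino acids of the standard genetic code with codons ordered T, C, A, G
# at each position. Table 11 uses the same codon-to-residue assignments and
# differs from the standard table in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the table 11 start codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or a prefix of one) that begins at its start codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a recognized start codon")
    residues = ["M"]                    # the start codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGTTTGAAC AGTCATCGAC CAGGATCTCC"))  # prints: MFEQSSTRIS
```

The output matches the first ten residues of the protein sequence listed above.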