Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4183 |
Symbol | |
ID | 3972540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4648272 |
End bp | 4649273 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637927286 |
Product | glycosyl transferase family protein |
Protein accession | YP_534027 |
Protein GI | 90425657 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGTG GCGACCGCCT GCAGATCATC GTGCCTTGTT ACAACGAGGA GCAGGTGCTG CAATCCACCG CCGCCACGCT GGCCTCCGTG ATCGAGTCCT GCATCAGCGC GGGCCTGATC GCGCCGTCGA GCGCGGTGCT GTTTGTCGAT GATGGATCCT CGGACCAGAC ATGGCGTCTG ATCGAAGACT TGCACGCCGC CGATCCGCAG CGTTTCGACG GCGTTCGGTT GTCCGCCAAT CGCGGACACC AGGCGGCGCT GTGGGCGGGG CTGTCGACCG CGGATGCCGA TCTTGTCGTG TCGATCGATG CCGATCTGCA GGACGACCCC CAGGCGATCG TCAGGATGAT CAAGGAATAC GACGCCGGCG CAGACGTGGT ATTCGGACTG CGCTCGAATC GCGAAAGCGA CGGCTGGTTC AAGCGCAGTT CGGCCACGCT GTTCTACCGC TTGTTGCGCC TGCTGGGCGT CAACATCGTG CCGCAGCACG CCGATTTCCG GTTGATGAGC CGCCCGGCGA TCGACGCCTT GCTGCAATAC TCGGAATCCA ACTTGTTCCT ACGGGCGTTG GTGCCGCAGC TCGGCTTCGC CACGGCGCAG GTCAGCTACC CCCGAACGTC GCGGGCCGCG GGAACCACCA AGTACCCTAT CGGCAAGATG CTCGGTCTGG CCATTGACGG GATCACCTCC TGGTCGGTGG CGCCGCTTCG GGCGATCGGG TTGCTTGGCC TGACGGTGTC GGCAATGGCC TTCTTGCTCG GTCTGTGGGC GCTGTGGGCC GCGCTGTTCA CCCATGCGAC CATCCCGGGC TGGGCCTCGA TCATGCTGCC GCTGCTGTTC TCCCAGGGCT TGCAGTTCAT CTTCCTGGGC CTAATCGGCG AGTACATCGG TAAGATCTTC GTGGAGACCA AGCGCCGGCC GAAATTCATC ATCCGCGCCC GGGCGGGGAC GAACCCGCGC TCGGCCGCGG CCCGCGCCGA GCGCGCCGAG AAAGTGAACT GA
|
Protein sequence | MASGDRLQII VPCYNEEQVL QSTAATLASV IESCISAGLI APSSAVLFVD DGSSDQTWRL IEDLHAADPQ RFDGVRLSAN RGHQAALWAG LSTADADLVV SIDADLQDDP QAIVRMIKEY DAGADVVFGL RSNRESDGWF KRSSATLFYR LLRLLGVNIV PQHADFRLMS RPAIDALLQY SESNLFLRAL VPQLGFATAQ VSYPRTSRAA GTTKYPIGKM LGLAIDGITS WSVAPLRAIG LLGLTVSAMA FLLGLWALWA ALFTHATIPG WASIMLPLLF SQGLQFIFLG LIGEYIGKIF VETKRRPKFI IRARAGTNPR SAAARAERAE KVN
|
| |