Gene Information |
Locus tag | RPC_3742 |
Symbol | |
ID | 3970337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4166500 |
End bp | 4168041 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637926852 |
Product | glycosyl transferase family protein |
Protein accession | YP_533596 |
Protein GI | 90425226 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
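The length fields in this record are consistent with each other, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the database record: the function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes that the Start bp and End bp coordinates are 1-based and inclusive and that the annotated span includes the stop codon.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 4166500
END_BP = 4168041


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumption: the CDS span includes the stop codon, which does not
    encode a residue, so one codon is subtracted.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1542 513
```

Both values match the Gene Length (1542 bp) and Protein Length (513 aa) fields listed above.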
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00121597 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.955907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGCGGC TGACGCAGCC GCAGAGCGAG GATCGGGGCG CCGTGCGGCG CGCGGTTTTG ATCGTCCTGG TCCTGGTCGG GCTGCGTCTG GTCGTGGCCG CGATCACGCC GTTGACCTTC GACGAAGCCT ATTACTGGAC CTGGTCGAAG AACCTGGCCG GCGGCTATTA CGATCATCCG CCGATGGTCG CGCTGATGAT CCGGCTCAGC ACGCTGATCG CCGGCGACAG TGAATTCGGC GTCCGCTGGC TCTCGGTGCT GCTGGCATTG CCGATGAGCT GGGCGGTGTA TCGCAGCGGC GCCATCCTGT TCGGCTCGGC GCGCGTCGGC GCCACCGCGG CGATCCTGTT CAACACCACG ATGATGGCCT GGCTCGGCAC CATCATGGCG ACGCCGGACG TGCCCTTGAT GCTGGCGTCG AGCCTGCTGC TGTGGTCGCT CGCCAAACTG CTGCAGAGCG GGCGCGGCGT ATGGTGGCTC GCGGCAGGTG CCGCGGTTGG CGCGGCGCTG CTGTCGAAAT ACAACGCGTT GTTCTTCGGT CCCACGCTGC TGATCTGGCT GATCGTGGTT GCGGATCTGC GGCGCTGGTT GCGCTCGCCG TGGCCTTATC TGGGCGGGCT GGTGGCGCTG GCACTGTTCT CTCCGACGCT ATTGTGGAAC GCACAGCACG AATGGGCGTC GTTCCTCAAG CAGTTCGGCC GCGTCGGCGC CGCTGATTTT CGCCCCGGCT TCCTGCTCGG CATGCTGGGC GGCCAGTTCC TGGTGATGAC GCCGGCGGTG GCGATCCTTG GCTGCAGCGG GCTGGTTGCG ATGGCGCGCG GCGCCACCGG ACTACGCGGC GCCGCCGCGC TGCTGCACAT CACGATCTGG GTGGTGGTGG CCTATTTCCT GGTGCACGCG TTGCACGAGG AGGTGCATCC CGACTGGCTG TGTCAGATCT ATCCGGCGAT GGCGATTGCC GGGGCGGTCG CGCTGGAGCG GATGACGTGG CGGTCGCGCT GGCAACGCGT TGTGAATTTC CTCGGCCGCT GGGCGGTGCC GGGCAGCGCG GCGATGGTGG CGCTGATCGT GCTGCAGCTG CACACCGGCG TGCTCAGCGG CTATCGCAAC GAAGAGGGCG TGCGGCTGGT CGGCGTCGGT TTTCGCGTTG CGGCGCACCA GATCGAGGCG ATCCGCGTTC GGCTCGGCGC CAGTTGCATC CTGGCGGCAG ACTACGGCAC TACGAGTTGG CTGATGTTCT ATCTGCCGCC CGGCAGCTGC GTGGCGCAGC ATTTCGAGCG GATCCGCTGG GCCAATGCCA AGGAGCCCGA TGCCGCGCTG CTGAACGGCA AGTTGCTGTT CGTCGGGCGC TCTTCCTATC AGCATTGGCT TCACCCATGG CTGCAGGAAG CATTCGCGAG CGTTGACAGC GTGGCGGAGG TCTCGCGCAT GCGCGGCGCG ACGGTGATTG AAACCTACCG CATCGACCTG CTGGAAGGCG CCAAAGGCGA TATACTCCTC CGGTGGCCGC CGCCGGAGTT GATCCGACGG CGCGGTCTCT GA
|
Protein sequence | MQRLTQPQSE DRGAVRRAVL IVLVLVGLRL VVAAITPLTF DEAYYWTWSK NLAGGYYDHP PMVALMIRLS TLIAGDSEFG VRWLSVLLAL PMSWAVYRSG AILFGSARVG ATAAILFNTT MMAWLGTIMA TPDVPLMLAS SLLLWSLAKL LQSGRGVWWL AAGAAVGAAL LSKYNALFFG PTLLIWLIVV ADLRRWLRSP WPYLGGLVAL ALFSPTLLWN AQHEWASFLK QFGRVGAADF RPGFLLGMLG GQFLVMTPAV AILGCSGLVA MARGATGLRG AAALLHITIW VVVAYFLVHA LHEEVHPDWL CQIYPAMAIA GAVALERMTW RSRWQRVVNF LGRWAVPGSA AMVALIVLQL HTGVLSGYRN EEGVRLVGVG FRVAAHQIEA IRVRLGASCI LAADYGTTSW LMFYLPPGSC VAQHFERIRW ANAKEPDAAL LNGKLLFVGR SSYQHWLHPW LQEAFASVDS VAEVSRMRGA TVIETYRIDL LEGAKGDILL RWPPPELIRR RGL
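The sequence fields can be sanity-checked against the rest of the record with a few lines of code. The sketch below is illustrative only: the helper names (`clean`, `gc_percent`, `looks_like_cds`) are hypothetical, the start codon set is a deliberately partial subset of the initiators allowed by translation table 11, and the demo uses a short made-up fragment rather than the full gene sequence. To check the record itself, paste the Gene sequence value (10-base blocks and all) into the same functions.

```python
# Common start codons under translation table 11 (bacterial).
# Assumption: this is a partial subset chosen for illustration.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence begins
    with a common start codon and ends with a stop codon."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in COMMON_STARTS
        and s[-3:] in STOP_CODONS
    )


# Toy fragment in the record's block layout (not the real gene).
demo = "GTGCAGCGGC TGA"
print(looks_like_cds(demo))
print(round(gc_percent(demo), 1))
```

The gene sequence in this record starts with GTG and ends with TGA. A GTG initiator is translated as methionine, which is why the protein sequence begins with M rather than V.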