Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1209 |
Symbol | |
ID | 4021685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1366541 |
End bp | 1368127 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637961401 |
Product | hypothetical protein |
Protein accession | YP_568348 |
Protein GI | 91975689 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.688773 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGG CCAAGCTGGT TGCCTGTGGC GGCGGCGGGC TCTATCACGG GCCAACCGAA CTCCGCTGGC CCCTCATGCG CTTTACCTCC CTGATCATCG AGCTGATCCG CGCCCGGCCG CGGCTGATGT TCTGGCTGGT GGTGTCGGCG CAGGCGCTGC TCTGGGTGCT GGTGCCGCTG CTGGTCTATT CCAGTCCACC CGAAGGCGTC GCCACGGTCC TCGCCTATGG CCGCGAGTAT CAGGTCGGCA GCGACCTCGG GCCGCCGCTG GCGTTCTGGC TCGCCGATAT CGCGTTTCGC GCCGCGGGCG GCCATATGGT GGGCGTCTAT CTGCTGGCGC AGGTCTGTTT CATCATCACC TTCTACGGGC TGTTTCAGCT CGCGCGCAGC ATGGTCGGCC CGCAGCACGC GGTGATCGCC GTGCTGCTGA CGTCGACGGT GACCGCCTTT GCGGAGCAGG GCGCCGAATT CGGTCCGCTG GTCCTGGCGC GGCCGCTCTG GGCGCTGGTG TTGTGGCATA GCTGGGAAAT CATCGGCCGC GGCCGGCGCA GCGCCTGGTT CGCGCTGTCG ATTGAGGTCG GGCTGTTGCT GCTGACCACG GTGGCTGCGC CGGCCCTGTT GCTGCTGCCG ATCGGCTTCG CGCTGTCGAC CGCGCGCAGC CGGCGCGCTT TGATGTCGCT GGATCCGATG TTCAGCCTGC TGGTGGTCGC AGTGCTGGTG CTGCCCTATG GGATCTGGCT GCTACGCGCC GACATTTTCG CGCTGCCGTC GCTGCCCGCG TTGGGCGATC TCGGCGATCG TGCGCTGCTC GGCGTCGAGC TGTTCGGCGG CCTGGTGGTC GCAATCGGCG GGATGGCGCT GCTGGTGCTG CTCAACACCA GCCGTTTTGA TCCGAGGCCG GACGACGCGC CGGTCGTGTA TCGCGCGCCG GTCGATCCGC TGGCGCGGCA GTTCGTGTAC TTCTTCGCAC TCGCGCCGGC GCTCCTCGGC GCCATCGTCG CCGGCCTGTT CGGGCTGAAG CACGTCATTG GCGGGGCGGG GATCGCTCTG CTGATGGTCG GACTCGCGGT GGTGATCGCG ACCGGCGACC TCATCCATCT GCGCCGGCAG CGCCTGCTGC GCGCGGCGTG GGCCGCGCTG GTGGCGGCAC CCGCGTTGGT GGTGATTGTG GCGTCCGTGG TTCAGCCCTG GGTCAGCCAG ACCGAACTCG CCACATCGCT GCCCGCCAAG GACATCGCCC GCTATTTCGG CGACAGCTTC GAGCGCCGCA CCGGCCGGCC GCTATCGGCG GTGGCGGGCG ATCCAGAGCT TGCCGGGCTG ATCGCGATGG GCGCGTCGCG GCCGCATCTG TTTCTCGACG CGACGCCGTC GCGCACGCCC TGGGTGACGC CGGCGACGTT CAACGAGCGC GGCGGCGTGG TGGTGTGGCG CGCCGCCGAT ACCGCGGGCA GGCCGCCGCC GGAACTCGCC ATGCGGTTTC CCGATATCGT GCCCGAGCTT CCGCGCGCGT TCGAGCGGAT GATCGCCGGG CGTCAGCCCT TGCTGCGGAT CGGTTGGGCG ATCGTGCGGC CGAAAGCGGC GCCTTAA
|
Protein sequence | MSAAKLVACG GGGLYHGPTE LRWPLMRFTS LIIELIRARP RLMFWLVVSA QALLWVLVPL LVYSSPPEGV ATVLAYGREY QVGSDLGPPL AFWLADIAFR AAGGHMVGVY LLAQVCFIIT FYGLFQLARS MVGPQHAVIA VLLTSTVTAF AEQGAEFGPL VLARPLWALV LWHSWEIIGR GRRSAWFALS IEVGLLLLTT VAAPALLLLP IGFALSTARS RRALMSLDPM FSLLVVAVLV LPYGIWLLRA DIFALPSLPA LGDLGDRALL GVELFGGLVV AIGGMALLVL LNTSRFDPRP DDAPVVYRAP VDPLARQFVY FFALAPALLG AIVAGLFGLK HVIGGAGIAL LMVGLAVVIA TGDLIHLRRQ RLLRAAWAAL VAAPALVVIV ASVVQPWVSQ TELATSLPAK DIARYFGDSF ERRTGRPLSA VAGDPELAGL IAMGASRPHL FLDATPSRTP WVTPATFNER GGVVVWRAAD TAGRPPPELA MRFPDIVPEL PRAFERMIAG RQPLLRIGWA IVRPKAAP
|
| |