Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_5006 |
Symbol | |
ID | 5705461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5674650 |
End bp | 5676212 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274399 |
Product | undecaprenyl-phosphate galactose phosphotransferase |
Protein accession | YP_001539740 |
Protein GI | 159040487 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0572033 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000714928 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGATGAGG TGACGACAAG CCTCCAGCGC CCGGTAACCA ACGGAGGCCG GAGCAGAACC GTGCGGCACG TCGACAGCTT CGAGATCCAG CCGCCGGCAC CGCCGTCTCA CAACGGCGTA CCCCGGTCGG CCTGGGCCCG CAGCCGACGC CGGGTCTCCC GTTGGCACCG CCCCTACACA GCGATCTTGG TTCTACTGGA CTTCGCCGCG GCCGCCTTCG CGAGCTTCAC CGCGATCCGC GCCTTCGACC AGGCGCGGGC CGGGTTCTAC AACGACCCGA CCTGGTTCTA CACGGTGGCG TGTGTGCTGC TGCCGCTCGG GTGGGTGGCG ATCCTCTGGT TCAACGGCTC CTACGACCGA CGGTACCTGG GCCTCGGCCC GGACGAGTTC AAGCGGGTCC TCCGCGCCGG AGTGGCGGTC TGTGCGACGG TCTCCTTCCT CGCGTTCGCC ACCAAGACGG ACCTCTCCCG GTACACCGTC GGCACCGCCC TACTCCTCGC CCTGCTGCTG ATCCTGCTGG TCCGGATCCT GGCCCGGTTC GTGCTGCACG TGATCCGGCG CAACGTCGGC CGGGCCGGGC ACCGCATGGT CCTGGTGGGC ACACTGCCCG AATGTCTGGA GGTCTACCGG CAGGTCACCC GCAGCCCGGC CGCCGGCCTG GTGCCGGTCG CGATCCACAT CACCGACGGC TACGCGGCTG CCCGAGGCAT GGAGACACCG GTCCCGGTCT ACACCGGGCG CGACATCCTC GCCCTGGTCC GTGAGGTCGG CGGCGACACC ATCGCGGTCT GCGGGTCGGC CAGCGCCGAG CCCGGCGAGC TGCGCCGCAT GGCCTGGCAG CTGGAAGGTT CCGGGGTCGA CCTGGCGGTG GCCCCACAGC TGACCGACAT CGCCGGTCCA CGGGTGCACA TCCGGCCGAT CGAGGGTCTG CCGCTGCTGC ACGTGGAGGA GCCGACCCTC TCCGGTCCGG CGCTGCTGGC AAAGAACCTA CTCGACCGGA TGGCCGCCGG CCTCGGTCTG CTGATGCTGT TGCCGATGTT CCTCGCGATC GCGGTCGCGA TCCGAATCTC CGACCCCGGT CCGGTCTTCT TCCGGCAGCC CCGGGTGGGC CACGAGGGCC GGACGTTCCG GGTCTGGAAG TTCCGCACCA TGTACGTCGA CGCCGAGGAG CGGCTGGCCA GCCTGGTTGA CCAGAACGAG ACCGACGGCA TGCTGTTCAA AATGAAGCAG GACCCTCGGG TCTTCCCGGT GGGCCGCTTC CTGCGGGCCT CGTCGCTGGA CGAGCTACCC CAGCTGATCA ACGTGCTGAA GGGTGAGATG TCGCTGGTCG GGCCCCGTCC GCTGCCCGCC GACGACGGTG ACTTCCTCGG CGACGTACGT CGACGACTGC TGGTCAAACC CGGCATTACC GGCCTCTGGC AGGTCTCCGG CCGCTCCGAC CTGTCCTGGG ACGAGGCGGT CCGCCTCGAC CTCTACTACG TCGACAACTG GTCGCTGGCG TACGACCTGA GCATCCTGTG GCGCACAGTC GGGGTGGTAC TGGCCCGCAA GGGGGCGTAC TAA
|
Protein sequence | MDEVTTSLQR PVTNGGRSRT VRHVDSFEIQ PPAPPSHNGV PRSAWARSRR RVSRWHRPYT AILVLLDFAA AAFASFTAIR AFDQARAGFY NDPTWFYTVA CVLLPLGWVA ILWFNGSYDR RYLGLGPDEF KRVLRAGVAV CATVSFLAFA TKTDLSRYTV GTALLLALLL ILLVRILARF VLHVIRRNVG RAGHRMVLVG TLPECLEVYR QVTRSPAAGL VPVAIHITDG YAAARGMETP VPVYTGRDIL ALVREVGGDT IAVCGSASAE PGELRRMAWQ LEGSGVDLAV APQLTDIAGP RVHIRPIEGL PLLHVEEPTL SGPALLAKNL LDRMAAGLGL LMLLPMFLAI AVAIRISDPG PVFFRQPRVG HEGRTFRVWK FRTMYVDAEE RLASLVDQNE TDGMLFKMKQ DPRVFPVGRF LRASSLDELP QLINVLKGEM SLVGPRPLPA DDGDFLGDVR RRLLVKPGIT GLWQVSGRSD LSWDEAVRLD LYYVDNWSLA YDLSILWRTV GVVLARKGAY
|
| |