Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_4032 |
Symbol | |
ID | 5198829 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 4434835 |
End bp | 4436358 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640583589 |
Product | PEP-CTERM locus polysaccharide chain length determinant |
Protein accession | YP_001264514 |
Protein GI | 148556932 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.077437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.946432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCCA TCTACACCGA GATCAGGATT GCCCTGTACG CGATCTGGCG GATGCGCTGG CTGGCGCTCG CCGTCGCCTG GGCCTTCTGC CTGGTCGGCT GGGTGATGGT GATGCGGGTC CCCGCCGCCT ATGAGAGCAG CGCGCGCATC CAGGTCCAGG TCAAGTCGCT GTTCGCCGAC GGGGCCGAGA ACGACATGCA GCGCAATGTC GATCGCGTCC GCCAGTCGCT GACCTCGACC GAGATATTGA AGCGGGTCGT CCGCACCGTC TCCAACGGCG AGCGGCCGCT GACCTCCTAC GAGTCCCTGT CGCTGATCGG CGCGCTGCGC TCGGGCATCT CCATCACCGC GCAGGGCGAC GACCTGCTCG AGATCAAGAC GCGGATCAGC CTGAAGGGCA TTTCCGAGCA GCAGACCGCC GCGATCGCCC GCAACGTCAC CCAGAAGCTG ATCGAGATAT TCATCCAGGA GAATGTGATC GGCAGCAAGG CGAACAACGC CGAGACGCTG AGCTTCCTCG ACCAGGAACT GGCCCGCCGC GCCAAGGAGC TGGCCGAGGT CGACCGCCAG CGCGCGCAGA TCACCCAGAA CACGCTCGGC TCGCTGCCGG GCACGGGCTC GCTCGACCAG CGGATGGACG CCGCGCGCAA CGAGATGGTC AATCTCGATT CGAACCTGAT GCAGGCGCGC AGCGCGCTGG CGGCGATGAA CGGCCAGCTC GCCGCCACCC CGGCGCAGAT ACCGGCGGGC AGCGTCAACG GCATCGACAC GCTCGACCAG CGCATCGGGA CGCTCGAAGG GCAGCTTTCG GAAGCGATCT CGCGCGGCTG GACCGAGAAG CATCCCGACG TCATCGCGAT CCGTTCCCAG CTCAAGCAGC TCCGCGCCGA AAAGGCGCGC GGCGGCAGCC GGTCGGTGCC GATGGCGCCC AACCCCGTCT ATGTTTCGCT CAAGTCGATG CAGGCCGAGC GCGAGGGCAA TGTCGCGGCG CTCCAGGCGC GCAAGTCGCA GCTCGAACAG GCGGTGACGA CGATCGCGCA GCGCCAGCTC ACCGCGCCCG GTGCCCAGAT CGACCAGGCC CGGCTCGACC GCGACTATGA CGTGCTCAAG GACCAGTACA ACAAGCTGCT CGCCCAGCGC GAGTCCGCGA AGCTGCGGTC CGACGCGACC GGCAAGACCG ACGGCGTCCA GTTCCGGGTG ATCGACCAGC CGTCGCTGCC GCGCCAGCCG GCGGCGCCCA ACCGGGCGAT GCTGCTCACC GGCGTGCTGG TCGCCGGCAT CGTGATCGGC GCGGGCGTGG CCTTCGCCAA GAGCCAGCTC GCCAATGTCT ACACGACCAC CCAGCAGCTC GCGAAGGCCA GCGGCCTGCC GGTGCTCGGG TCGATCTCGG AGGTGATCAC CGCCGACACG CGGGTGATCC GGCGCAAGCA GATGATGTGG TTCTCCTCGG CCGCCGCGGG GCTGCCGGGG ATGTTCATGC TGCTGATGCT GATCGAGTTC GTCAAACGCA TCATGGTGTC CTGA
|
Protein sequence | MEAIYTEIRI ALYAIWRMRW LALAVAWAFC LVGWVMVMRV PAAYESSARI QVQVKSLFAD GAENDMQRNV DRVRQSLTST EILKRVVRTV SNGERPLTSY ESLSLIGALR SGISITAQGD DLLEIKTRIS LKGISEQQTA AIARNVTQKL IEIFIQENVI GSKANNAETL SFLDQELARR AKELAEVDRQ RAQITQNTLG SLPGTGSLDQ RMDAARNEMV NLDSNLMQAR SALAAMNGQL AATPAQIPAG SVNGIDTLDQ RIGTLEGQLS EAISRGWTEK HPDVIAIRSQ LKQLRAEKAR GGSRSVPMAP NPVYVSLKSM QAEREGNVAA LQARKSQLEQ AVTTIAQRQL TAPGAQIDQA RLDRDYDVLK DQYNKLLAQR ESAKLRSDAT GKTDGVQFRV IDQPSLPRQP AAPNRAMLLT GVLVAGIVIG AGVAFAKSQL ANVYTTTQQL AKASGLPVLG SISEVITADT RVIRRKQMMW FSSAAAGLPG MFMLLMLIEF VKRIMVS
|
| |