Gene Information |
Locus tag | Swit_4009 |
Symbol | |
ID | 5199715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 4404775 |
End bp | 4408935 |
Gene Length | 4161 bp |
Protein Length | 1386 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640583567 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_001264492 |
Protein GI | 148556910 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase; [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGG GATTTCTACG GAGGCGCGCG GCCCGGCGCG CATTTGTGCA AGCTGGCGTC GAGCGTGACG GCGGGCGATG GGCGGGTGCC GCCGTCCATT ATCGCCGCTA TCTCCGCTAC CGTCCCGACG ATTTCGCGAT CTGGGTGCAG CTTGGCCACA TGTTGGGCGA GAGCGGCGAT CCGGCGGGGG CCGACCAGGC CTATCGAACC GCGCATGCGC TGAACCCGGA CGATGCCGAC CTCGCCCTGT GCCGCGGGCA CCTCGCCCGG CGCGGCGGCG ATGTCGAGGC GGCGATCGTC TTCTACCGCC GCAGCTTCGA GATCGACGGC AATGAGCGCG CGGTGAACAT GCTGGCGGAG CTCGATGCCG AGACCGGCGG CGAGTCGGAG ACCGAACAGG CGCCGGTCCA TGGCGGGCGG CTCGACGGCT GGTTCGAGCG ATCGGTCTCC GGCGTCCTGT CGGAGGACGG CGTCGACGAT CCCTGGGTCG TGTTCCGCGC CGGCGACCGG CTGGTGGGAC AGGCGCGGCC GGGCCGCAAC GACGACGGCG CGCTCGAATT CCGCGCCACG CTCGATGTCG AGCTGGGCGA GAATGGCGAG GGGGTCGAGG TGATCGCCCG GCGCATGCCC GACGGCGCCA TGCTCGAACC CGGCCCGTTC TTCCTGTTCC CGCCGTCGCG CCACCCGGCC GACCATGCGG CCGCTTGGTC GGTCCCGGCC GAGATCGTCA AGCCCTACGA CCTGCCGGCT GGGCGCGAGG TCGCGCTGTT CGTGACCCAT TCGCGTTCGG GCAAGATCAA GCCCAACGTC CTGCCCTATG TTCGCGCGCT CAGGCAGGCG GGGCTCGCCG TCTTCCTGGT CGCGACCGTC GACCGGCCCG TCGACCTGCC GGCCGAGCTG CTCGACAGCG TCGACGCGGC GATGGTCCGC CGCAATGCGG GCTACGACTT CGCCGCCTGG GCGCATGCGC TGAAGCTCCA TCCCCGGCTG TACGGGGCCG CGACCCTCTA TCTCGTCAAC GACAGCGTAG TGCCGGCGGC CGACGACGCG CGGATCGCCG CGATTGTCGA CCGGGTCCGC CGCAGCGGCG CCGACCTGAT CGGGCTGACC GAGAGCCACG AGTGGCGCTG GCACGTCCAG AGCTATTTCC TGGCGATCAA GCCGGGCCTG CTCGCGTCGC GCTCGTTCCA CGGCTTCATG GACGACGTCC GGCTGCTGAC CCGCAAGGAT CACGTCATCC GCGCCTATGA GGTGCGGCTG GCCGAGATCG CCGAGGAGGC CGGGCGCTCG GTCGAGATCC TCTTCCCCAG CGCGACCGCG ATCAACCCGA CGCTGTTCGG ATGGCGCGGG CTGCTCGGGC AGGGCATGCC CTTCGTCAAG GTGCTGCTGC TGCGCGGCAC CTTCGACGCG ATCGACACCG ACGGCTGGCG CGACGTGCTC GCCGGTGCGG GCTTCGACGT CGCGCTCGCC GAGGCGACGG TCGCGGCGGG GGCGGAGGAG GGACCGGTCG ACGATGGCGG CCGGCTGCTG GCCCGTCGCA TGCCGGCCGG GCTGCGCGCC GACCGGCCGC TCAAGGTCGC CTTCTTCGGG CCCTGGAACT ACGACAACGG CCTCGGCCAT GCCAGCCGCG GGATCATCGC GGCGATCCGG CGCACCGGCG TCCTGCTCAA CCTCCATCCG GTCAAGCGGC CCTTCCATAT CCACAAGCCG CTGGTCCCGC CCACCGACAT CCTCGACTTC GAAGGTCCGG CCGACGTCGC GATCGTCCAT CTCAATCCCG ACAGCTGGCA CCTGCTGACC 
GACGACCAGC GCGAGACGAT CGCCCGGGCG AAGCGTCGCA TCGGCTATTG GGTCTGGGAG ATGGGCCATA TCCCGCCGGC CTGGCGCCGC AACTTCGGCG CGGTCGACCG GATCTGGGCG CCGAGCCGCT ATTGCGCCGA GCTGTTCGCC GCGCAGGGCG GCGTGCCGGT CGACGTGGTC CCGCACGCCG TGCCGGTGGG CGAGCCGGCG ACGGTCGACC GGGCCGGTGC GCTCGCCCGG CTCGGCCTGC CCGCCGATCG CCGGGTCATC CTCTATGTGT TCGACGGGTC GAGCTATCTG GTCCGCAAGA ACCCGGCCGC GCTGGTCCGC GCCTTCTCGG CCTCCGGCCT CGCCGCGCGC GGCTGGTCGC TGCTGCTCAA GACCAAGCAT CTCCAGGACC GGCCCGAGGA AGGCGAGGCG TTCCGCGCCC TCGCCGAGGG GACCGAGGGC GTGGTGCTGG TTGACCGGGC GATGGCGGCG GAGGAACTGG CCGAGCTGAC GGCGCTCGCC GATCTCTACG CCTCGCCGCA TTGCTCCGAG GGGTTCGGCC TGACGATCGC CGAGGCGATG GCGGCGGGCA AGCCGGTGGT CGCGACCGAC TTCGGCGGCA GTCGCGACTT CCTCGACGCA TCGGTCGGCT GGCCGGTCAA GGCGCATCCC TGGCGGCTCG AGCAGGATTT CGGCCATTAT ACCGAAGGCG GCGACTGGGC GCGGATCGAC GAGCCGGCGC TCGCCGCCAC GCTGGCGTGC GCGGCCGACG CGATCGAGGC TGGCGGCGAC GGCAAGGGCA AGGCCGCCCG CGACCGGATC GCCGCGCAAC TCTCCTACGA CGCGGTGGCC GAACGGGTCG CGGCCAGCCT CGCGGCGCTG CGCGAGATGC CGGTGGGCGG CGGTGGCGTC CATGTCGAGC CCAATCTCCT GGCCGGCATG CCGGTCGAGC AGGCGATCCT CGGCCCGGGG CTGAGCCTTG TCGCCCTGGC CCCCGACGGC GCGCTCGATA CGCCGCTGCC CGAGGACCTG CCGACCGACC GCGCCGCGTG GGTCATCCTC GCCCCGCGCG GCGCTATCCT CGCGCCGATG ATCGAGCGCG ACTGGCGCCA GGCGGCCGAG GCCCGTCCCG ATGTCGGCAT CTTCTACGGC GACGATTTCG CGGCAGGGGC CGAGCGCGGG ATCGACCAGC TCCGGCTGAA GCCGGCCTTC GACCTGACCC TGCTCGCCGC GCAGGACTAT ATCGGCGCGC CGCTGATCGT CCGCGCCTCG GTGCTTGCCG ACCTCGGTCT GCGCGCGGGG ATGGGCACGG CGCTGCTCGA CGATCTGCTG CTGCGCGCGT ACCATGCCGG CGTCTCGATC GAGCGGATCG CCAGGGTGCT GCTGCTCCAT CCGGGGCCGC GCCCGCAGGC CGACCCGGTG GTGCGACAGG CGATGCTCGC GGCGCAGCCG CGCTTCGCCC ACCATCGCTT CCGCCCCGGC CGCGCGCCGG GCACGCTGGC GCAGGAACGG CGGTTCGGCC GCGATCATCC GCCGGTGTCG CTGCTCATCC CGACCCGGCG GAGCAAGCTG CCCGGCGGGC GCACCGCCTA TGTCGAGCGG CTGCTCAAGG CGCTCGTCGC CACCGACTGG CCGATGGACC GGCTGACCGT GATCGTCGGC GACGACGTCG CGGGCGAGCC GGCCTGGGCG AAGCGCGCCT GGCCCTTCGC GCTGCGCCGG ATCGAGACGC CGCGCCGCGA CGATGAGCCG TTCAACTATG CCGCGAAGAT GAACCGGCTG TGGCGCGCCG CCGGGAGCGA GCAGATCGTG ATGATGAACG 
ACGATCTGCT TCCCACCGAA CCCGGCTGGC TGAGGGCGCT GATCGGCTTC ACGATGGACG AGGGCGTCGG CGGCGCCGGC GGGCGCCTGC TCTATGAGGA CGGCCGGCTC CAGCATGCCG GCATTGGCCC GCTGTTCGGC GCGATCGCCC ATGTCTGGGC GGGCCGGGAG ACGGTCGCGG GCAGCTATCA GGACTGGGCG CTGGTCCAGC GCGAATGGTC GGCGGTGACG GGCGCGCTGT TCGCGACGCG GCGCTCGCTG ATGGAGGAGG TCGGCGGCTT CGACGAGCAG TTCACGCTCG AATATAACGA CATCGACCTC TGCCTGCGGC TGCGGGCGCT GGGCTATCGG ATCGTCTGCA CGCCGGAGGC GGAGATGGTC CATGCCGAGC GCGCCTCGCG CGGCGACATG CCCGCGGGCG GGGACCAATA TGCCGCCTTC CACCAACGCT GGAAGCACTG GCTCGACGAC GATCCCTCCT GGCATCCCGG CCTCAGCCGC GACAGTTTCG AGCTGATGCC GATCGACGAT CCCCGCGCCT GGTATCGGTG A
|
Protein sequence | MTGGFLRRRA ARRAFVQAGV ERDGGRWAGA AVHYRRYLRY RPDDFAIWVQ LGHMLGESGD PAGADQAYRT AHALNPDDAD LALCRGHLAR RGGDVEAAIV FYRRSFEIDG NERAVNMLAE LDAETGGESE TEQAPVHGGR LDGWFERSVS GVLSEDGVDD PWVVFRAGDR LVGQARPGRN DDGALEFRAT LDVELGENGE GVEVIARRMP DGAMLEPGPF FLFPPSRHPA DHAAAWSVPA EIVKPYDLPA GREVALFVTH SRSGKIKPNV LPYVRALRQA GLAVFLVATV DRPVDLPAEL LDSVDAAMVR RNAGYDFAAW AHALKLHPRL YGAATLYLVN DSVVPAADDA RIAAIVDRVR RSGADLIGLT ESHEWRWHVQ SYFLAIKPGL LASRSFHGFM DDVRLLTRKD HVIRAYEVRL AEIAEEAGRS VEILFPSATA INPTLFGWRG LLGQGMPFVK VLLLRGTFDA IDTDGWRDVL AGAGFDVALA EATVAAGAEE GPVDDGGRLL ARRMPAGLRA DRPLKVAFFG PWNYDNGLGH ASRGIIAAIR RTGVLLNLHP VKRPFHIHKP LVPPTDILDF EGPADVAIVH LNPDSWHLLT DDQRETIARA KRRIGYWVWE MGHIPPAWRR NFGAVDRIWA PSRYCAELFA AQGGVPVDVV PHAVPVGEPA TVDRAGALAR LGLPADRRVI LYVFDGSSYL VRKNPAALVR AFSASGLAAR GWSLLLKTKH LQDRPEEGEA FRALAEGTEG VVLVDRAMAA EELAELTALA DLYASPHCSE GFGLTIAEAM AAGKPVVATD FGGSRDFLDA SVGWPVKAHP WRLEQDFGHY TEGGDWARID EPALAATLAC AADAIEAGGD GKGKAARDRI AAQLSYDAVA ERVAASLAAL REMPVGGGGV HVEPNLLAGM PVEQAILGPG LSLVALAPDG ALDTPLPEDL PTDRAAWVIL APRGAILAPM IERDWRQAAE ARPDVGIFYG DDFAAGAERG IDQLRLKPAF DLTLLAAQDY IGAPLIVRAS VLADLGLRAG MGTALLDDLL LRAYHAGVSI ERIARVLLLH PGPRPQADPV VRQAMLAAQP RFAHHRFRPG RAPGTLAQER RFGRDHPPVS LLIPTRRSKL PGGRTAYVER LLKALVATDW PMDRLTVIVG DDVAGEPAWA KRAWPFALRR IETPRRDDEP FNYAAKMNRL WRAAGSEQIV MMNDDLLPTE PGWLRALIGF TMDEGVGGAG GRLLYEDGRL QHAGIGPLFG AIAHVWAGRE TVAGSYQDWA LVQREWSAVT GALFATRRSL MEEVGGFDEQ FTLEYNDIDL CLRLRALGYR IVCTPEAEMV HAERASRGDM PAGGDQYAAF HQRWKHWLDD DPSWHPGLSR DSFELMPIDD PRAWYR
|
| |
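The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon (consistent with the gene sequence ending in TGA and the gene being unspliced). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table of this record.
START_BP = 4404775
END_BP = 4408935
GENE_LENGTH_BP = 4161
PROTEIN_LENGTH_AA = 1386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which is not translated.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length matches:", span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4408935 - 4404775 + 1 gives 4161 bp, and 4161 / 3 - 1 gives 1386 aa, so the reported values agree with each other.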