Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_1948 |
Symbol | |
ID | 5199062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 2181496 |
End bp | 2183166 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640581493 |
Product | general substrate transporter |
Protein accession | YP_001262446 |
Protein GI | 148554864 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.83327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACG GGTCGGGTAC GGAACTGCCG AAGCACCACA GGGCGACGCA GAACGAGAAG CTGGTGATCG CCGCCTCGTC GCTGGGCACG GTGTTCGAAT GGTATGATTT CTACCTCTAC GGCCTGCTCG CCACCTTCAT CAGCAGCCAG TTCTTCTCGG GCGTCAACGA GACGACCGGC TTCATCCTGG CGCTGGCCGC CTTCGCGGCG GGCTTCGCGG TGCGGCCGTT CGGCGCGCTC GTCTTCGGGC GGATCGGCGA CCTGGTCGGC CGCAAGAACA CCTTCCTGGT GACGATGGCG ATCATGGGCC TGTCGACCTT CGCGGTCGGC CTGCTGCCCT CCTATGCCCA GATCGGCGTC GCGGCGCCCG TCATCCTGGT CGGGCTGCGG CTGCTCCAGG GGCTGGCGCT CGGCGGCGAA TATGGCGGGG CGGCGACCTA TGTCGCCGAA CATGCGCCCA ACAACCGGCG CGGCCTCTAT ACCAGCTGGA TCCAGACCAC CGCGACGCTC GGCCTGTTCG CGGCGCTGCT GGTGGTGATC GGGGTGCGCC GGCTGATGGG TGAGGACGGC TTCGCCGACT GGGGCTGGCG GGTGCCCTTC CTCGTCTCGA TGATCCTGCT GCTGGTGTCG ATGTGGATTC GCCTGCAGCT GGCCGAAAGC CCCGTCTTCC AGAAGATGAA GGACGAGGGC AAGACCTCCA AGGCCCCGCT GACCGAGGCG TTCGGCCGCT GGGGCAACCT GCGCTGGGTG CTGGTCGCGC TGTTCGGCGC GGTCGCGGGC CAGGCGGTGG TCTGGTACAC GGGCCAGTTC TACGCGCTGT TCTTCCTGGA GAAGACACTC AAGGTCGACG GCGCCACCAC CAACGTCCTG ACCGCGATCG CACTCGCCAT CGCGACGCCC GCCTTTGTCT TCTTCGGCTG GCTGTCGGAC CGGATCGGGC GCAAGCCGAT CATCCTGACG GGCTGCGCGC TCGCCGCGAT CGGCTATTTC CCGCTGTTCA GCGCGCTGAC CGTCGCCGCC AACCCCGCCC TGGCCCATGC CCAGGCGATC GCGCCGGTCG CGGTGGTCGC CCATGGCGAC GACTGCTCGG TCCAGTTCGA TCCGATCGGC AAGAACCGGT TCGACAGCCG GAGCTGCGAC ATCGTCAAGG CGTTCCTCGC CAAGGGCGGG GTCAGCTACG CCAATGTCGA GGCACCGGCG GGCACGATCG CGCAGGTGCG GATCGGCGAG CGGACGATCG CCGCGCCCGA CCCCGCCACC GTCTTCGGCG CGGCGCGCAA GGAGGCGATC GCGGCGTTCC AGGAAGAGAC CGGCGCCGCG CTGAAGGCCG CCGGCTACCC CGCTTCGGCC GATCCGGCGG CGATCGACAC AGTGATGGTG GTGGCGATCC TGACCGTGCT CGTGCTGCTG GTGACGATGG TCTACGGCCC GATCGCGGCG CTGCTGGTCG AGCTGTTCCC CAGCCGCATC CGCTATACGT CGATGTCGCT GCCCTACCAT ATCGGCAATG GCTGGTTCGG CGGCTTCCTG CCGACGATCG TCTTCGCGAT GGTCGCGGCG ACCGGGGATA TCTATTACGG GCTCTGGTAT CCGATCATCG TCGCGGCGGC GACAGTCGTG ATCGGGCTGG TCGCGCTGCC GGAAACCGCG CGGCGCGACA TCGACCAGTA A
|
Protein sequence | MTDGSGTELP KHHRATQNEK LVIAASSLGT VFEWYDFYLY GLLATFISSQ FFSGVNETTG FILALAAFAA GFAVRPFGAL VFGRIGDLVG RKNTFLVTMA IMGLSTFAVG LLPSYAQIGV AAPVILVGLR LLQGLALGGE YGGAATYVAE HAPNNRRGLY TSWIQTTATL GLFAALLVVI GVRRLMGEDG FADWGWRVPF LVSMILLLVS MWIRLQLAES PVFQKMKDEG KTSKAPLTEA FGRWGNLRWV LVALFGAVAG QAVVWYTGQF YALFFLEKTL KVDGATTNVL TAIALAIATP AFVFFGWLSD RIGRKPIILT GCALAAIGYF PLFSALTVAA NPALAHAQAI APVAVVAHGD DCSVQFDPIG KNRFDSRSCD IVKAFLAKGG VSYANVEAPA GTIAQVRIGE RTIAAPDPAT VFGAARKEAI AAFQEETGAA LKAAGYPASA DPAAIDTVMV VAILTVLVLL VTMVYGPIAA LLVELFPSRI RYTSMSLPYH IGNGWFGGFL PTIVFAMVAA TGDIYYGLWY PIIVAAATVV IGLVALPETA RRDIDQ
|
| |