Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_4675 |
Symbol | |
ID | 5200890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 5142197 |
End bp | 5143495 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640584229 |
Product | general substrate transporter |
Protein accession | YP_001265150 |
Protein GI | 148557568 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0145765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.795521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGCCG CCGCCCGCCC CCGCAACCCG ATCCGCTCGG TAATCGGCGG GTCGCTCGGC AATCTCGTCG AGTGGTACGA CTGGTACATC TATTCGGCCT TCTCGCTCTA TTTCGCGAAG AGCTTCTTCC CGGAGGCCGA TCCCACCGCG CAGCTGCTGT CGACCAGCAT CGTCTTCGCG ATCGGCTTCC TGATGCGCCC GCTCGGCGGC TGGCTGCTCG GTATCCTGGC CGACCGCTAC GGGCGCAAGA CCGCGCTGAC CTGGTCGATC ACGCTGATGT GCGCGGGGTC GCTGATCATC ACCTGCGCGC CGACCTATGC CTCGATCGGC ATCTGGGCGC CGCTGCTGCT CACCTTCGCG CGCATGGCGC AGGGGCTGAG CCTGGGCGGC GAGTTCGGCA CCGCGGCGAC CTACCTGACC GAGATCGCCC CGCCCGACCG GCGCGGCTTC TGGTCGAGCT TCCAATATGT CACGCTGATC GCGGGCCAGC TCCTCGCGCT CGGCCTGCTG GTCGTGCTGC AATTCCTGTT CCTCGACGAG GCGCAGCTCG AAGCCTGGGG CTGGCGGCTC GCCTTCGCGA CCGGCGCGGT GCTGGCGATC AGCGTCTTCT GGCTGCGGCG CGGGATCGAC GAGACCCCCG ACTTCATCGA GGAGACGGCG GGCGAGCGGC GCAAGGGCGG CCTCGTCGCC CTGCTGCGCG AGCGGCCCAA GCAGGTGGCG CTGGTGTTCG GCCTGTCGAT CGGCAGCAAT GTCAGCTTCT ACGCCTTCAC CACCTATATG CAGAAATATC TGGTCGCGAG CGCGGGCTTC GCCAAGGACC TGACCTCGCT GATCTGCTCG GGCGCGCTGA TCTTCTACAT CGTCATCCAG CCGATGCTCG GCGCGCTGTC CGACCGGATC GGGCGCAAGC CGCTGCTCTA CTGGTGCTGG ATCGGCGGCA TCGTCGCGAC CGTGCCGCTG TTCACCGCGC TGGGCGCCGC CACCAGCGGG CTGGAGGCGT TCCTGCTGCT CTGCGTCGCC TATCTGATCA TCTCCGGGTC GAGCGCGACC AGCGCGGTGG TGAAGGCCGA GCTGTTCCCG CCGCACGTCC GCGCGCTCGG CGTCGGCCTG CCCTATGCGG TCAGCCAGGC GATCTTCGGC GGCACCGCGG AGTCGGTCGC GCTCGGCTTC AAGGCGGCGG GCGTCGAATC CGCCTTCTTC TGGTACGTCA CCGGCTGCAT GGCGGTGGCG CTGGTGACGA CCTTCTTCGT GCCCGAGACG CGCTGGCCCG GCGGCCGGCG GCCGACGCGC GGCGCCTGA
|
Protein sequence | MSAAARPRNP IRSVIGGSLG NLVEWYDWYI YSAFSLYFAK SFFPEADPTA QLLSTSIVFA IGFLMRPLGG WLLGILADRY GRKTALTWSI TLMCAGSLII TCAPTYASIG IWAPLLLTFA RMAQGLSLGG EFGTAATYLT EIAPPDRRGF WSSFQYVTLI AGQLLALGLL VVLQFLFLDE AQLEAWGWRL AFATGAVLAI SVFWLRRGID ETPDFIEETA GERRKGGLVA LLRERPKQVA LVFGLSIGSN VSFYAFTTYM QKYLVASAGF AKDLTSLICS GALIFYIVIQ PMLGALSDRI GRKPLLYWCW IGGIVATVPL FTALGAATSG LEAFLLLCVA YLIISGSSAT SAVVKAELFP PHVRALGVGL PYAVSQAIFG GTAESVALGF KAAGVESAFF WYVTGCMAVA LVTTFFVPET RWPGGRRPTR GA
|
| |