Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_4414 |
Symbol | |
ID | 5196803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 4863512 |
End bp | 4865185 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640583966 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_001264890 |
Protein GI | 148557308 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0612118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.91404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCCG ACCTGCAATC GGCCCCGCGA TCGCGGCCGG CGTCACCGTT CGGCGTTCCC GTTTTCCGGT CGATCTGGAT CGCGTCGCTG GTGTCGAATT TCGGGGCGAT GATCCAGTCG GTCGGCGCCG CCTGGATGAT GACGTCGCTG ACCTCGTCCC CGAAGATGGT CGCGCTGGTC CAGGCCTCGA CGGTGCTGCC GTTCATGCTG CTGGCGCTGT GGGCCGGGGC GGTCGCCGAC AATCTCGACC GGCGCAAGGT GATGCTGGCG GCGCAGAGCT TCATGCTGTG CGTCTCGGCC GTGTTGGCGC TGTTCGCCTG GCAGGGCTGG CTGACGCCCT GGCTGCTGCT GAGCTTCACC TTCCTGATCG GTTGCGGGAC GACGATAGGC GGACCGGCCT GGCAGGCCTC GGTCGGCGAC ATCGTATCGC GCGAGCAACT GCCGTCCGCA GTTGCGCTCA ACTCGATGGG GTTCAACACC GCGCGGACCG CCGGACCGGC GGTCGGCGGC GCGGTCGTGG CGGCGGCGGG GGCTGCGGCG GCGTTCCTGG CCAATACCCT GTCCTATATC GGGCTGATCG TGGTGCTGCT GCGCTGGCGG CGGCCGCAGG CGCCCCGGCT GTTGCCGCGC GAAGGCCTGT TCATGGCGAT GGGGGCGGGC CTGCGTTATG TATCGATGTC GCCCAACCTG CGGATGGCCG TCAGCCGCGC GATGGCGTTC GGCCTTGCGG CCAATGCGGT TTCGGCGCTG ATGCCGCTGG TCGCCCGCGA CCTGGTGAAG GGCGGGGCGC TGACCTACGG CCTGCTGCTG GGCGCGTTCG GCGTCGGCGC GGTGCTCGGC GGCCTCTCGT CGGGGCCGGC GCGGGACCGG CTGTCGACCG AGCAGATCGT CCGGATGGCC ACGCTCATGC TGGCGGTCGG CACCGCGATC ACCGCGATCA GCCCCTTCTT CCTGCTGACG ATCGCCGCGC TGATGCTGGC GGGCTTCAGC TGGGTGCTGG CCCTGTCGAC CTTCAACATC AGCGTCCAGC TCGCTTCGCC GCGCTGGGTC GTCGCGCGCG CCCTGTCGGT CTACCAGATG GCGGCGTTCG GCGGCATGGC GATCGGCGCC TGGGTGCTGG GCATGATCGC CGACAGCCAT GGCGTCGCCG CCGGCCTGCT GGTCAGCGCG GCCTTCCTGG CGAGCACCGT CCTGATCGGC CTGGTCATGC CGCTGCCGCA GGTCGACGAC CTCAATCTGA CGCCGCTCAA GCAATGGCAG GAGCCGGAGG TGGCGGTCCC GCTCGAACCG CGCAGCGGCC CGGTGGTGGT GACGATCGAA TATCGGATCG AGCCGCACAA CATCGTCGCC TTCCTCACCG CGATGACCGA GCGGCGCCGG ATCCGGCGGC GGGACGGCGC CCATGGCTGG ACCCTGCTGC GCGACCTCAA CGAGCCCGAG CTGTGGATCG AGCGCTACCA CGTCGCGACC TGGCACGACT ATATCCGGCA TAATCAGCGT CGCACCCATG CCGATGCCGA GAACAGCGCC GAGGTGCACC AGCTCCAGAA GGAAGGCGTG CCGCTGCGCG TCCACCGGAT GATCGAGCGG CAGACCGGCT CGCTGCCGAG CGCCCGCCGC CATGAACCGG TGACCGTCGA CGCACAGATG AACGATCCGA CACGCTCGGC GTAG
|
Protein sequence | MASDLQSAPR SRPASPFGVP VFRSIWIASL VSNFGAMIQS VGAAWMMTSL TSSPKMVALV QASTVLPFML LALWAGAVAD NLDRRKVMLA AQSFMLCVSA VLALFAWQGW LTPWLLLSFT FLIGCGTTIG GPAWQASVGD IVSREQLPSA VALNSMGFNT ARTAGPAVGG AVVAAAGAAA AFLANTLSYI GLIVVLLRWR RPQAPRLLPR EGLFMAMGAG LRYVSMSPNL RMAVSRAMAF GLAANAVSAL MPLVARDLVK GGALTYGLLL GAFGVGAVLG GLSSGPARDR LSTEQIVRMA TLMLAVGTAI TAISPFFLLT IAALMLAGFS WVLALSTFNI SVQLASPRWV VARALSVYQM AAFGGMAIGA WVLGMIADSH GVAAGLLVSA AFLASTVLIG LVMPLPQVDD LNLTPLKQWQ EPEVAVPLEP RSGPVVVTIE YRIEPHNIVA FLTAMTERRR IRRRDGAHGW TLLRDLNEPE LWIERYHVAT WHDYIRHNQR RTHADAENSA EVHQLQKEGV PLRVHRMIER QTGSLPSARR HEPVTVDAQM NDPTRSA
|
| |