Gene Information |
Locus tag | Dshi_3337 |
Symbol | |
ID | 5712395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 3503472 |
End bp | 3504803 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641269266 |
Product | major facilitator superfamily (MFS) transporter |
Protein accession | YP_001534671 |
Protein GI | 159045877 |
COG category | [R] General function prediction only |
COG ID | [COG2270] Permeases of the major facilitator superfamily |
TIGRFAM ID | |
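The length fields above are consistent with the listed coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the Gene Information table for Dshi_3337.
length = gene_length_bp(3503472, 3504803)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields in the table.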
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGG CGAGAAAACG CATAGCGGGC TGGATGATGT TCGATTGGGC CAGCCAGCCC TACAACACGC TGCTTCTGAC CTTCATCTTC AGTCCCTATT TCGCGACCGT GGTCGGCGAC CCGGTCGCGG CGCAGGCGAT GTGGGGCTAC ATGCTGACGG CGACGGGGTT GACCATCGCG GTGCTGGCCC CGGTCCTGGG CGCGCTGGCG GACCAGGCCG GGCGGCGGAT GCCGTGGATC CTCGCGTTCT CGGTGCTTTA CCTCGTGGGT GCGAGCATGT TGTGGATCGC GGTACCGGGG GCGGAGGCGG TGGTACTGAT CCTGTTCTGT TTCGGGCTTG GCCTGATCGG GATGGAGTTC GCCACGATTT TCACCAATGC GATGTTGCCG GATCTGGGAC CGAAGGCGGA GTTGGGACGC ATTTCGGGCA CCGGCTGGGC CGTGGGCTAT GCCGGTGGAG TCGTCGCGCT GATCCTGATG CTGCTGTTTT TCGCGGAAAA CGAGGCGGGG GTGACCTTGC TGGGTATCGC GCCGGTCTTC GGGCTGGACC CCGAGATGCG AGAGGGGACG CGCAGTGTCG GGCCCTTCGT GGCGCTGTGG TTCGTGGTCT TCATGATCCC GTTCTTCCTG TGGGTGCGGG AAACACCGCC CGTGCCGCCG CGCCGGACGG ACCTGCGTGC CGGGCTGAAG GGGTTGGCGG ACACCTTACG GCGGCTGCCG GGGCAGCGGA GCCTCGCGGC CTATCTCGCG TCGTCGATGT TCTACCGCGA TGCGTTGAAC GGAATGTACA CCTTCGGCGG GATCTATGCG CTGGGCGTTC TGGAATGGAG CGTGATCGAC ATCGGGATCT TCGGGATCAT GGCGGCGATC ACGGGCGCGG TTTTTGCCTA TATCGGCGGG TTCGCGGACC GCGCCTTTGG ACCCAAGCCG GTGATCGCGG TCTGCATCGT CATCCTGACG GGCGTCGGGA TCACTATCGT GTCAGTGTCG CGCGAGGCGG TGTTCGGGAT GCCCGTGGCG CCGGACAGCA CCTTGCCGGA CACGATTTTC TACATCTGCG GGGCGTTGAT CGGCGCGGCG GGCGGGGTGT TGCAGGCCGC AAGCCGGACC ATGATGGTGC GCCAGGCCAG CCGGGGGCGG ATGACGGAGG CTTTCGGGCT TTACGCCCTT GCAGGCAAGG CGACCTCGTT CCTGGCGCCG CTGACCATCG CGATTGCCAC CGATCTCAGC GGGACGCAAA GCGCGGGGCT CATTCCGCTG ATTGCCCTCT TCCTCTGTGG TTTGGGTCTG CTAAGGTTCG TGCATCCGGA CCCTGAGACG AGCAGCCCAT GA
|
Protein sequence | MEAARKRIAG WMMFDWASQP YNTLLLTFIF SPYFATVVGD PVAAQAMWGY MLTATGLTIA VLAPVLGALA DQAGRRMPWI LAFSVLYLVG ASMLWIAVPG AEAVVLILFC FGLGLIGMEF ATIFTNAMLP DLGPKAELGR ISGTGWAVGY AGGVVALILM LLFFAENEAG VTLLGIAPVF GLDPEMREGT RSVGPFVALW FVVFMIPFFL WVRETPPVPP RRTDLRAGLK GLADTLRRLP GQRSLAAYLA SSMFYRDALN GMYTFGGIYA LGVLEWSVID IGIFGIMAAI TGAVFAYIGG FADRAFGPKP VIAVCIVILT GVGITIVSVS REAVFGMPVA PDSTLPDTIF YICGALIGAA GGVLQAASRT MMVRQASRGR MTEAFGLYAL AGKATSFLAP LTIAIATDLS GTQSAGLIPL IALFLCGLGL LRFVHPDPET SSP
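The sequences above are displayed in space-separated blocks of ten letters, so they need cleaning before any computation such as the GC content reported in the table. The following Python sketch shows one way to do that; the helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a small demo input.
demo = "ATGGAAGCGG CGAGAAAACG"
print(len(clean_sequence(demo)))  # 20
print(gc_fraction(demo))          # 0.55
```

Passing the full Gene sequence field to `gc_fraction` would give the value to compare against the GC content field.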