Gene Dshi_3337 details


Gene Information

Locus tag: Dshi_3337
Symbol:
ID: 5712395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3503472
End bp: 3504803
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 65%
IMG OID: 641269266
Product: major facilitator superfamily (MFS) transporter
Protein accession: YP_001534671
Protein GI: 159045877
COG category: [R] General function prediction only
COG ID: [COG2270] Permeases of the major facilitator superfamily
TIGRFAM ID:
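
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and used for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3503472, 3504803)
print(length_bp)                  # 1332
print(protein_length(length_bp))  # 443
```

Both results match the Gene Length and Protein Length fields of this record.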


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCGG CGAGAAAACG CATAGCGGGC TGGATGATGT TCGATTGGGC CAGCCAGCCC 
TACAACACGC TGCTTCTGAC CTTCATCTTC AGTCCCTATT TCGCGACCGT GGTCGGCGAC
CCGGTCGCGG CGCAGGCGAT GTGGGGCTAC ATGCTGACGG CGACGGGGTT GACCATCGCG
GTGCTGGCCC CGGTCCTGGG CGCGCTGGCG GACCAGGCCG GGCGGCGGAT GCCGTGGATC
CTCGCGTTCT CGGTGCTTTA CCTCGTGGGT GCGAGCATGT TGTGGATCGC GGTACCGGGG
GCGGAGGCGG TGGTACTGAT CCTGTTCTGT TTCGGGCTTG GCCTGATCGG GATGGAGTTC
GCCACGATTT TCACCAATGC GATGTTGCCG GATCTGGGAC CGAAGGCGGA GTTGGGACGC
ATTTCGGGCA CCGGCTGGGC CGTGGGCTAT GCCGGTGGAG TCGTCGCGCT GATCCTGATG
CTGCTGTTTT TCGCGGAAAA CGAGGCGGGG GTGACCTTGC TGGGTATCGC GCCGGTCTTC
GGGCTGGACC CCGAGATGCG AGAGGGGACG CGCAGTGTCG GGCCCTTCGT GGCGCTGTGG
TTCGTGGTCT TCATGATCCC GTTCTTCCTG TGGGTGCGGG AAACACCGCC CGTGCCGCCG
CGCCGGACGG ACCTGCGTGC CGGGCTGAAG GGGTTGGCGG ACACCTTACG GCGGCTGCCG
GGGCAGCGGA GCCTCGCGGC CTATCTCGCG TCGTCGATGT TCTACCGCGA TGCGTTGAAC
GGAATGTACA CCTTCGGCGG GATCTATGCG CTGGGCGTTC TGGAATGGAG CGTGATCGAC
ATCGGGATCT TCGGGATCAT GGCGGCGATC ACGGGCGCGG TTTTTGCCTA TATCGGCGGG
TTCGCGGACC GCGCCTTTGG ACCCAAGCCG GTGATCGCGG TCTGCATCGT CATCCTGACG
GGCGTCGGGA TCACTATCGT GTCAGTGTCG CGCGAGGCGG TGTTCGGGAT GCCCGTGGCG
CCGGACAGCA CCTTGCCGGA CACGATTTTC TACATCTGCG GGGCGTTGAT CGGCGCGGCG
GGCGGGGTGT TGCAGGCCGC AAGCCGGACC ATGATGGTGC GCCAGGCCAG CCGGGGGCGG
ATGACGGAGG CTTTCGGGCT TTACGCCCTT GCAGGCAAGG CGACCTCGTT CCTGGCGCCG
CTGACCATCG CGATTGCCAC CGATCTCAGC GGGACGCAAA GCGCGGGGCT CATTCCGCTG
ATTGCCCTCT TCCTCTGTGG TTTGGGTCTG CTAAGGTTCG TGCATCCGGA CCCTGAGACG
AGCAGCCCAT GA
 
Protein sequence
MEAARKRIAG WMMFDWASQP YNTLLLTFIF SPYFATVVGD PVAAQAMWGY MLTATGLTIA 
VLAPVLGALA DQAGRRMPWI LAFSVLYLVG ASMLWIAVPG AEAVVLILFC FGLGLIGMEF
ATIFTNAMLP DLGPKAELGR ISGTGWAVGY AGGVVALILM LLFFAENEAG VTLLGIAPVF
GLDPEMREGT RSVGPFVALW FVVFMIPFFL WVRETPPVPP RRTDLRAGLK GLADTLRRLP
GQRSLAAYLA SSMFYRDALN GMYTFGGIYA LGVLEWSVID IGIFGIMAAI TGAVFAYIGG
FADRAFGPKP VIAVCIVILT GVGITIVSVS REAVFGMPVA PDSTLPDTIF YICGALIGAA
GGVLQAASRT MMVRQASRGR MTEAFGLYAL AGKATSFLAP LTIAIATDLS GTQSAGLIPL
IALFLCGLGL LRFVHPDPET SSP