Gene Dshi_2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2354 
Symbol 
ID5714009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2487671 
End bp2488822 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID641268278 
ProductMFS transporter 
Protein accessionYP_001533691 
Protein GI159044897 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.632575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCC GTCTTTATTA CCTGACCCTC GGCAACGTGA TGATCGGCAC CGGCACCATG 
GTGGTGTCGG GCATCCTCGA CCCGATCGCC GCCGATCTCG CGATCAGCGT CTCTGCCGCG
GGCCAGGTCA CGACCGTTTA CGCCCTCGCC TTCGCGCTCG GAGCCCCCGT CGCCGCCGCC
TTTACCGGCC GGTTCGACCG GTCGCGGGTG CTGGCCCTGT CGCTTCTGGT CTTCGCCGCC
GCCAGCCTGC TCAGCGCGGT GGCGCCGAAT TACACGACGT TGCTGATTGC GCGGGGGCTG
GGCGGGCTGG CCGCCGCGAC CTTCTCGCCA ACGGCCGTGG CCGTGGCCGC CTCGCTTGTG
CCCCCGGCGG AACGGGGCCG CGCCATCGCC CTCGTTTTCG GCGGGATGAC CATCGCCAGC
GTGCTGGGCG TGCCCCTCGG CACCTGGATC GGGCTGAATT TCGGCTGGCG CATCCTGCTG
GCCGGGCTCG GGGTGCTCAG CCTCGGCGCG GTGATCGCCC TGTGGCGCGG GGTGCCCGGT
GGGCTCTTCC TGCCCGCGGC CACGCTGGCC CGCTGGGACG AGGCCCTGCG CCTGCCCATC
ATCCGCGCGC TGCTTGGCGT GACGCTGTTC CAGATCGGCG GGAGCTTCGT ACTCTTTGCC
TTCTTCGGCC CCTATCTCGG CACGGTCACG GGGGTCGGCA CGGACGGGAT CGCGAGCTTG
CTGTTTCTCT TCGGGCTCGC CAGCCTGATC GGCAATTTCG CCGCCGGATG GGCCGTGGAC
CGGGTGGGCG CAGGCCCGGT GGCCCATGGC GCGATCGCGT TGACGGCGCT GAGCATGGTC
GCGCTTCTTG GCACCGCGGG GGCGCCGGTG CTGGCGGGTC TGGCGATCAT CCTGTGGGGC
AGTGCCGTGT TCGGCATCAA CACGGCGCAG CAGGCCCGCC TCGTTGACGC CGCGCCGGGG
CTGGCCACGG TCGTGCTGCC CGCCAATTCC TCGATTCTCT TTGCAGGACA GGCGCTGGGG
GCCGCGCTCG GTGGCGCGGT GCTGGCCATG GGCGGGCTCG CGGCCTTGCC GCTGGTGGGC
GCGCTGGTGG TGGGCATCGC CCTCGTTCTG TCCCTGCGTG CCATGCGTCA GACCGGGCGC
GCGCCGGTTT GA
 
Protein sequence
MDARLYYLTL GNVMIGTGTM VVSGILDPIA ADLAISVSAA GQVTTVYALA FALGAPVAAA 
FTGRFDRSRV LALSLLVFAA ASLLSAVAPN YTTLLIARGL GGLAAATFSP TAVAVAASLV
PPAERGRAIA LVFGGMTIAS VLGVPLGTWI GLNFGWRILL AGLGVLSLGA VIALWRGVPG
GLFLPAATLA RWDEALRLPI IRALLGVTLF QIGGSFVLFA FFGPYLGTVT GVGTDGIASL
LFLFGLASLI GNFAAGWAVD RVGAGPVAHG AIALTALSMV ALLGTAGAPV LAGLAIILWG
SAVFGINTAQ QARLVDAAPG LATVVLPANS SILFAGQALG AALGGAVLAM GGLAALPLVG
ALVVGIALVL SLRAMRQTGR APV