Gene Dshi_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1559 
Symbol 
ID5712703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1620493 
End bp1621701 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID641267474 
Productmajor facilitator superfamily transporter 
Protein accessionYP_001532902 
Protein GI159044108 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.418069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.23279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCCG CCCGCCTGTC CCAAGCCGAG TTCATTGCCC TGATGGCGAT GCTGTTCGCG 
ACGATTGCGT TTTCCATCGA CGCGATGCTT CCCGCCCTGC CCGAGATCGC CGCCGAACTG
ACGCCCGGTG ACGTGAACCG GGCGCAGTTG ATCGTGACCT CGTTCGTATT CGGGATGGGT
TTGGGCACCC TGGTGGCCGG GCCCTTGTCG GATGCCTTCG GGCGCAAGCC GGTGATCGTG
GCGGGCGCGG TGCTCTATTG CGCGGCGGCG GGGCTGGCTT GGGCGGCGCA ATCGCTGGAA
CTGGCATTGG CGGCACGGGT GCTGCAGGGG TTCGGCGCGG CCGCCCCGCG GGTGGTGGCG
ATCGCCATGG TGCGCGACCT CTATGTCGGG CGGCACATGG CGCGGATCAT GTCGCTTGTC
TTCCTGATCT TCGCGCTGAT CCCGGCCATC GCGCCGAGCC TTGGCGCGGT CATCATCCAT
TTCGCCGGGT GGCGGGCGAT TTTCGCCTCC TTCATCCTGT TTGCCATGCT GTCGGTGGGC
TGGATGATGC TGCGCCAGGC CGAAACCCTG GCGCCCGAGG CGCGCAGGCC GCTCTCGGTG
CGGGGTGTGG CGGACAATGT GGTCGAAGTG CTGCGCGACC GGGTGGTGCG CCTGTCGATC
CTGGCGCAAA CCATGGCCTA TGCCACGTTA TTTGCGACGC TGTCCTCGAC CCAGCCGGTG
TTCGATGTGA CCTTCGGCAA GGCGGAGACC TTCCATCTGT GGTTCGCGGT GATCGCCCTT
CTGGCGTCGA GCGCCAGCTA CATCAATTCG CGGCTGGTGG TGCGGCTGGG CATGCGGCGC
ATGGTGCGCG GGGTGCTGAC CGGGCAGATC GCGGTCTCGG GCGTGTTCCT GTCCGTGAGT
GTCGTGGGCT GGCCCGAGGC ACTGCATTTC TGGGCCTATT TCGTCTGGGT GACGGGGGTG
TTCTTCATGG CGGGCATGAC CCTGGGCAAC CTCAATGCCA TCGCGATGGA GCCGATGGGG
CATATCGCGG GCACGGCGGC CTCGGTGGTG GGGGCGCTGT CGACCATGGG GTCGGTGTTA
CTGGCCATTC CCATCGGGCT GCTGTTCGAC GGCACGCCGG TGCCGGGCGT TGCGGGGGTT
CTGGTGCTGT GCCTCGGGGC GCTGGCCGTG ATGAAGGTGC TGGGCGCGCG GGGTGAGGCG
CCGGCCTGA
 
Protein sequence
MPPARLSQAE FIALMAMLFA TIAFSIDAML PALPEIAAEL TPGDVNRAQL IVTSFVFGMG 
LGTLVAGPLS DAFGRKPVIV AGAVLYCAAA GLAWAAQSLE LALAARVLQG FGAAAPRVVA
IAMVRDLYVG RHMARIMSLV FLIFALIPAI APSLGAVIIH FAGWRAIFAS FILFAMLSVG
WMMLRQAETL APEARRPLSV RGVADNVVEV LRDRVVRLSI LAQTMAYATL FATLSSTQPV
FDVTFGKAET FHLWFAVIAL LASSASYINS RLVVRLGMRR MVRGVLTGQI AVSGVFLSVS
VVGWPEALHF WAYFVWVTGV FFMAGMTLGN LNAIAMEPMG HIAGTAASVV GALSTMGSVL
LAIPIGLLFD GTPVPGVAGV LVLCLGALAV MKVLGARGEA PA