Gene Dhaf_3037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_3037 
Symbol 
ID7260048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp3255963 
End bp3257228 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content45% 
IMG OID643562957 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002459496 
Protein GI219669061 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGCT TACATAAGAA TAATTCTTTA TACCGGGCTT TTCCAGCACT CAGTCATTCC 
CCGTTTCGGT GGTTTTGGGG TGGACAGATT ATTTCGCTTA TTGGGACATG GACCCAAAAT
ATCGGCCAGG CCTGGTTGGT CCTGCAGCTG ACAAATTCGC CGTTTTTGCT CGGGTTGGTG
GCCGCTATGC AGTTTGTACC GACCATGCTG TTCTCTCTGC AAGCCGGAGC TTGGATCGAT
CACTTGCCGA AACGCAAAGT ATTGATTGCC ACCCAGACTG TGATGATGAT CCTAGCCTTT
GCCTTAGCTT TTTTAGTAGG TTCGGGTTCC CTTCGCTATT GGATGCTTCT TGTGATGGCC
TTCATTCTGG GTGTGACCAA CACTGTAGAT GTCCCCACCC GCCAATCCTT TATTATTGAA
TTGGTAGGAA GAGAGCATTT AGCCAATGCC ATCGCCCTCA ATTCGGCGAT CTTTAATGGT
GCCCGTTTAG TTGGCCCTGC CATAGCCGGA GTGATTATGG GAATCTGGGG ACCCATGTGG
TGTTTTTTGA TTAATGGCTT GAGCTTTATT GGTGTTTTGG CAATTTTAAT TTTTGTTCCG
GCTATTCCTC ATCAGGAAAA GATCACTCCT AAAAAAGAAA CCTTGAGGAA AGATATCCTC
AATGGTTTGA GCTATATAAG AAAAACTCCC TCCATTCTGA TCGTCATGAT GATGATGGGT
TTTCTGAGCA CTATCGCCAT GAATTTTAAT GTACTGGTCC CTGTTCTGGC CAAAATTGAT
TTGCAGGCAG AGGCCCTGGG CTATGGGCTT CTGATGAGCG CTTTGGGATT GGGCGCTCTG
ATCGGCGCTT TAACCGTGGC GATCAGAAGT GCAGAAGGCC CGCAACCTCG TTTGCTTTTG
GTGGGAGCCT TTGGATTGGG AATGTTTAAT GTGGTTGTAG GATTGCAGAA CACCTATTTT
TTTAGTGCCT TTTTCTTAGC TTTTCTTGGT TGGTCCATGA TTGTTTTTTC AGCTTCAGCC
AATTCATTGA TCCAGATTAC AGTGGACAGT CAATATCGGG GAAGAGTCAT GAGTGTTTAT
AATCTGGTTT TTGGAGGTAT GATTCCCATA GGAAGCCTTT ACGCTGGGAC ATTATCCGAT
TTATGGGGAG CGAGGATGAC CTTTATCATC AGTGGAACCA TTACCTTACT GTTTATGGGT
GGGATAGTTT TTTGGCTAAG GCGTTATCGA AAGGATGAAG GTCATGAGAA TAGCAGTTTT
GTCTGA
 
Protein sequence
MASLHKNNSL YRAFPALSHS PFRWFWGGQI ISLIGTWTQN IGQAWLVLQL TNSPFLLGLV 
AAMQFVPTML FSLQAGAWID HLPKRKVLIA TQTVMMILAF ALAFLVGSGS LRYWMLLVMA
FILGVTNTVD VPTRQSFIIE LVGREHLANA IALNSAIFNG ARLVGPAIAG VIMGIWGPMW
CFLINGLSFI GVLAILIFVP AIPHQEKITP KKETLRKDIL NGLSYIRKTP SILIVMMMMG
FLSTIAMNFN VLVPVLAKID LQAEALGYGL LMSALGLGAL IGALTVAIRS AEGPQPRLLL
VGAFGLGMFN VVVGLQNTYF FSAFFLAFLG WSMIVFSASA NSLIQITVDS QYRGRVMSVY
NLVFGGMIPI GSLYAGTLSD LWGARMTFII SGTITLLFMG GIVFWLRRYR KDEGHENSSF
V