Gene DvMF_1049 details

Gene Information

Locus tag: DvMF_1049
Symbol:
ID: 7172945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris str. 'Miyazaki F'
Kingdom: Bacteria
Replicon accession: NC_011769
Strand:
Start bp: 1275173
End bp: 1276438
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 69%
IMG OID: 643539556
Product: major facilitator superfamily MFS_1
Protein accession: YP_002435472
Protein GI: 218886151
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
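
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; neither assumption is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the gene record; the helper functions are illustrative only.
START_BP = 1275173
END_BP = 1276438
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1276438 - 1275173 + 1 gives 1266 bp, and 1266 / 3 - 1 gives 421 aa, which matches the values listed in the record.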


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCCT TTCCCGATCA CCAGGATCCC CGCGCCACCG GTCCGTTGCG GGTATTCCGC 
CATCGCAACT ACCGGCTGTT CTTCGCCGGG CAGGCCATTT CGCTGCCCGG CACCTGGATG
CAGTCCATGG CCCAGTCGTG GCTGGTCTAC CGGCTGAGCG AATCCAGCTT CGTTCTGGGG
GCGCTGGGCT TTGCCGCGCA ATTGCCGCTG TTCGTGCTGT CCGTGTTCGG CGGCGCGCTG
GCCGACACGC GCGACAGGCG CGCCATACTG GTGGCCACGC AGGTGGCCTC CATGCTGCTG
GCGCTGACTG CCGCCGCGCT GACCATGACC GACGTGGTGC AGGTGTGGCA CGTATTCGTG
CTGGCCACGG CGCTCGGCAT CGTCAACGCC TTCGACGTGC CCACGCGGCA GTCCTTCATC
ATGGACATGG TGGGGCGCGA CGATCTGCCC ACGGCCATCG GCCTCAACTC GTCCATGTTC
AACGCGGCGC GTGTGGTCGG GCCAACCCTG GCGGGGCTGG TGGTGGCCGC CGCGGGCGAA
GGGTGGTGCT TTCTGCTCAA CGGCATCAGC TTTGTGCCCG TCATCGCGGG GCTGATGATG
ATGCGCCTGC CCGTCCACGT GCCCCCGCCG CCCGGCCCTT CCACGTTGCA GCGCATCCGC
GAGGGGCTGG GCTTTGCCGC GCGCCACGAA GGTATCCGCA CCACCCTGCT GCTTGTGGGG
GCCACCAGCC TCATCGCGGT GAACTATTCC GTGCTGATGC CGGTGGTGGC CGACAAGGTG
CTGGGCGGCA ACGCCAGGAC ACTGGGCCTG CTGCTGGGGG CCGCCGGGGC GGGTGCGCTG
CTGGGCGCGC TGTGCCTTGC CCTGCGGCGC AGCAGCGACG GGCTGTCACG ATGGGCGCTG
TACGGGGCTG TTGGACTGGG GGCCAGCCTG ACGGCATTCG CGCTGTGCCG GTCGGTGTGG
ACGGCGCTGG TGGCGCTGGT GCCCGTGGGC ATGTGCATGG TGGTGCTGAT GGCATCGGCC
AACACGCTGC TGCAAATCAT GTCGCCCGAC GCCTACCGGG GCCGGGTCAT GGCCCTGTAT
TCCATGATGT TCCTGGGCAT GGGGCCGTTC GGCTCGCTGC TTGGGGGCAG CGTTGCCCAT
GCGCTGGGCC CATCGCTCAC GCTGCTGCTG TCCGGCATCG TCTGCCTGGG CAACGCGCTG
TGGTTCGGGG TGTGGCTGCG GCGGCACGGC CCGTCGCTGG CTGGCGTGGG GCGCGAGACA
AGTTGA
 
Protein sequence
MPPFPDHQDP RATGPLRVFR HRNYRLFFAG QAISLPGTWM QSMAQSWLVY RLSESSFVLG 
ALGFAAQLPL FVLSVFGGAL ADTRDRRAIL VATQVASMLL ALTAAALTMT DVVQVWHVFV
LATALGIVNA FDVPTRQSFI MDMVGRDDLP TAIGLNSSMF NAARVVGPTL AGLVVAAAGE
GWCFLLNGIS FVPVIAGLMM MRLPVHVPPP PGPSTLQRIR EGLGFAARHE GIRTTLLLVG
ATSLIAVNYS VLMPVVADKV LGGNARTLGL LLGAAGAGAL LGALCLALRR SSDGLSRWAL
YGAVGLGASL TAFALCRSVW TALVALVPVG MCMVVLMASA NTLLQIMSPD AYRGRVMALY
SMMFLGMGPF GSLLGGSVAH ALGPSLTLLL SGIVCLGNAL WFGVWLRRHG PSLAGVGRET
S
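
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup; the function names are hypothetical, and the stop-codon check assumes the standard stop codons TAA, TAG and TGA, which translation table 11 uses.

```python
# Hypothetical helpers for working with the block-formatted sequences shown above.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons used by translation table 11


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def ends_with_stop(cds: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(cds) % 3 == 0 and cds[-3:].upper() in STOP_CODONS


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from the record.
    first_line = "ATGCCTCCCT TTCCCGATCA CCAGGATCCC CGCGCCACCG GTCCGTTGCG GGTATTCCGC"
    last_line = "AGTTGA"
    print(len(join_blocks(first_line)))
    print(ends_with_stop(join_blocks(last_line)))
```

Applied to the full gene sequence, `join_blocks` should yield 21 lines of 60 bases plus a final 6 bases, which is the 1266 bp listed under Gene Length.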