Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2962 |
Symbol | |
ID | 4661938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008741 |
Strand | - |
Start bp | 10948 |
End bp | 13890 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639813882 |
Product | major facilitator superfamily transporter |
Protein accession | YP_961161 |
Protein GI | 120586816 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCACA TGACCGCCAA CCCCGCCCGC ACGTTGCGTC TCAAGGTCAT GGCATGGGGG CTTCTGGCCC TGCTGTGCGC CCAGATGTTC TACGGCGTGC TCGTCGTCTC GTCCCTGTAC CGCCAGTACC GGGGACCCAT GCTCACGGTG CAGGCCATCG CCGGTGACGA CCTCGCCCGG CGGCTTGGCC GCATGGCGCG CCTCGGCAAG CCGCTCGACC GCATCAGCCA TCTCGACGCC ATGCTCGCCC CGTTCCGCGA CACCACCTCG GCCACGGCCC TGTACGTGAC CGACGCCACA GGCGCGGTGC TGGCCGCATG GGACGCCGAC ACGGTGCTGC CGCGCCTTGA ACCGCCCACC CAGACCTCGC CGGTGGGGGT CACCGGGGCG CAGGAATTCC GGCGCGACGA GGCCACGTGG CTTGTCTCGC CCATCCTTGG CAAGGACGGC GACCTCGTGG GGCGCGTGCT GCTGCGTACC GACCCGCAAC GCCTGCAGGC CGCACTGCGC GACGCCGCCT CGTACCTGCC GCTCTTCGCG TCGGTGGCGG GCGGGTCGGC CCTGCTGCTG GTGGTGCTGT GCCTCACGCT GCTGCCCGCC TCCGGCGGGG GCGTGGGGGC CACGGGAACG GGCCAGACGC GACAGACTGG ACAGACTGGA CAGGCGGGCC AGACTCAAAG GGCAGGACGC ACCCTCGACG CGCGTCAGTG GGGCCGCCGT GCCCGCGTGG CCCTCGTGGC ACCGCTCCTC GTGGGGCAGC TGTGTTGCAG CGCCGCCCTC TACGCGCCGC TGAAGGGCCT CCACCAGAGG CAGAGCGCCG ACATCGCCGC GCAACTGGCT GAACAGCTTG GCAGAGACCT GACATCCATC GTCCACAAGG GTGTGGCCCT CGGCGACATC CCCGGCATCG ACCTGCACCT GCGCAGCCTG CAACAGGCCC TGCCGCAGGT GGCGGCCATG GGCGTGACCG ATACGGCAGG ACGCCTGCTG GCCGGGGCCA CCGCCTCGAC GACGCTGACA CCCGAAGCAT GGCTGCGCCT CTCGGATGAC GGGCCTACGG GCCGCTTCGA CGTGCCGCCC CCGCCGGGGG TGGGTCGCGC CGCGACACCA AGCCCTGCTG ACGGCAACGC GCAAAGCGCA GACGCAGCCG CCGGTGAACC TGTGACCGGA AACGCAGTCA CCGGAAACGC GGCTGCCGGC GACACCATCA CCGGGGGCGA AGCTACGGGG GGCGGCGAAG TGCGCGTTCT CATGTCCGGT GCCGCCATCC GCACAGGGCT GCGCGAGGCC CTGCTCGATA CCGCCACCGT CGCCGTGGTG GCGATGCTGC TGCTCATGGA ACTTGCGTCC ATGCTGCTGA TGCGTGCGCA TCGCGCTCTG GACGGCATGG GGGATGCGGC GCACGCGCCG ACGCGTGAGG ACGGAACGAA CGCCCCCGAC GCCCCCCGCA TCAACGACAT CAACGGCATC ACGCATACCC CGGGCAGTAC CGGCGGCACG GCCACCGCGA GCCTCACCGA TAGCCCGAGG CTTGCGGACA CAGCGAAGCT CATGGACACC GAAGACCGGG CAACCACCGA TAACCTCACG GACACCCCGC CCCGTGCCCA TGCCGCAGGT CGGGCCACGG CCCATGCGAA CCTGCTGGCA GAGACGCCGG GCTTCATGCG CCCCGTCATC TTCTTCTGCA TGTTCGCCAT CGACATGGGC GTGTCGTTCA TTCCGCTGCG TCTTGCCGAA CTCGACGCCA CGCTCTTGGG ACTGCCGCGT GACGTGGTCA TGGGGCTGCC CGTCTCGGCG GAGATGTTCA TGGTGGCGGT CGCCATCCTT CTCGGCGGTG CAGCGGGCGA GCGCTTCGGC TGGCGTCCGC TGCTGCTGTG CGGCGTGCTG CTTGCCGCAC TGGGCAACCT CGTCAGCGGC ACGGCGGCGT CGGCACTGGG CTACATCGCC GCGCGCGGCA TCGCCGGGGC CGGGTACGGC TGCATCAACC TCGCCGCCCA GCTGCACGTC ATGAACCATT CCGACGAGAG CAACCGGGCG GGCAACCTCG CCCACATGTT CGCGGGACTG TTCGCCGGGG CCATGTGCGG CAGCGCCACG GGCGGCCTTG TGGCGGACAG GCTCGGCTAC GGGCCGGTGT TCTTCGTCGC CGCGGGCATG TTGCTGGCGG TGCTCGGTGT GCTGGCCCTG TGTCCGGCGC GGCCCCATGT GCCAGCCCTC GGGTCCCGCC CCGCCCCGCA TCCCGGACAG ACTCCCGAGG CCACGTCGCT GGACACAGCA CATGACGCCG ACGGCAACGC GGCGGGCGTG CTCGCCTTCC TGCGCGACAG GCGCATGGTG GGCCTGCTGG TGTGCAACAT CATGCCCGTG GCCTTCGTCA CGGTATGCCT GTTCCAGTTC TTCATCCCCG TCTACCTCAG CGAAGGGGGA GCAAGCCCCG CCGACATCGG GCGCGTCTCC ATGCTGTTCT GCCTCGTGGT GGTCTTCCTC GGCCCGCTGT GCGGCAGACT CATCGACGCC ACCCCGCGCA AGGACAGGAT GCTGGTCGTG GCGGGCATCG CCGGGGCACT GGCCATAGCC GCCCTGCTGA TGGGCGGCGG CATCGCAGAG GCTGTGCTCG CGGTCGTCAT GCTGGGGGTC TCCAACGCCA TCGCCGCCAG CGCGCAGGGC ACCTACGCCC TGAGCCTGCC CGTGGCCCTG CGCATGGGCA GGGCGCGCAC CATGGGCGTC TACAACATCA CCGAACGTGT GGGGCAGGTC ATCGGCCCCG TCACCTTCGC CGTGCTGCTT TCGCTGCTGG GGCGCGAAGG GGGACTGCTG GCCATGGCTG CGGGTGTCGC CGCCCTGACA CTGCTCTTCA TGTGGCTTTC GGGCGGGGAC ACCGCCAGGA CGGCTGCCGA CGCAACCGCC GTGACCACCA CCGGGCGGCA CTCCAACACC TAA
|
Protein sequence | MPHMTANPAR TLRLKVMAWG LLALLCAQMF YGVLVVSSLY RQYRGPMLTV QAIAGDDLAR RLGRMARLGK PLDRISHLDA MLAPFRDTTS ATALYVTDAT GAVLAAWDAD TVLPRLEPPT QTSPVGVTGA QEFRRDEATW LVSPILGKDG DLVGRVLLRT DPQRLQAALR DAASYLPLFA SVAGGSALLL VVLCLTLLPA SGGGVGATGT GQTRQTGQTG QAGQTQRAGR TLDARQWGRR ARVALVAPLL VGQLCCSAAL YAPLKGLHQR QSADIAAQLA EQLGRDLTSI VHKGVALGDI PGIDLHLRSL QQALPQVAAM GVTDTAGRLL AGATASTTLT PEAWLRLSDD GPTGRFDVPP PPGVGRAATP SPADGNAQSA DAAAGEPVTG NAVTGNAAAG DTITGGEATG GGEVRVLMSG AAIRTGLREA LLDTATVAVV AMLLLMELAS MLLMRAHRAL DGMGDAAHAP TREDGTNAPD APRINDINGI THTPGSTGGT ATASLTDSPR LADTAKLMDT EDRATTDNLT DTPPRAHAAG RATAHANLLA ETPGFMRPVI FFCMFAIDMG VSFIPLRLAE LDATLLGLPR DVVMGLPVSA EMFMVAVAIL LGGAAGERFG WRPLLLCGVL LAALGNLVSG TAASALGYIA ARGIAGAGYG CINLAAQLHV MNHSDESNRA GNLAHMFAGL FAGAMCGSAT GGLVADRLGY GPVFFVAAGM LLAVLGVLAL CPARPHVPAL GSRPAPHPGQ TPEATSLDTA HDADGNAAGV LAFLRDRRMV GLLVCNIMPV AFVTVCLFQF FIPVYLSEGG ASPADIGRVS MLFCLVVVFL GPLCGRLIDA TPRKDRMLVV AGIAGALAIA ALLMGGGIAE AVLAVVMLGV SNAIAASAQG TYALSLPVAL RMGRARTMGV YNITERVGQV IGPVTFAVLL SLLGREGGLL AMAAGVAALT LLFMWLSGGD TARTAADATA VTTTGRHSNT
|
| |