Gene Dvul_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1031 
Symbol 
ID4663944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1264207 
End bp1265484 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content62% 
IMG OID639819255 
Productaromatic amino acid transporter 
Protein accessionYP_966478 
Protein GI120602078 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGT GCATGACAGC CAACGGGCAG ACCGCCACGC AAGCCGGGGT GAAGAATCCC 
TCGGTTCTTG GCGGCGCCAT GATCATAGCC GGGACGACCA TCGGCGCGGG CATGTTCTCG
CTGCCGAGCG TCTCTGCCGG CATGTGGTTC TTCTATTCCC TCTTCGTCCT GTTCGGCACG
TGGCTTTGCA TGTGCCATTC CGGCCTGATG ATTCTCGAGG CCAACCTCAA CTATCCCGCC
GGGACATCCT TCGACAACAT CGCCAAGGAC TGCCTTGCCA GACCCGTGCG GCTGCTGAAC
AGCCTCTCGG TGGCCTTCGT GCTCTACATT CTCACCTACG CCTATATCAG CGGCGGCGGG
TCCATCGTCG CGCATACGGT GAAGGCGGCT GTGGGCATCG ACGTTCCCAT GAAGCTGGGC
GGCTTCCTCT TCGCGCTTGT GCTCGCGTTC GTCGTATGGC TGAGCACCCG CGCCGTCGAC
CGCATCTCCA CCATCATGCT CGGCGGCATG ATACTCACCT TCTTCTCGTC GGTGAGCGGC
CTCATGTTCA ACGTGCAGCC TGCTGTGCTC TTCGACACCG GCGACACCAG TGCGCCATAC
TCGCCCTTCA TCCTCGCCAC GCTTCCCTAC TTTCTCACCT CGTTCGGGTA CCACGGCAAC
GTCCCCGGAC TCGTGAAGTA CTACAACAAG GACCCGAAGG CCGTCGCCAA GACCATCATC
TACGGCAGCT TCCTCGGCCT CATTCTCTAC GTGTGCTGGC AACTCAGCGT ACTGGGCAAC
ATCCCGCGCG AGGAATTCCT CGACATCGTC GCCAAGGGCG GCAACATGGG CATCCTCGTG
GGGGCGCTGT CCAAGGTCAC AGGCAGCACC AACCTCGACT ACCTGCTCCA GGTCTTCTCG
CATCTTGCCG TGGCGACCTC GTTCCTCGGC GTGACGCTTG GCCTGTTCGA CTGCATCGCC
GATACGCTCG GCTTCGACGA CTCGCGTCTC GGACGTACCA AGACCGCCAT CGTCACCTTC
GTGCCCCCGG CCATTGGCGG CCTGTTCTAC CCCGACGGAT TCATCATGGC CATCGGCTTT
GCCGGGCTTG CAGCCACCGT GTTCGCTGTC ATCGTACCCG CCATGATGGC ACTGGCCACA
CGCAGGAAGT TCGGCAACAC CACCTACCGC GCCCCCGGCG GCAACGTGAT GCTCTATGTG
ACCATCGCCT ATGGCATCAC AGTGGCCATC TGCCATGTGC TGACCATGTT CGACACGCTG
CCTGTCTACG GCAAGTAG
 
Protein sequence
MSQCMTANGQ TATQAGVKNP SVLGGAMIIA GTTIGAGMFS LPSVSAGMWF FYSLFVLFGT 
WLCMCHSGLM ILEANLNYPA GTSFDNIAKD CLARPVRLLN SLSVAFVLYI LTYAYISGGG
SIVAHTVKAA VGIDVPMKLG GFLFALVLAF VVWLSTRAVD RISTIMLGGM ILTFFSSVSG
LMFNVQPAVL FDTGDTSAPY SPFILATLPY FLTSFGYHGN VPGLVKYYNK DPKAVAKTII
YGSFLGLILY VCWQLSVLGN IPREEFLDIV AKGGNMGILV GALSKVTGST NLDYLLQVFS
HLAVATSFLG VTLGLFDCIA DTLGFDDSRL GRTKTAIVTF VPPAIGGLFY PDGFIMAIGF
AGLAATVFAV IVPAMMALAT RRKFGNTTYR APGGNVMLYV TIAYGITVAI CHVLTMFDTL
PVYGK