Gene Dvul_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2254 
Symbol 
ID4663769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2621732 
End bp2622862 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content63% 
IMG OID639820499 
Productextracellular ligand-binding receptor 
Protein accessionYP_967697 
Protein GI120603297 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGTG TCGTCAGGTT GCTGGCGGTC TGCATGGTCA CGTCGCTGCT CATGGCTGCG 
ACGGCGTTCG CCGCCGGGCC GGTGCGTGTG GGGCTCATGT GTCCGCTGAC CGGCAAATGG
GCCAGTGAAG GGCAGGACAT GCGCAACATT GTCGAACTGC TGGCTGAAGA GGTGAACAAG
GCCGGGGGCA TCAACGGCAA CAAGGTCGAA CTGATCGTCG AGGACGACGG TGGCGACCCG
CGCACTGCAG CACTCGCCGC GCAGAAGCTT TCCACCTCCG GTGTTACCGC CGTCATCGGC
ACCTATGGCT CGGCTGTGAC CGAAGCCTCC CAGAACATCT ACGACGAGGC GGGCATCGCC
CAGATAGCCA CCGGGTCGAC CAACGTGCGC CTCACCGAAA AGGGCCTCAA GCTCTTCCTG
CGCACCTGCC CGCGTGACGA CGAACAGGGT CGCGTCGCCG CCAAGGTCAT CAAGAGCAAG
GGCTACAAGG CCGTTGCCCT GCTGCATGAC AACTCGTCCT ACGCCAAGGG CCTCGCCGAC
GAGACCAAGG CACTGCTCGA CAAGGACGGC ACCAAGATCG TGTTCTACGA CGCCCTCACC
CCCGGCGAGC GCGACTACAC CGCCATCCTG ACCAAGCTCA AGGCCGCCAA CCCCGACATC
ATCTTCTTCA CGGGCTACTA CCCCGAAGTG GGCATGCTGC TGCGCCAGAA GATGGAGATG
AAGTGGAACG TGCCCATGAT GGGCGGAGAC GCCGCCAACA ACCTCGACCT GGTCAAGATT
GCGGGCAAGC CCGCCGCGAA GGGCTACTTC TTCCTCAGCC CGCCCGTGCC GCAGGACTTC
GACACCGCCG AAGCCAAGGC CTTCCTCGCC GCCTACAAGG CCAAGCACAA CGCCCTGCCC
AACTCGGTGT GGTCTGTGCT TGCCGGTGAC GCCTTCAAGG TCATCGTCGA AGCCGTGCAG
AAGGGTGGCA AGGCCGACGG TGCCTCCATC GCCACGTACC TGAAGACCCA GCTCAAGAAC
TACCCCGGTC TTTCGGGGCA GATATCCTTC AACGAAAAGG GCGACCGCGT AGGCGACCTG
TACCGCGTGT ACGACGTCAA CGCCGAAGGC GAATTCGTCC TGCAGCGTTA G
 
Protein sequence
MKGVVRLLAV CMVTSLLMAA TAFAAGPVRV GLMCPLTGKW ASEGQDMRNI VELLAEEVNK 
AGGINGNKVE LIVEDDGGDP RTAALAAQKL STSGVTAVIG TYGSAVTEAS QNIYDEAGIA
QIATGSTNVR LTEKGLKLFL RTCPRDDEQG RVAAKVIKSK GYKAVALLHD NSSYAKGLAD
ETKALLDKDG TKIVFYDALT PGERDYTAIL TKLKAANPDI IFFTGYYPEV GMLLRQKMEM
KWNVPMMGGD AANNLDLVKI AGKPAAKGYF FLSPPVPQDF DTAEAKAFLA AYKAKHNALP
NSVWSVLAGD AFKVIVEAVQ KGGKADGASI ATYLKTQLKN YPGLSGQISF NEKGDRVGDL
YRVYDVNAEG EFVLQR