Gene Dvul_2397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2397 
Symbol 
ID4664088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2794966 
End bp2796084 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content62% 
IMG OID639820645 
Productextracellular ligand-binding receptor 
Protein accessionYP_967840 
Protein GI120603440 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.285564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.778884 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAG GTTGGTTCAA GGCGCTCATC GCGGGAATGA CCGTCGCGGT CATGGCTGGT 
CCGGTCTTCG CCGGTGACAC CATCAAACTG GGCGTGCCCG GCGCACACAG TGGCGACCTG
GCCTCTTACG GCCTGCCCTC TGCCAACGCC GCCAAGATTG TCGCCAAGAT GTTCAACGAC
AAGGGCGGCA TCAACGGCAA GATGGTCGAA GTCATTCCGC AGGACGACCA GTGCAAGCCT
GAAATGGCCA CCAACGCGGC CACCAAGCTC GTCTCCGACG GCGTGGACAT CGTGCTGGGT
CACATCTGTT CCGGCGCCAC CAAGGCCGCG CTGCCCATCT ACAAGGAAGC CAACAAGGTC
GTCATGTCGC CTTCGGCCAC CACGCCTGCG CTCACCCAGA GCGGCGACTA CCCCATGTTC
TTCCGCACCA TCTCCTCGGA CGACCAGCAG GCGAAGCTGG GCGTCGATTT CGCCATCGAC
AAGCTCGGTG CCAAGAAGAT CGCCGTGCTG CATGACAAGG GCGACTACGG CAAGGGCTAC
GCCGAGTACG CAAAGCAGTT CATCGAGCAG AGCGGCAAGG CCACCGTCGT GCTGTTCGAA
GGCGTGACCC CCGGTGCCGT GGACTACAGC GCCGTGGTGC AGAAGGTGCG CAGCGAAGGT
GCCGACGCAG TCATGTTCGG CGGCTACCAT CCTGAAGCCT CGAAGATCGT CGCCCAGATG
CGCAAGAAGC GTATGACTAC TCCCTTCATC TCCGACGACG GCGTGAAGGA CGACACCTTC
ATCAAGGTCG CCGGCAAGGA CGCCGAGGGC GTGTACGCCT CCAGCTCCAA GGACGTGAGC
ATGCTGCCCA TGTACAAGGA AGCCATCGAA CTGCACAAGA AGGAGTTCGG CACTGAACCC
GGCGCGTTCT ACAAGGAAGC CTTCGCCGCT GCGCAGGCCC TTCTTACCGC CGTGCAGCGT
GCAGGCAGCA CCGAAACCCC CAAGGTTGTC GACGCCCTGC GTAACAACTT CGTCGAGACC
GCCATCGGCA AGATCAAGTT CGACAAGCGT GGCGATGCCG AAGGTACCGG CTTCTCCATG
TATCAGGTCA AGAACGGCGT GTACGTCGAG CTGAAGTAG
 
Protein sequence
MRKGWFKALI AGMTVAVMAG PVFAGDTIKL GVPGAHSGDL ASYGLPSANA AKIVAKMFND 
KGGINGKMVE VIPQDDQCKP EMATNAATKL VSDGVDIVLG HICSGATKAA LPIYKEANKV
VMSPSATTPA LTQSGDYPMF FRTISSDDQQ AKLGVDFAID KLGAKKIAVL HDKGDYGKGY
AEYAKQFIEQ SGKATVVLFE GVTPGAVDYS AVVQKVRSEG ADAVMFGGYH PEASKIVAQM
RKKRMTTPFI SDDGVKDDTF IKVAGKDAEG VYASSSKDVS MLPMYKEAIE LHKKEFGTEP
GAFYKEAFAA AQALLTAVQR AGSTETPKVV DALRNNFVET AIGKIKFDKR GDAEGTGFSM
YQVKNGVYVE LK