Gene Dvul_1964 details

Gene Information

Locus tag: Dvul_1964
Symbol:
ID: 4663469
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 2281683
End bp: 2282804
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 63%
IMG OID: 639820205
Product: histidinol-phosphate aminotransferase
Protein accession: YP_967407
Protein GI: 120603007
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
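
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2281683, 2282804)
    print(length)                  # 1122
    print(protein_length(length))  # 373
```

Under these assumptions the values agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields listed in the record.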


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC CAAGCATGAG CAGGCCGGAC GATGTGCGGC CAGAAGTGCT CGACTTCAAG 
CCCTACGTTC CGGGCTTGTC CATCGACGAG ATACGCGACC GCTTCGGACT CGCCGACGTG
GTGAAGCTGG CAAGCAACGA AAACCCCCTC GGCACTTCGC CCGTCGTGCA GCGTACCCTC
AAGACCAAGG CCGACCTCGC CTTCCGCTAC GCGCAGTCGG GCAACCCCCG CCTCACGCGT
GCCATTGCCG CCCATCATGG TGTCGCGCCG GAACGTGTCG TGGCGGGCAA CGGTTCAGAC
GAGATCATCG ACCTGCTCAT CCGCGTGCGC GCCACCCCCG GCAAGCACAA CATCGTGGCC
TTTCGCCCGT GCTTCAGCAT CTACGAGCTT CAGGCGAAGT TCTGCGGTCT GGAATTCCGG
CAGGCCGACC TGCGACCCGA TTTCACCTTC GACTGGGACG CCTTCCTCGC CGCCACGGAT
GAGAACACCG CCATCGCCTT CGTGACCACC CCCGACAACC CCTCCGGCTG GTGTCCGCCG
GTGTCTGAAC TTGAACACGT CGCCCGCACA CTGCCCCCGT CGTGCCTCTT CGTCATCGAT
GAGGCGTACA TGGATTTCTG CGGCGACGAA GCCGCGCATT CGCTGCTTTC TCGGCTTGAC
GCCTTCCCCA ACATCGCGGT GCTACGCACC TTTTCCAAGA GCTTCGGGCT TGCGGGACTT
CGCCTCGGCT ACGGCATCCT CCCGGAACGT CTGGCTGACT ACCTGCACCG GGTACGACTG
CCGTTCAGCG TGAACATCCT CGCCGAAGAA GCGGGACTTG CCGCCCTTGA GGATACTGTG
TTCAGAAGCG AGACCCTTCG CGTCACCGCC GAAGGCCGTG CATACATCGC CGAAGGACTG
ACGGCACTGG GGTGCGAGGT CCTGCCTTCG TGGGCCAACT TCATCATGTT CCGACCGCCC
ACGGATGCAA CCGACCTCTT CGAGGCGCTT CTGCGGCGCG GCATCATCAT CAGACCCCTC
AAAAGCTATG GCCTGCCCCA ACACCTGCGG GTGAGCATGG GCAACGCCGA CGAGAACAGA
CGTTTCATAG CAGCCTGCAA GGAGATTCTG CCTCATGCCT GA
 
Protein sequence
MTAPSMSRPD DVRPEVLDFK PYVPGLSIDE IRDRFGLADV VKLASNENPL GTSPVVQRTL 
KTKADLAFRY AQSGNPRLTR AIAAHHGVAP ERVVAGNGSD EIIDLLIRVR ATPGKHNIVA
FRPCFSIYEL QAKFCGLEFR QADLRPDFTF DWDAFLAATD ENTAIAFVTT PDNPSGWCPP
VSELEHVART LPPSCLFVID EAYMDFCGDE AAHSLLSRLD AFPNIAVLRT FSKSFGLAGL
RLGYGILPER LADYLHRVRL PFSVNILAEE AGLAALEDTV FRSETLRVTA EGRAYIAEGL
TALGCEVLPS WANFIMFRPP TDATDLFEAL LRRGIIIRPL KSYGLPQHLR VSMGNADENR
RFIAACKEIL PHA
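
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the pipeline that produced the record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The amino-acid assignments of NCBI translation table 11 match those of the standard code, so a single 64-letter lookup string is used. The code assumes the sequence is given on the coding strand, in frame, and contains only A, C, G, and T.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    print(translate("ATGACCGCAC CAAGCATGAG CAGGCCGGAC"))  # MTAPSMSRPD
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content field.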