Gene Dde_1453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDde_1453 
Symbol 
ID3756084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio desulfuricans subsp. desulfuricans str. G20 
KingdomBacteria 
Replicon accessionNC_007519 
Strand
Start bp1476595 
End bp1477746 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content59% 
IMG OID637782331 
Producthistidinol phosphate aminotransferase 
Protein accessionYP_387947 
Protein GI78356498 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0142373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC CCGCAAAGCG GGAAGATTCC GGGCTGCCTG AGGCCGTTAC TGCCATACGG 
CCAGAGGTGG CGGCATTCAA ACCATACGCG CCCGGCCTTT CCATTGATGA AATCAAGGAA
CGCTACGGAC TTTCCCAGGT CGTGAAAATG GCCAGCAACG AAAATCCGCT GGGCACCTCG
CCGCTGGTTC AGCAGACACT GCGTACCCAT GCAGACCTTG CCTTCCGGTA TGTCCAGTCC
GGCAATCCCC GGCTGGTCAG CGCCATTGCC CGGTCTTTCG GCGTTGCTGC CGAAAGCGTG
GTCACCGGCA ACGGCTCTGA CGAGGTGATA GACCTTATCA TACGGGTCAA GGCCCGCCCC
GGAAAGCACA ACATTGTCGC ATTCAATCCC TGCTTCAGCA TGTACGAGCT GCAGACGCGC
TTCTGCGGTG TAGAGTTCCG GCAGGTACCG CTGCGTGCCG ACTTCAGTTT CGACTACGAC
GCTTTTGTCG GCGCCGCTGA TGCCGACACG GCCGTCGCAT TCATCACCAC GCCCGACAAC
CCGTCCGGCT ACTGCCCCCC GGTCGAAGAA ATAATCGATC TTGCGCGGCG TCTTCCTTCA
TCATGTCTGC TGGTAGTGGA CGAAGCCTAC ATGGATTTTG CCGATGACCC TGCCGCCCAC
TCTGTTCTGC CGCACCTGAC AGAATTTCCC AATGTGGCAG TGCTGCGTAC TTTTTCAAAA
AGTTACGGGC TGGCCGGTCT GCGTCTGGGT TTCGGCGTTA TGCATCCCGC CCTTGCTGAC
TATGTAAAAA GGGTGCGGCT GCCTTTCAGC ATCAACATAC TGGCAGAGTA CGCAGGCATA
GCGGCACTGC AGGACACCAC ATTCCACGCG CAGACCCTGC GTGTAACCCG CGAAGGCAGA
ACCTATCTGA CCGGCGCGCT GACCGAGGCC GGATGCACGG TGTACCCTTC CGCAGCCAAC
TTCATCATGT TCGCGCTGCC GGAAAACTGC CCGCACGATG CGCGCGCGGT ATTCGAGGCA
CTGCTGCGCC GCGGTATCAT CATACGTCCG CTAAGCAGCT ACAACCTGCC GCAGTGTCTG
CGGGTCAGCA TAGGCAACAG GCACGAAAAC GAGCTGTTCA TAGCTCAGTT CAAGGAGCTT
CTCCGTGGCT GA
 
Protein sequence
MTEPAKREDS GLPEAVTAIR PEVAAFKPYA PGLSIDEIKE RYGLSQVVKM ASNENPLGTS 
PLVQQTLRTH ADLAFRYVQS GNPRLVSAIA RSFGVAAESV VTGNGSDEVI DLIIRVKARP
GKHNIVAFNP CFSMYELQTR FCGVEFRQVP LRADFSFDYD AFVGAADADT AVAFITTPDN
PSGYCPPVEE IIDLARRLPS SCLLVVDEAY MDFADDPAAH SVLPHLTEFP NVAVLRTFSK
SYGLAGLRLG FGVMHPALAD YVKRVRLPFS INILAEYAGI AALQDTTFHA QTLRVTREGR
TYLTGALTEA GCTVYPSAAN FIMFALPENC PHDARAVFEA LLRRGIIIRP LSSYNLPQCL
RVSIGNRHEN ELFIAQFKEL LRG