Gene Dole_1983 details

Gene Information

Locus tag: Dole_1983
Symbol:
ID: 5694823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 2399450
End bp: 2400538
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 57%
IMG OID: 641264581
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001529864
Protein GI: 158521994
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
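
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The short Python sketch below does that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2399450
end_bp = 2400538

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1089 bp
print(protein_length, "aa")   # 362 aa
```

Both results agree with the Gene Length and Protein Length fields.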


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000438985
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAT CGCCGCCGGA CTATATTTTA TCAATCAAGC CCTATGTGCC GGGAAAGCCC 
ATTGAAGAGC TGGAGCGGGA ATACGGCATC TCGGGATCTA TCAAGCTGGC CTCCAATGAA
AACCCCCTGG GGCCGTCCCC CCTGGCCCTG GCCGCCATTG AAAAAGCACT TTCCGGGCTG
CACCGCTACC CCGACGGCAG CGGGTATTAC CTGGTCTCAA AACTTGCGCA AAAACTGGGG
GTGGCCCCGG AGTCCATTGT CCTGGGCAAC GGTTCCGACG ACATCATCGG CATGCTCACC
CGGGCCCTGC TGCTGCCCGG CGACGAGGTG ATCATGACCG ATCCCTCATT TGCCATGTAT
GATATCACCA CCTGCATGGT CAACGCGCGG TCGGTCTATG TGCCGCTGAT CGACCGGGCA
CTGCCTCTTG ACACCGTGGC CGGTGCCGTT ACGTCAAAAA CAAAGATGGT GTTTCTCACC
AACCCCAACA ACCCGACCGG CACGGTTTTT TCCGGAAAGG CGTTTGAACG GTTTCTGGAG
GCGGTGCCCT CCGATGTGGT GATTGTGGTG GATGAGGCCT ACATCGAGTT TGTTCAGGAC
CCGGAGTGTG CCCGGGCCTT TGATTTTCTT GACAACAGCC GTCCTCTCGT GGCGTTGCGC
ACCTTTTCAA AGGCCTATGG CCTGGCCGGC ATTCGGGTGG GATACGGCGT CATGCCGCCG
TATCTGGCGG CGATTCTAAA CCGCATTCGC CAGCCCTTTA ATGTCAACTC CCTGGCTCAG
GTGGCGGCTA TTGCGGCTCT GGATGATGAG GCCTTTTTAA AACAAACCCT GGCCGTGGTG
CATGACGGGC TGGCCTGGCT TTATGCCGAG CTGGAGAAAA TGGGCCTTCG CTGTTTTCCC
TCCCAGGCCA ATTTTTTTCT TGTCGATGTA AAAAAAGATG CCGCCGCTGT TTTTGAAGAG
ATGTTAAAGC AGGGCGTGAT CATTCGCTCC ATGGTCTCCT ACGGATATCC TTCCTATATC
CGGGTAACCG TTGGTCTGCC GGAGGAAAAC GCCCGGTTTG TGGCGGCGTT AAAGGCGGTG
CTGAAATGA
 
Protein sequence
MKISPPDYIL SIKPYVPGKP IEELEREYGI SGSIKLASNE NPLGPSPLAL AAIEKALSGL 
HRYPDGSGYY LVSKLAQKLG VAPESIVLGN GSDDIIGMLT RALLLPGDEV IMTDPSFAMY
DITTCMVNAR SVYVPLIDRA LPLDTVAGAV TSKTKMVFLT NPNNPTGTVF SGKAFERFLE
AVPSDVVIVV DEAYIEFVQD PECARAFDFL DNSRPLVALR TFSKAYGLAG IRVGYGVMPP
YLAAILNRIR QPFNVNSLAQ VAAIAALDDE AFLKQTLAVV HDGLAWLYAE LEKMGLRCFP
SQANFFLVDV KKDAAAVFEE MLKQGVIIRS MVSYGYPSYI RVTVGLPEEN ARFVAALKAV
LK
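
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch in Python. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with the standard code for codon assignments (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The function name `translate` and the constants are illustrative, not part of the record, and the example uses only the first 30 bases of the gene sequence.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence listed above.
first_blocks = "ATGAAGATAT CGCCGCCGGA CTATATTTTA"
print(translate(first_blocks))  # MKISPPDYIL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should yield the full protein up to the terminal TGA stop codon.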