Gene Tbd_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_0952 
Symbol 
ID3672744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp1014462 
End bp1015616 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content69% 
IMG OID637709631 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_314710 
Protein GI74316970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.221365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCT GCGATTCGAT CTGCGCTGCA ACCGCGCCGC CCCACGTGCG CGCGATCGCT 
CCGTATCAGC CCGGCAAGCC GATTTCCGAG CTTGCACGCG AACTGGGCCT GGCCGAAGCC
GATATCGTCA AGCTCGCTTC GAACGAAAAT CCGCTCGGTC CGAGCCCGTT CGCGCTTGCC
GCGGCCCAGG ACGCGCTCCT GGACATGGCC TTGTATCCGG ACGGTGCGGG CTACGCGCTG
AAGGCGAAGC TCTCGGCCAG GCTCGGCGTC GACGCGGCGC AGATCGTGCT CGGCAATGGT
TCCAACGACG TGCTCGACAT GGTCGCGCGC GCCTATCTCG CGCCGGGGAC CTCGGCCGTC
TATGCGCAGT ACGCGTTCGC GGTCTACCCG ATCGCCACGC ATACGGTCGG TGCCCACGGG
ATCGCGGTCG CGGCGCGGGA CTTCGGCCAC GACCTCGAAC GCATGCGCGC CGCGATCCGT
GACGACACCC GGGTGGTGTG GATCGCGAAT CCCAACAACC CGACCGGCAC TTTCCTGCCG
TGGAACGAGA TCGAGGCCTT CCTCGAGACC GTGCCGCCCC GCGTGCTGGT CGTCCTCGAC
GAGGCCTACG GCGAATACCT CGCGCCCGCG TCGCGCTGCG ACACGGCGGC CTGGGTGGTG
CGTTTCCCCA ATCTCTTGAT CAGCCGCACC TTTTCCAAGG CCTACGGTCT GGCCGGGCTA
CGGGTCGGCT ACGGGATCGG GCACGCCGAC GTCGTCGACC TGCTGAACCG GGTCCGCCAT
CCGTTCAACG TCAACGCCTC GGCGCTCGCC GCGGCGGAGG CCGCGCTCGA CGACGACGCC
TTTCTCGCGC GAAGCTATGC GCTCAACGCG GCGGGCATGC AGCAACTCTT AGGCGGGCTC
GCGGCCCTGG ACATCGAGAC CGTCCCGTCG AAGGGCAATT TCGTCCTCGC GCGGGTCGGC
GATGCGGCGC GCATCAACAC CGAGCTACTC AAGCGCGGCG TGATCGTACG ACCGGTCGCA
GCCTACGGGC TGCCCGAATT CCTGCGCGTG TCTGTCGGTC TTGCCGGCCA GAATGCGCGC
TTTCTCGACG CCCTGGGCGA GGTTCTCGCG GCGGCGCCCG GCCGGCACCC CGACAGCCGG
AAAGCCCTGC CGTGA
 
Protein sequence
MSGCDSICAA TAPPHVRAIA PYQPGKPISE LARELGLAEA DIVKLASNEN PLGPSPFALA 
AAQDALLDMA LYPDGAGYAL KAKLSARLGV DAAQIVLGNG SNDVLDMVAR AYLAPGTSAV
YAQYAFAVYP IATHTVGAHG IAVAARDFGH DLERMRAAIR DDTRVVWIAN PNNPTGTFLP
WNEIEAFLET VPPRVLVVLD EAYGEYLAPA SRCDTAAWVV RFPNLLISRT FSKAYGLAGL
RVGYGIGHAD VVDLLNRVRH PFNVNASALA AAEAALDDDA FLARSYALNA AGMQQLLGGL
AALDIETVPS KGNFVLARVG DAARINTELL KRGVIVRPVA AYGLPEFLRV SVGLAGQNAR
FLDALGEVLA AAPGRHPDSR KALP