Gene Moth_0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0515 
Symbol 
ID3831817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp534374 
End bp535534 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content55% 
IMG OID637828449 
Producthistidinol phosphate aminotransferase 
Protein accessionYP_429388 
Protein GI83589379 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.839874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATGGTGA GAAAATCTAC CGCCTCTAAC CGGCGGTTGC AGGACAAAGG AGATGAAGAA 
CCAGTGGCTA CCGTTCCCAA GTGTCGTGAA GCCATCCTGA GCATTAAACC ATACGTTCCG
GGGAAACCCA TTGAAGAGGT CCAGCGTGAG CTGGGAGTTA AGGATGTCAT TAAGCTGGCT
TCCAATGAAA ATCCGCTGGG ACCTTCCCCG GATGCCGTCC AGGCCTTGCA AGAGGCCAGC
GATAGGATCT TCCTCTACCC TGACGGCAAC TGCTATTATC TTAAAGAAGC CCTGGCGGCC
AAACTGGGGG TAAAACAGGA GAACCTGATT ATCGGCAATG GTACTGATGA GATCCTGAAG
ATGCTGGCCG AGACCTACAT TAACCCTGGC GATGAAATCG TCGTCGCCGA CCCGACCTTT
TCCGAATACG AATTCGCCGC CCAGGTAATG GGGGGACGGG CCATAAAGGT TCCCACTCGT
AATTTCCGCC ACGACCTGGC AGCCATGGCG GCAGCTATTA CTCCCAGGAC GCGGCTGGTC
TTCGTCTGCA ACCCCAACAA TCCCACCGGG ACGATTGTCG GCCAGGCCGC CCTGGACGGT
TTCTTAAAGC AGGTGCCTCC TTCCGTACTA GTCGTCCTGG ACGAGGCCTA TTCCGATTAT
GTTACCGCCG AACACTATCC CAACAGCCTG GCCTATGTCC GGGCCGGCAG GGCCAATGTC
ATAATCCTGC GCACCTTCTC TAAAATTTAC GGCCTGGCCG GGTTGCGGGT CGGTTACGGG
GTCGCTGTTC CCGAGATTAT CAGGGACTTG AACCGGGTAC GGGAGCCTTT TAACGTCAAC
CTTCTCGCCC AGGTTGCCGC CGTTGCTGCC CTCAAGGACG AAGCCCACGT AGGTAAGAGC
AGGGAAGTTA ACAGCGAGGG CAAGGACTAT CTCTACAGCC AATTTGAATC CCTGGGACTA
AAGTACGTTC CCACCGAAGC CAATTTCATC TTTGTGGATA TCCAGCGGGA CAGCCGGGAG
GTTTTCCGTC AACTGTTGCA AAAGGGAGTT ATCGTCCGGA CTGGCGATAT CTTTGGCTAT
GATACCTTCC TGCGGGTAAC CATTGGTACC CGGCGCCAGA ACGAAACCTT CATTCGCGCT
CTGAGGGAAA TTTTGGCTTA G
 
Protein sequence
MMVRKSTASN RRLQDKGDEE PVATVPKCRE AILSIKPYVP GKPIEEVQRE LGVKDVIKLA 
SNENPLGPSP DAVQALQEAS DRIFLYPDGN CYYLKEALAA KLGVKQENLI IGNGTDEILK
MLAETYINPG DEIVVADPTF SEYEFAAQVM GGRAIKVPTR NFRHDLAAMA AAITPRTRLV
FVCNPNNPTG TIVGQAALDG FLKQVPPSVL VVLDEAYSDY VTAEHYPNSL AYVRAGRANV
IILRTFSKIY GLAGLRVGYG VAVPEIIRDL NRVREPFNVN LLAQVAAVAA LKDEAHVGKS
REVNSEGKDY LYSQFESLGL KYVPTEANFI FVDIQRDSRE VFRQLLQKGV IVRTGDIFGY
DTFLRVTIGT RRQNETFIRA LREILA