Gene GWCH70_2137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2137 
Symbol 
ID7976948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2203800 
End bp2204897 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content45% 
IMG OID644798953 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_002950113 
Protein GI239827489 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATTA AAACACAGTT GCGGGGGCTT CCCCCTTATC AGCCGGGAAA ATCTATTGAA 
GAAGTAAAAC GAGAGTACGG GCTTACCGAT ATTATTAAAC TAGCATCCAA TGAAAATCCA
TATGGTTGTT CACCTGCTGT GAAAGAGGCG GTGATGAAAC AATTAGATCA TCTTGCCATT
TATCCCGATG GATACGCACG TCTGTTGCGC GAAAAAGTTG CCACACATTT AGGGGTCAAC
GAAACACAGC TTATTTTCGG CAACGGGTCG GATGAAGTCG TGCAAATTAT TTGCCGCGCG
TTTTTATCTC CGAATACAAA TACGGTGATG GCTGCGCCGA CGTTTCCACA ATATCGTCAT
AACGCGGTGA TTGAAGGGGC GGAAATTCGT GAAATTCCGC TGGTGGATGG GCGACATGAT
CTAGAAGCAA TGCTGAATGC AATTGATGAA CAAACGCGCG TCGTTTGGAT ATGCAACCCG
AACAACCCGA CAGGGACGTA TGTGAACGAG CAGGAATTAA CCTCTTTCCT TGAGCGAGTT
CCTAGCCATG TCCTTGCCGT TTTGGATGAG GCGTATTATG AATACGCAAC GGCGAATGAT
TATCCGCAAA CCGTTCCACT TCTCCGCCAA TATGATAATT TAATGATTTT GCGTACGTTT
TCAAAAGCAT ACGGTTTAGC AGCGCTGCGG GTTGGATACG GTATTGCCAG CGAAACGCTC
ATTCGTGAGA TCGAACCGGC GCGCGAGCCA TTTAATACAT CAAGCGTCGC GCAGGCAGCT
GCCATTGCTG CTTTAGATGA TCAAGCATTC ATTCGCGAAT GTGTCGAAAA AAATAAACAA
GGGTTAGAGA CGTTTTATCG TTTTTGTGAG GAAAATGGGC TGCGCTATTA TCCGTCACAA
GCGAACTTTA TTTTAATTGA TTTTGGTATC GAGGGAAACG AAGTGTTTCA ATATTTGCTT
GAGCGGGGCA TCATCGTTCG CTCCGGCAAT GCGCTCGGTT TTCCGACATC GGTGCGCATT
ACGGTTGGTT CCAAAGAGCA AAACGAACGA ATCATTCATG CATTAACGCA AATGTTGAAA
GAAAAGCAGC TTATATAA
 
Protein sequence
MEIKTQLRGL PPYQPGKSIE EVKREYGLTD IIKLASNENP YGCSPAVKEA VMKQLDHLAI 
YPDGYARLLR EKVATHLGVN ETQLIFGNGS DEVVQIICRA FLSPNTNTVM AAPTFPQYRH
NAVIEGAEIR EIPLVDGRHD LEAMLNAIDE QTRVVWICNP NNPTGTYVNE QELTSFLERV
PSHVLAVLDE AYYEYATAND YPQTVPLLRQ YDNLMILRTF SKAYGLAALR VGYGIASETL
IREIEPAREP FNTSSVAQAA AIAALDDQAF IRECVEKNKQ GLETFYRFCE ENGLRYYPSQ
ANFILIDFGI EGNEVFQYLL ERGIIVRSGN ALGFPTSVRI TVGSKEQNER IIHALTQMLK
EKQLI