Gene GWCH70_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1169 
Symbol 
ID7977645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1217834 
End bp1219081 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content47% 
IMG OID644798122 
Productaspartate kinase I 
Protein accessionYP_002949295 
Protein GI239826671 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000398967 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA TTGTTCAGAA GTTTGGCGGC ACATCCGTCC GTGACGGACG CGGGCGTGAT 
TTTGCACGCA AGCATATTGA AAAAGCGCTT GAAGACGGCT ATAAAGTAGT TGTTGTCGTC
TCGGCGATGG GGCGAAAAGG AGAACCGTAT GCGACGGATA CGCTCCTTAG CCTCATCGGC
GGGGCTAACA ATTATGTCAC GAAGCGCGAA CAAGATATGC TAATGGCGTG CGGTGAAATT
ATTTCTAGCG TTGTTTTTAC GAATCTATTA AATAAGCATG GAATAAAAGC AACTGCGTTT
ACTGGCGCGC AAGCAGGTTT CCGAACGAAC GATGATTATA CGAATGCGAA AATTATCGAA
ATGCGGTGCG AACGCCTGCT TAAAGCATTG AACGAATACG ATGTCGTTGT CGTTGCTGGC
TTTCAAGGCG CGACAGAAAA TGGCGATATT ACAACGCTTG GGCGCGGCGG AAGCGATACG
TCTGCGGCGG CGCTTGGTGC GGCGTTAAAC GCCGAATGGG TCGATATTTT TACCGATGTC
GAAGGGGTGA TGACTGCAGA CCCGCGCATT GTCGAGAACG CCCGTCCGCT CGATGTCGTC
ACGTATACGG AAATTTGCAA TATGGCCTAT CAAGGGGCAA AAGTGATTCA TCCACGCGCT
GTTGAAATTG CGATGCAGGC AAAAGTGCCG TTGCGCGTTC GTTCAACGTA TTCCGATTCT
TTAGGAACGC TTGTTACATC TGCTATTCGT TCGAAAAAAG GAAGCGATGT AAAAGAGCGG
TTAGTCACTG GCATTACTTA TGTTTCCAAT ATTACGCAAA TTAAAGTACA GGCGAAAGAG
GGACATTATG AATTGCAGTC CGATGTTTTT AAGGCGATGG CGAATGAAGG AATTAGTGTC
GACTTCATTA ACATCTCGCC AAACGGTGTT GTTTATACGG TTTCTGGTGA AATGACAGAG
CGAGCGGTCG CTGCCCTTCG CCGTATTGGT TATGAACCGA TCGTTACAAC AGGATGTGCG
AAAGTATCTA CAGTCGGAGC AGGGATTGCT GGAGTCCCTG GAGTGACGGC AAAAATCGTT
ACGGCTCTTT CTGAGCAAGG AATTCAAATT TTACAATCAG CCGATAGCCA TACGACCATT
TGGGTATTAG TGAAAGAGGA AGATATGAAA AAAGCGGTGA ACGCGTTGCA TGATGCATTC
CATCTTTCCG AGGAATCGGC GGAAGAGTAC GATTTAAAAT TGGAGTGA
 
Protein sequence
MKIIVQKFGG TSVRDGRGRD FARKHIEKAL EDGYKVVVVV SAMGRKGEPY ATDTLLSLIG 
GANNYVTKRE QDMLMACGEI ISSVVFTNLL NKHGIKATAF TGAQAGFRTN DDYTNAKIIE
MRCERLLKAL NEYDVVVVAG FQGATENGDI TTLGRGGSDT SAAALGAALN AEWVDIFTDV
EGVMTADPRI VENARPLDVV TYTEICNMAY QGAKVIHPRA VEIAMQAKVP LRVRSTYSDS
LGTLVTSAIR SKKGSDVKER LVTGITYVSN ITQIKVQAKE GHYELQSDVF KAMANEGISV
DFINISPNGV VYTVSGEMTE RAVAALRRIG YEPIVTTGCA KVSTVGAGIA GVPGVTAKIV
TALSEQGIQI LQSADSHTTI WVLVKEEDMK KAVNALHDAF HLSEESAEEY DLKLE