Gene Moth_1067 details


Gene Information

Locus tag: Moth_1067
Symbol:
ID: 3833332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1096999
End bp: 1098240
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 58%
IMG OID: 637828995
Product: aspartate kinase I
Protein accession: YP_429924
Protein GI: 83589915
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class
            [TIGR00657] aspartate kinase
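
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1096999
end_bp = 1098240
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bases each).
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
coding_codons = codons - 1

print("span (bp):", span)
print("codons:", codons, "remainder:", remainder)
print("codons excluding stop:", coding_codons)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop equals the listed protein length.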


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0350826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0457717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGGTCC TGGTCCAAAA GTTCGGTGGT ACGTCGGTAG CCAGTCCCGA GCAACGACTG 
GTGGTAACCG GGCATATCGA AAGGGCCTGC CGGGTGGGTT ACCAGGTGGT AGTAGTCGTT
TCGGCTATGG GGCGCCGGGG CGCGCCCTAT GCTACCGATA CCCTCCTTGA ACTGCTGGGG
GATAACGAGG TTGAGCCCCG GGAGCGGGAT CTCCTCCTGG CTTGTGGCGA GGTTATTTCC
GGGGTGGTCC TGACCGGGCT CCTTAAAAGT AAGGATCTGC CGGCAGTTTT CCTGACGGGA
GGCCAGGCCG GCATCATTAC CGACGCCCAG TTTGGGGATG CCCGTATTCT TAGGGTTGAA
CCGCGCCGGA TTCAATCCTA CCTTGACCAG GGCCGGGTAG TGGTGGTAGC CGGTTTCCAG
GGAGTCACTG AATCCGGGGA AGTAACCACT CTAGGCCGTG GTGGCAGCGA CACCACGGCG
GTGGCCCTGG GAGTGGCCTT GGGTGCGGAA GCAGTTGAAA TTTTTACCGA TGTGGATGGG
GTTAAAACTG CCGACCCACA TATTGTCAGC GATGCCAGGA CCCTGAGCAC CATCACCTAC
AATGAGGTTT GTCAGATGGC CTATGAAGGG GCGAAGGTCA TCCACCCCCG AGCCGTAGAA
ATAGCCCGGC AGAAGAATAT TCCCTTACGG ATCAAGTCAA CCTTTAATGA CGGCCCTGGT
ACCCTGGTGG TAGCCTGGCA ACCGGGGGTC ACCGGCGTCC ATATCAGCCG GGACCGGGTC
ATTACCGGCA TTACCCACAT GGACGGTCTG ACCCAGTTGC GGGTTTCCCT CCCCTCAGGG
GAGGGAGCCG GGGAGGTTTT CCCGCTGCTG GCTCAAAATA ATATCAGCGT GGACTTTATC
AATATCTTTC CCGGGGAACT GGTCTTTACC GTTAAAAGCG AGGTTGCCCG GCAGGCCCGG
GAACTGATTG AAGGGCTGGG CCTGAAGGTG ACTGCCCGCC CCGGTTGCGC CAAGGTGGCC
ACGGTAGGGG CCGGTATGCG CGGCGTACCC GGGGTCATGG CTACCATTGT TACGGCCCTG
GAGCGGGAGG GCATTAAAAT TCTCCAGTCA GCAGATTCCT ATACTTCTAT CTGGTGCCTG
GTGGACAGGA AGGATATGGA ACGGGCCGTA CAAACCCTTC ATCGGGAGTT TAAACTTAAC
GACGGTAAAA CAGGCGAGGT GAAAGTTTAT GCAGTGGGGT AG
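
The gene sequence above is laid out in space-separated blocks of 10 bases, 60 bases per full row. A minimal sketch of how such a layout might be cleaned and measured follows. Only the first and last rows are embedded here, so the GC figure it computes is for the first row alone and is not the whole-gene value reported in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing and line breaks, returning upper-case bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGAAGGTCC TGGTCCAAAA GTTCGGTGGT ACGTCGGTAG CCAGTCCCGA GCAACGACTG"
last_row = "GACGGTAAAA CAGGCGAGGT GAAAGTTTAT GCAGTGGGGT AG"

# The record shows 20 full rows followed by the shorter last row.
full_rows = 20
total_bases = full_rows * len(clean_sequence(first_row)) + len(clean_sequence(last_row))

print("bases per full row:", len(clean_sequence(first_row)))
print("bases in last row:", len(clean_sequence(last_row)))
print("total bases:", total_bases)
print("GC fraction of first row:", round(gc_fraction(first_row), 4))
```

Pasting all 21 rows into a single string and passing it to `gc_fraction` would give the whole-gene figure to compare with the GC content field.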
 
Protein sequence
MKVLVQKFGG TSVASPEQRL VVTGHIERAC RVGYQVVVVV SAMGRRGAPY ATDTLLELLG 
DNEVEPRERD LLLACGEVIS GVVLTGLLKS KDLPAVFLTG GQAGIITDAQ FGDARILRVE
PRRIQSYLDQ GRVVVVAGFQ GVTESGEVTT LGRGGSDTTA VALGVALGAE AVEIFTDVDG
VKTADPHIVS DARTLSTITY NEVCQMAYEG AKVIHPRAVE IARQKNIPLR IKSTFNDGPG
TLVVAWQPGV TGVHISRDRV ITGITHMDGL TQLRVSLPSG EGAGEVFPLL AQNNISVDFI
NIFPGELVFT VKSEVARQAR ELIEGLGLKV TARPGCAKVA TVGAGMRGVP GVMATIVTAL
EREGIKILQS ADSYTSIWCL VDRKDMERAV QTLHREFKLN DGKTGEVKVY AVG
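
The protein sequence can be related back to the gene sequence by translating codons. The sketch below translates only the first and last rows of the gene sequence, using the standard codon assignments, which translation table 11 shares. The gene begins with GTG, which normally encodes valine but is read as methionine when it serves as the start codon. The `START_CODONS` set is a deliberately small subset of the starts permitted by table 11, and the function and parameter names are hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(blocked_dna: str, is_start: bool = False) -> str:
    """Translate a DNA string in frame 1, ignoring block spacing.

    If is_start is True and the first codon is a recognised start codon,
    the first residue is reported as methionine.
    """
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First and last rows of the gene sequence, copied from the record.
first_row = "GTGAAGGTCC TGGTCCAAAA GTTCGGTGGT ACGTCGGTAG CCAGTCCCGA GCAACGACTG"
last_row = "GACGGTAAAA CAGGCGAGGT GAAAGTTTAT GCAGTGGGGT AG"

print(translate(first_row, is_start=True))
print(translate(last_row))
```

The first row yields the first 20 residues of the protein sequence, and the last row yields its final 13 residues followed by `*` for the TAG stop codon. The last row is in frame because the 1200 bases before it are a multiple of three.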