Gene Moth_1304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1304 
Symbol 
ID3831790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1346992 
End bp1348209 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content58% 
IMG OID637829240 
Productaspartate kinase 
Protein accessionYP_430160 
Protein GI83590151 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.506172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTCA TCGTCCAAAA GTACGGCGGC ACTTCCGTTA ACGGCCCGGA ACGGGTCAAA 
AACGTAGCCC GCCGGGTAGT AAATACCCGA CGCGCCGGGA ACGACGTGGT CGTTATTGTG
TCAGCTCCGG GCGATATGAC CGACGATCTC ATCGCCATGG CCCACGAGAT CAGCCCCAAC
CCGCCGGCCA GAGAAATGGA CATGCTTCTG GCTACCGGGG AGCAGACATC GATAGCCCTC
CTGGCCATGG CCATCCACGA GCTTGGCGAA CCGGTTATCT CCCTGACCGG CCCCCAGGTG
GGCATCCTGA CCGACAACGT CCATTCCAAG GCGCGCATTA TGGAAGTGAG CTGCGAGCGC
CTGCGCCGGG AATTAGAACA GGGCAAGATC GTTATTGTAG CCGGCTTCCA GGGCAAGACC
TGTGAAGGCG AGATAACGAC CCTGGGCCGG GGAGGCTCCG ATACAACGGC CGTGGCCGTG
GCCGCCGCCC TGAAGGCCGA CGTTTGCCAG ATCTATACCG ATGTGGACGG CGTTTATACG
GCCGATCCCC GGGTGGTGCC GGAGGCCAGA AAATTACCGG TTATTTCCTA CGATGAAATG
CTAGAATTGG CGAGTCTAGG TGCCCAGGTG CTGCAACCCC GGTCGGTAGA GTTTGGCAAA
CTCAACCATG TCGTCCTCGA GGTACGATCA AGCTTTAATG ATCATGAAGG AACCCTGGTC
AAAGAGGTGA CGGAAATGGA GAGGAAAATG GTCGTCAGCG GCGTAGCCGG TGACCGCAAC
GTAGCCAGGA TAGCCCTGCA CGACGTCCCC GACCGGCCGG GCATCGCCAG GACCCTCTTT
GTAGCCCTGG CCCGAGAGAG CATCAATGTT GATATGATCG TCCAGAGCGC CATGCGGGAC
GGGATTAATG ATATCGCCTT CACCGTAGGG CGTGACGATC TCCAGAAGGC GGTTGAGGTA
ACGGAAAGGG TACGTCAGGA AATTGGTGCC AGCAAGGTGA CTTCTAACGA CCGGGTGGCC
AAGGTATCCA TCGTCGGCGC CGGTATGATC ACCAATCCCG GCGTGGCTGC CGACATGTTC
GCCTGCCTGG CTGAGGAAGG CATTAATATT CACATGATCA GTACTTCAGA GATCAAGGTA
TCCTGCATCA TTGACGAAGA ACACCTGACC CGGGCCATGC AAGCCCTGCA CCGTCACTTT
AAACTGGACC GGGAGTAA
 
Protein sequence
MALIVQKYGG TSVNGPERVK NVARRVVNTR RAGNDVVVIV SAPGDMTDDL IAMAHEISPN 
PPAREMDMLL ATGEQTSIAL LAMAIHELGE PVISLTGPQV GILTDNVHSK ARIMEVSCER
LRRELEQGKI VIVAGFQGKT CEGEITTLGR GGSDTTAVAV AAALKADVCQ IYTDVDGVYT
ADPRVVPEAR KLPVISYDEM LELASLGAQV LQPRSVEFGK LNHVVLEVRS SFNDHEGTLV
KEVTEMERKM VVSGVAGDRN VARIALHDVP DRPGIARTLF VALARESINV DMIVQSAMRD
GINDIAFTVG RDDLQKAVEV TERVRQEIGA SKVTSNDRVA KVSIVGAGMI TNPGVAADMF
ACLAEEGINI HMISTSEIKV SCIIDEEHLT RAMQALHRHF KLDRE