Gene Moth_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1066 
Symbol 
ID3833331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1095907 
End bp1096914 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content56% 
IMG OID637828994 
Productaspartate semialdehyde dehydrogenase 
Protein accessionYP_429923 
Protein GI83589914 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.058115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00506315 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGAATT TGAATGTTGC CGTAGTAGGT ACCGGAGCTG TAGGCCAGAC CATGCTTAAA 
GTTCTGGAGG AAAGGAATTT TCCCGTCGGC AGGTTAAAGG TCCTGGCGAC CAGTCGCTCC
GCAGGAAAGA AAGTCACCTT CAAGGGCGAG GAGTACCGCG TTGAGGAAAC CACTCCGGAA
TCCTTCGCCG GGGTTAACGT AGCCCTCTTT GCCGGGGGTG AAGCCAGTAA AATCTTTGGC
CGGGCGGCGG TGGCCGCCGG AGCAGTGGTA ATTGATAATA GCAATAACTT CCGTATGGAT
CCGGAGGTAC CCTTGGTGGT ACCGGAGGTT AATCCCCAGG ATGTACGCTG GCATAAAGGG
CTGATTGCCA ACCCCAACTG CTCCACCATT CAGATGGTGG TTGCCCTGAA GCCCCTGTAT
GACGCCGCCG GCATCAAGCG GGTAGTAGTC TCGACCTACC AGGCTGTTTC TGGCGCCGGC
CAGGAAGCCA TCGATGAGCT GCGGAAACAG AGCCAGCAGG TCTTGGAGGG CAGGGAAGTG
AGCGGCAGGG TCTTCCCCTG GCAGATTGCC TTCAACTGCC TGCCCCATAT CGATATCTTC
CTGGAGAACG GTTATAGCAA GGAAGAAATG AAGATGGTCA ACGAGACCAA GAAAATTATG
GGAGATAATG ATATCCGGGT GACGGCCACC ACGGTACGGG TTCCGGTCTT TAACGGCCAT
TCGGAAGCAA TTAATGTAGA GACAAGGGAA AAGCTGACGG CCTCCCAGGC CAGGGAACTC
TTGAGCCGGG CCCCCGGGGT GGTGGTAGTC GACGATCTTG ATAATAAGGC CTATCCCCTG
GCCATCCAGG CCGACGGCCG GGACGAGGTA TTCGTCGGGC GTATCCGGGA GGATTTCTCC
ATTGCCAACG GCCTGAACCT GTGGGTGGTT GCCGATAACC TGCGCAAGGG TGCAGCGACC
AATGCCGTGC AGATTGCCGA ATTACTGCTG CAGGAAGGCC TTCTTTAG
 
Protein sequence
MANLNVAVVG TGAVGQTMLK VLEERNFPVG RLKVLATSRS AGKKVTFKGE EYRVEETTPE 
SFAGVNVALF AGGEASKIFG RAAVAAGAVV IDNSNNFRMD PEVPLVVPEV NPQDVRWHKG
LIANPNCSTI QMVVALKPLY DAAGIKRVVV STYQAVSGAG QEAIDELRKQ SQQVLEGREV
SGRVFPWQIA FNCLPHIDIF LENGYSKEEM KMVNETKKIM GDNDIRVTAT TVRVPVFNGH
SEAINVETRE KLTASQAREL LSRAPGVVVV DDLDNKAYPL AIQADGRDEV FVGRIREDFS
IANGLNLWVV ADNLRKGAAT NAVQIAELLL QEGLL