Gene Moth_1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1490 
Symbol 
ID3831717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1538302 
End bp1539492 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content60% 
IMG OID637829422 
ProductL-aspartate aminotransferase 
Protein accessionYP_430342 
Protein GI83590333 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTTG CCCAGCGGGC TGCCGGTATC AGTCCTTCAC CCACCCTGGC CATTGACGCC 
CAGGCCAAAG CCATGAAGGC TAAAGGGGTG AAGGTAATAA ACTTCAGCGC CGGCGAGCCT
GATTTCGGTA CACCGGAGCA TATCAAACAG GCAGCCATCG ACGCCCTGGC AGCCGGCTTT
ACCCGTTACA CGCCGGTAGC CGGGATTCCT GAACTGCGCC AGGCAATATG CGCCAGCCTG
GCTGCCCGGG GAGTGACCTA TGAACCGGCA GATATCGTCG TCTCCTGCGG CGCCAAGCAT
TCCCTCTATA ATGCCATGCA GGTTTTACTT AACGCTGGTG ATGAGGTGAT CCTTAGCGCC
CCCTACTGGG TAAGCTATTA CGAACAGGTT AAACTCGCCG GCGGTGTCCC GGTGGTAGTC
ACCACCGGCC CCGACACCGG CTTTAAGTTG ACGCCAGGGT TGCTGGAAGC GGCCATTACC
CCGCGGACAA GGCTCTTAAT CCTCAATTCG CCATGTAACC CAACCGGCGC TGTCTACAGC
CGGGAGGAAC TGGCAGCCCT GGCTGAAGTA ATTGTTGCCC GGGACCTGAT AGTCATTTCC
GATGAAATTT ATGCCGCCCT CCTCTACGAC GGCCTGACCC ACACCAGCAT CGCCTCCCTG
GCGCCGGAGG TAAAAGAACG GACCATCCTC ATTGACGGGG TGTCCAAGAC CTACGCCATG
ACCGGCTGGC GGATTGGCTA TGCCGCTGCG CCGCGGCCCG TGGCCAAAGC CATGACGGAT
CTCCAGAGCC ACTCGACTTC CAATCCCACC TCCATCGCCC AGAAGGCGGC CGTGGCTGCC
CTGACCGGCA GCCAGGAAGC CGTGGAAATG ATGCGTCGCG AGTTTGAACA ACGCCGCAAC
CGCATCCTGG CGGGCCTGCG GGAGTTACCG GGCATCGAAT GCAACCAGCC CGGCGGCGCT
TTCTACGTTT TCCCCTATAT CGGCAAGTTG TTCGGCCGCA AATTCCGGGG TCGGGTCCTG
GGCAACTCCA CCGATGTCGC TACAGCCCTG CTGAATGAAT TCCAGGTGGC AGTGGTACCG
GGCGTCGCCT TCGGCGCCGA ACCTTACCTG CGCCTCTCCT ATGCCACCTC CATGGACCAG
ATCGAAGCCG GCCTGGAAAG ACTCCGGGCC TTTGTAACCG AACTGGAATA G
 
Protein sequence
MQLAQRAAGI SPSPTLAIDA QAKAMKAKGV KVINFSAGEP DFGTPEHIKQ AAIDALAAGF 
TRYTPVAGIP ELRQAICASL AARGVTYEPA DIVVSCGAKH SLYNAMQVLL NAGDEVILSA
PYWVSYYEQV KLAGGVPVVV TTGPDTGFKL TPGLLEAAIT PRTRLLILNS PCNPTGAVYS
REELAALAEV IVARDLIVIS DEIYAALLYD GLTHTSIASL APEVKERTIL IDGVSKTYAM
TGWRIGYAAA PRPVAKAMTD LQSHSTSNPT SIAQKAAVAA LTGSQEAVEM MRREFEQRRN
RILAGLRELP GIECNQPGGA FYVFPYIGKL FGRKFRGRVL GNSTDVATAL LNEFQVAVVP
GVAFGAEPYL RLSYATSMDQ IEAGLERLRA FVTELE