Gene Moth_1955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1955 
Symbol 
ID3832306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2033116 
End bp2034294 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content59% 
IMG OID637829886 
Productaminotransferase 
Protein accessionYP_430796 
Protein GI83590787 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0276062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAAA AAATCTTTGC CGACAAAATG GCCAATCTGG GGACCGAAAC GGCCTTTATG 
GTCCTGGCCA AGGCCAAAGC TCTGGAAGCC CAGGGCAAGG AGATCATCCA CCTGGAAATC
GGCGAACCCG ATTTCGCTAC CCCCCGCAAC ATAATCGACG CCGGCATCCG GGCCCTGAAC
GAAGGTTATA CTCATTACAC CCCCGCCCCC GGGCTGCCGG AAGTCCGGGC GACCATTGCT
GAGTATGCCA CCCGGCAGAA GGGCGTTCAT TACGACCCGG AAGAAGTCGT CATCGTTCCC
GGGGGTAAAC CCATAATGTT CTTTACCATC CTGGCCCTGG TAAACCCGGG TGACGAGGTC
ATCTACCCCA ATCCCGGCTT CCCTATTTAT GAATCCGTCA TCAACTTCGT CGGCGGCAAG
GCAGTTCCCC TGCCCATCCG GGAAGAAAAC GACTTCCGCC TGGATGTAGA TGAACTGGCA
GGGCTCATCA CCCCCAAAAC CAAACTCCTG ATCATCAATT CCCCCGCCAA CCCCACCGGC
GGCGTCCTCA CGGCTGAAGA TATCGGCCGC ATCGCCGACC TGGTCCGGGG TAAGAACATT
GTCGTCCTGG CCGACGAGAT CTACGATCGC ATCGTTTACG ACGGTGCCCG TCCCGTATCC
ATTGCCGCCC AGCCGGGTAT GAAGGACTGG ACCATTATCC TGGACGGTTT CTCCAAGACC
TACGCCATGA CCGGTTGGCG GATCGGCTAC GGCCTGATGC ACCGGGAGCT CGCCGACCGC
ATCGCCCAGT TGATGGTCAA CTCCAACTCC TGCACCGCCG CCTTTACCCA AAAGGCTGCC
CAGGAGGCCC TGACCGGACC CCAGGACGCC GCCGAGGCCA TGGTGGCCGA ATTTAAGAAG
CGGCGGGACA TCATTGTTGA TGGCCTGAAC AGCATTCCCG GTATTACCTG CAAACGGCCT
CTGGGTTCCT TCTACGTCTT CCCCAACATC AAGGGTCTGG GCCTCTCCAG CCAGGAGCTG
GAAGCCTTCC TGATGGAAAA GGCGGGCGTA GCCGCCCTGA GCGGTACGGC CTTCGGTAAA
TACGGGGAAG GCTACCTGCG TCTCTCCTAT GCCAACTCGG TGGAGAACAT CGAGAAAGCC
CTGGAGAAAA TAGCGGCTGC CGTAAAGGAG CTGCGGTAG
 
Protein sequence
MFEKIFADKM ANLGTETAFM VLAKAKALEA QGKEIIHLEI GEPDFATPRN IIDAGIRALN 
EGYTHYTPAP GLPEVRATIA EYATRQKGVH YDPEEVVIVP GGKPIMFFTI LALVNPGDEV
IYPNPGFPIY ESVINFVGGK AVPLPIREEN DFRLDVDELA GLITPKTKLL IINSPANPTG
GVLTAEDIGR IADLVRGKNI VVLADEIYDR IVYDGARPVS IAAQPGMKDW TIILDGFSKT
YAMTGWRIGY GLMHRELADR IAQLMVNSNS CTAAFTQKAA QEALTGPQDA AEAMVAEFKK
RRDIIVDGLN SIPGITCKRP LGSFYVFPNI KGLGLSSQEL EAFLMEKAGV AALSGTAFGK
YGEGYLRLSY ANSVENIEKA LEKIAAAVKE LR