Gene Moth_1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1188 
Symbol 
ID3832991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1220262 
End bp1221476 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content58% 
IMG OID637829121 
Productaminotransferase 
Protein accessionYP_430045 
Protein GI83590036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0134588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTG CCCACGGTTT TGAAGCCAGG CGTTTTATCA CACCGGTGGT TAGGGATCTG 
CCGCCTTCAG GAATCCGCCG CTTTTTTGAA CTGGTAGCCA GCACCAAGGG CGTAATCTCC
CTCGGGGTGG GAGAACCAGA TTTCGTCACT CCCTGGCATA TCCGGGAGGC CTGCGTCCAA
TCCCTGGAAC GGGGCTATAC CATGTACACC TCCAACTACG GCCTGCCCGA ACTGCGCCGG
GCCATAGCCG ATTACCTGGC CTGGCGTTTC GGCTTGACCT ATGATCCCAT GAAGCAGATT
ATGGTGACCA TTGGTGCCAG TGAAGCAGTC GACCTGGCAC TGCGGACGGT ATTGAACCCC
GGGGATGAAG TCTTGATCCC CGAGCCCTGT TATGTTTCCT ACCAGCCCAT AACGCAACTG
GCGGGGGGTA TCCCGGTCCC TATCCCGACG ACAATGACTG ACGGTTTTGC CCTTACGGCC
GCCCGCCTGG AGCACTACAT TACCCCCCGG AGTAAAGTCT TGATTCTTTG CTTTCCCAAT
AACCCGACCG GCGCCGTCCT CAGCCGGGAG GAGATGCAGG CCATTGCCCG GCTGGTTGAA
AAGTACAACC TGCTGGTGAT CAGCGATGAA ATCTATGCAG AATTGCGTTA TGAGGGCCAA
CCGTTATCCT TTGCCTCTCT GCCCGGCATG CAGGAGCGCA CCATCCTGGT GAGCGGTTTT
TCCAAGGCCT TTGCCATGAC CGGATGGCGC ATCGGTTATG TAGCGGCTCA TCCCGATTTT
CTGGCAGCTA TGGTTAAAAT CCACCAGTAT ACCATCCTCT GCGCTCCCGT CATGGGTCAG
ATGGCGGCCC TGGAAGCCCT GCGCCACGGC CGCCAGGACG TGGAGAGGAT GGTGGAGCAA
TACGACCAGC GACGGCGCCT GGTTTACAGC CGGCTGCGGG AGATGGGCCT GGATTGTTTT
GAACCCCGGG GGGCCTTTTA CATCTTCCCC TCCATAGCTG CCACCGGCCT GGATTCCGTC
ACTTTCGCCG AAGAGCTTCT TAAGGAAGAA AAGGTGGCCG TAGTACCGGG TACAGCCTTT
GGCGCCAGTG GCGAAGGTTT TATCCGCTGT TCCTATGCCG CCTCCCTGGC CGACCTGACA
GAAGCCATGA ATCGCATGGA ACGCTTTGTC GCCCGGCGTC TGGCCTTTAA CCAGCGGGCC
GTAGCCCAGG CTTAA
 
Protein sequence
MTIAHGFEAR RFITPVVRDL PPSGIRRFFE LVASTKGVIS LGVGEPDFVT PWHIREACVQ 
SLERGYTMYT SNYGLPELRR AIADYLAWRF GLTYDPMKQI MVTIGASEAV DLALRTVLNP
GDEVLIPEPC YVSYQPITQL AGGIPVPIPT TMTDGFALTA ARLEHYITPR SKVLILCFPN
NPTGAVLSRE EMQAIARLVE KYNLLVISDE IYAELRYEGQ PLSFASLPGM QERTILVSGF
SKAFAMTGWR IGYVAAHPDF LAAMVKIHQY TILCAPVMGQ MAALEALRHG RQDVERMVEQ
YDQRRRLVYS RLREMGLDCF EPRGAFYIFP SIAATGLDSV TFAEELLKEE KVAVVPGTAF
GASGEGFIRC SYAASLADLT EAMNRMERFV ARRLAFNQRA VAQA