Gene Moth_0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0889 
Symbol 
ID3831430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp923399 
End bp924571 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content62% 
IMG OID637828819 
ProductLL-diaminopimelate aminotransferase 
Protein accessionYP_429749 
Protein GI83589740 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID[TIGR03540] LL-diaminopimelate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000010407 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAG CCAGAAGGAT TCGCGAACTA CCGCCATACC TCTTTGCCCG TATCGAGAAA 
AAAATCGCCG AGGCCCGGGA GCGGGGAGTG GATATCATCA GCCTTGGCAT CGGCGACCCC
GATATGCCAA CTCCGTCCCA CGTGATAGAC AAGCTGGTCG CCGAGGCCCA CAACCCGGAG
AACCACCGCT ACCCCACCTC CGAAGGCCTG CTGGCCTTCC GCCAGGCGGT GGCCGATTGG
TACCAGAGGC TTTATGGTGT CGACCTCGAT CCCCGGCGGG AGGTGGTCAC CCTCATCGGC
TCCAAGGAGG GTATCGCCCA CATCTCCCTC TGCTACGTCG ACCCCGGGGA CATCAACCTG
GTGCCGGACC CCGGCTATCC CGTCTACAAT ATCGGCACCC TCCTGGCCGG CGGGGAATCC
TACTTTATGC CCCTGACGGC GGCCAACGGT TTTCTGCCGG ATCTGGGGGC CATCCCTAGC
GATGTAGCCC GCCGGGCGAA ACTCATGTTT ATCAATTATC CCAACAACCC CACCGGCGCC
GTGGCCGACC TCAAGTTCTT CCAGGAAGTC GTCGAGTTTG CCAGGAGCTA CGATTTAATT
GTCTGCCACG ACGCTGCCTA CAGCGAAATC ACCTACGACG GCTACCGCGC CCCCTCCTTC
CTGCAGGCTC CCGGTGCCAA AGAGGTGGGT ATCGAGTTTA ATTCCGTATC CAAACCCTAT
AACATGACGG GGTGGCGCCT GGGATGGGCC TGCGGCCGGG CCGACGTCAT CGAGGCTCTG
GCCCGCATCA AGTCCAACAT CGATTCTGGG GCCTTCCAGG CTGTCCAGTA TGCCGGCATC
GCCGCCCTGA CGGGACCCCA GGAGGGCCTG GCCGAAGTCC GGCGGGTTTA TCAGGAACGG
CGTGATATCA TCGTCGAAGG CTTTAACTCC CTGGGCTGGC ATCTGGAAAA GCCCAAAGCC
ACCTTCTACG TCTGGGCCCC GGTGCCCCGG GGGTATACCT CGGCCAGCTT TGCCGAGATG
GTCCTGGAAA AGGCGGGGGT CATCATCACC CCGGGGAACG GTTACGGTAA CTACGGGGAA
GGCTATTTCC GCATCGCCCT GACCATCAGC AAGGAGAGGA TGCAGGAGGC CATCGAGCGC
CTGCGCCGGG TCCTGGGGAA GGTCGAATTT TAA
 
Protein sequence
MQEARRIREL PPYLFARIEK KIAEARERGV DIISLGIGDP DMPTPSHVID KLVAEAHNPE 
NHRYPTSEGL LAFRQAVADW YQRLYGVDLD PRREVVTLIG SKEGIAHISL CYVDPGDINL
VPDPGYPVYN IGTLLAGGES YFMPLTAANG FLPDLGAIPS DVARRAKLMF INYPNNPTGA
VADLKFFQEV VEFARSYDLI VCHDAAYSEI TYDGYRAPSF LQAPGAKEVG IEFNSVSKPY
NMTGWRLGWA CGRADVIEAL ARIKSNIDSG AFQAVQYAGI AALTGPQEGL AEVRRVYQER
RDIIVEGFNS LGWHLEKPKA TFYVWAPVPR GYTSASFAEM VLEKAGVIIT PGNGYGNYGE
GYFRIALTIS KERMQEAIER LRRVLGKVEF