Gene Mthe_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0801 
Symbol 
ID4461988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp858657 
End bp859811 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content56% 
IMG OID639699817 
Productaminotransferase, class I and II 
Protein accessionYP_843230 
Protein GI116754112 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGCTG ATCGGATAAA ATCCCTTCCT CCCTATCTGT TCGCCGAGAT CGATGGAATC 
AAGAAGAGAG CAAGAGATAG TGGCGTTGAC GTCATAGATC TGAGCGTAGG GGACCCGGAC
ATACCAACGC CTGAGCATAT AGTTAAAGAG ATGTGCGAGG CTGTGAAGAG ACCTGCAAAC
CATCAGTACC CATCATATGA GGGAAAGATC GAGTTCAGAG AGGCAGTGGC TGAGTGGTAC
CGTGATCTCT TCGGTGTCGA TCTAGATCCA TCCACAGAGA TACTCACCCT CATAGGATCC
AAGGAGGGGC TTGCGCATGC CCCGCTGGCG TTCGTAAATC CGGGGGAGAT AGTGCTCGTC
CCGGATCCGG CGTATACAGT TTACAGCACA GCGGTGATGT TTGCTGGTGG CATCCCTGAG
AGGATGCCTC TCCTGAAGAA GAACTCGTTT CTTCCCGATC TCGGGAGCAT AAGGGCGAGG
CTTGAGCAGG ATCCTGACTG GAGGCCCAGG CTGATCTTTC TGAACTACCC GAACAATCCG
ACCGGCGCTG TCGCTGGAAT TGATTTCTTC AGAGAGCTCG TGGATCTGGC CCGCGAGTAC
GGGATTCTTG TGATGCACGA TAACCCCTAC TCCGAGATAT ACTTCGACGG GCGGCCACCG
AGCATACTCC AGGTACCCGG GGCGAGGGAT GTGGCTGTCG AGTTTCATTC ATTATCCAAG
ACTTACAACA TGACCGGCTG GCGCATAGGC ATGGTCTCAG GGAGCTCCAG GATCATCGAG
GGCATCGGAA AGGTCAAGTC GAACATAGAC TCCGGGAACT TCGGCGCGGT ACAGGATGCC
GGGATCGCTG CGCTCAGAAG CCCGCCGGAG GTTGTTGATG GGTTGAGGGC TGTGTACAGG
GAGAGGATCG AGATCCTGCA CTCGGCGCTC TGCGATATCG GTCTTGAGCT CTCGAAGCCT
AAGGCGACGT TCTACCTCTG GGCGTGGACC GGCGGGGACT CCAGGGAGTA CGCCAAGATG
CTGCTCGAGA AGACCGGTAT AGTCGTGACA CCAGGCGTCG GGTTTGGCGA ACATGGCGAA
GGTTACATAA GGCTCTCTGT GACCCAGCCC ACAGAGAGGA TCGAGATGGC CGCTGAAAGG
CTGAGGAATC TCTGA
 
Protein sequence
MYADRIKSLP PYLFAEIDGI KKRARDSGVD VIDLSVGDPD IPTPEHIVKE MCEAVKRPAN 
HQYPSYEGKI EFREAVAEWY RDLFGVDLDP STEILTLIGS KEGLAHAPLA FVNPGEIVLV
PDPAYTVYST AVMFAGGIPE RMPLLKKNSF LPDLGSIRAR LEQDPDWRPR LIFLNYPNNP
TGAVAGIDFF RELVDLAREY GILVMHDNPY SEIYFDGRPP SILQVPGARD VAVEFHSLSK
TYNMTGWRIG MVSGSSRIIE GIGKVKSNID SGNFGAVQDA GIAALRSPPE VVDGLRAVYR
ERIEILHSAL CDIGLELSKP KATFYLWAWT GGDSREYAKM LLEKTGIVVT PGVGFGEHGE
GYIRLSVTQP TERIEMAAER LRNL