Gene Msed_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1684 
Symbol 
ID5105330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1623126 
End bp1624316 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content46% 
IMG OID640507578 
Productaspartate aminotransferase 
Protein accessionYP_001191763 
Protein GI146304447 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.121725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0684909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATT TCGCTGGTTC GCTTTCAAGA TTAACTGGAG AGTCAACCCT CCTTTATCAG 
GAAATAGCCA GAAACGTGGA GAGGACCAAG GGAATAAAGA CCATCAACTT CGGCATAGGC
CAACCTGATC TACCAACTCC TCAGAGAATA AGGGAAGAGG CTAAACTGGC ACTGGATAAG
GGTTTCACCG CATACACCCC AGCACTGGGG TTAGACGAGT TAAGATCCAA GATAGCGGAA
TTTCTTACCC AGAGATACGG GGATCAAATA AGGAAGGAAG AGGTAGCTGT TACGCCTGGG
GCTAAGACTG CACTCTTCCT GGCATTTCTC ATGTATGTTA ACCCTGGCGA TGAGGTAATC
CTATTCGATC CTTCCTTCTA CTCCTATGCA GAAGTGGTTA ACCTCCTAGG GGGAAAACCA
GTTTATGTCC CTCTCAGCTT TGACGAGAAC TCGGGTTTCA GTGTCGACAT GGATGAGTTA
GTTTCCAAGA TCACTCCCAG AACCAAGATG ATAGTTTATA ATAACCCTCA TAACCCGACT
GGAATGAACT TCAACGATAA ACTTGCGAGG GAACTAGTTG AGATAGCTAG GGAGAAGAGG
TTAATCCTTC TTTCAGACGA AATTTACGAC TACTTCGTCT ATGACGGCGG TTTCAGGAGC
GTGCTTCAGG AAGCGTGGAG AGATAACGTG ATTTACATAA ACGGGTTCAG CAAGACCTTC
AGCATGACAG GTTGGAGACT GGGATATATT GTTGCCAAAA GGGAGGTCAT TAACAAGGTT
GGCATATTAG CGTCCAATAT ATACACATGT CCCACTAGCT TTGCCCAAAG GGGTGCATTA
GCCTCCTTTG ATACCTTCGA CGAGGTGAGA CGCATGATAG ACCTGTTTAA GAGAAGGAGA
GACGTTATGT TCTCGGAGCT TAAGTCCCTC AAGGGAATTA GGGTTTACAA GTCGTCTGGG
GCATTCTACA TGTTTCCAGA CGTTAGCGAA ATCCTGAAGA CGACTGGAAT GGATTCAAAG
GCCTTAGCGG TGAAAATAAT TGAGGAGGGG GGTGTGGTAA CTATCCCAGG TGAGGTGTTT
CCAGAAAAGG TTGGGAGAAA CTTCCTGAGA TTGAGCTTTG CCCTAGATGA GGAGAAAATT
AAAGAGGGCG TGTCAAGGAT GAAAATGGCA CTGGAGAAAC TCACAGGTTG A
 
Protein sequence
MENFAGSLSR LTGESTLLYQ EIARNVERTK GIKTINFGIG QPDLPTPQRI REEAKLALDK 
GFTAYTPALG LDELRSKIAE FLTQRYGDQI RKEEVAVTPG AKTALFLAFL MYVNPGDEVI
LFDPSFYSYA EVVNLLGGKP VYVPLSFDEN SGFSVDMDEL VSKITPRTKM IVYNNPHNPT
GMNFNDKLAR ELVEIAREKR LILLSDEIYD YFVYDGGFRS VLQEAWRDNV IYINGFSKTF
SMTGWRLGYI VAKREVINKV GILASNIYTC PTSFAQRGAL ASFDTFDEVR RMIDLFKRRR
DVMFSELKSL KGIRVYKSSG AFYMFPDVSE ILKTTGMDSK ALAVKIIEEG GVVTIPGEVF
PEKVGRNFLR LSFALDEEKI KEGVSRMKMA LEKLTG