Gene Mpal_0003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0003 
Symbol 
ID7270115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2211 
End bp3377 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID643568662 
Productaminotransferase class I and II 
Protein accessionYP_002465122 
Protein GI219850690 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00871395 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGGATC GGATATCTGA ACGGGCCCGG GTGATCCCAC CGTCAGGGAT CAGGAAGTTC 
TTTGATATCG TCCAGACGAT GGACGAAGTG ATCTCACTCG GTGTCGGCGA GCCGGACTTT
GTGACCCCCT GGAACATTTG TGAGGCCGCG ATCTACTCGA TCGAGCAGGG GAAGACCTCG
TACACCTCGA ACCGGGGGCT TCAGAACCTC CGCGAGGCCC TTGCGGCACG GATGGAGCGG
GTATATGCCC TCCAGTACCA CCCTGACCGG GAGATGATTA TCACCACCGG CGTGAGCGAA
GGGGTTGATA TCGCGGTCCG CGCGATCGTC GACCCGGGTG ACGAGGTGTT GATTGCAGAG
CCGAGCTATG TTTCGTACGC TCCCACAGTG ACCCTGACCG GTGGGGTTCC GATATCAGTG
GAGTGTCGAG AGGCAGATCG GTTCAAGCTG AACCCCGATA CCCTCGCCGA AGCGATAACA
CCGAAGTCCA AGGCCCTGAT CATCAACTTC CCGACCAACC CGACCGGAGC CGTGATGACC
AGGTCCGACT ACCGGGAGAT TGCCGATCTG ATCACCGACC ATGACCTTAT CCTGATCAGC
GACGAGGTCT ATGCCGAGCT GACCTATGAA GGGACGCATG TCCCTGCGGC CACGGTCGGT
GACCTCTGGG AGCGGACGAT CACGCTGAAC GGTTTCTCCA AGGCCTACGC GATGACTGGC
TGGCGGCTCG GGTACCTCTG CGCTCCAGAA GATCTCTGCG ATGCAGCCTT GAAGATCCAC
CAGTATGTGA TGCTCTGTGC TCCGATTATG GCCCAGATGG CGGCGAATGA GGCTATTCGA
TCTGCAGAGG AGGAGAAGGA CGCGATGATC AAAGAGTACC GGCAGCGGCG GAACCTCTTC
GTCGAGGGGT TGAATCATAT CGGCCTCCAC TGCCATCTGC CGGAGGGTGC GTTCTATGCG
TTCCCGTCTA TTGCCTCCAC CGGCCTTTCG GACGAGGACT TCGCCGAGCA GTTGCTGCAT
GAGCAGCATG TGGCGGTCGT CCCGGGATCG GTCTTCGGGG CTGGCGGAGT TAACCATATC
CGCTGCGCCT ATGCGGTCTC ACGGCCGGAC CTGACCGAGG CGGTCAGACG GATCGGTCTC
TTCATCGCTG ACCACCAGCG GGCTTGA
 
Protein sequence
MKDRISERAR VIPPSGIRKF FDIVQTMDEV ISLGVGEPDF VTPWNICEAA IYSIEQGKTS 
YTSNRGLQNL REALAARMER VYALQYHPDR EMIITTGVSE GVDIAVRAIV DPGDEVLIAE
PSYVSYAPTV TLTGGVPISV ECREADRFKL NPDTLAEAIT PKSKALIINF PTNPTGAVMT
RSDYREIADL ITDHDLILIS DEVYAELTYE GTHVPAATVG DLWERTITLN GFSKAYAMTG
WRLGYLCAPE DLCDAALKIH QYVMLCAPIM AQMAANEAIR SAEEEKDAMI KEYRQRRNLF
VEGLNHIGLH CHLPEGAFYA FPSIASTGLS DEDFAEQLLH EQHVAVVPGS VFGAGGVNHI
RCAYAVSRPD LTEAVRRIGL FIADHQRA