Gene Mpal_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2151 
Symbol 
ID7270234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2284943 
End bp2285992 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content60% 
IMG OID643570765 
Productaminotransferase class I and II 
Protein accessionYP_002467172 
Protein GI219852740 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.518477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTTC TGCATATCTA TCAAGGAGAT AATGGCAGGG TGGAGATGAA ACTGGGGAAG 
AGAAGGCAGT ACCGGCGAGC AGTCCATGGG GGTGTGCTTC CCGATCAGTC CCTTCCGGGA
GAGACGATCA TCGACTTCAG CGCAAGTATA AATCCGTTTC CCCCGGAGGT GGCCTGGGAT
CCAGCATCGG TTCCTGTTCA CCGGTATCCT GATAACCGGT ACTCCGCCCT TAAGGCGGTG
ATTGCAGAGA CCTTCCACCG GGACCCGGCA GAGGTGACCG TCGGAAACGG CTCTGCCGAA
CTGATGCGGG TCTTCTGTCA GGTGGCACTC AGTCCGGGGG ATTGCGTCAG GATCGACCGG
TCCACGTTCG AAGAGTATGC GGTCTCTGCC GAAATCGCCG GCGCCATCGT CGACGAACAC
GCCAAAAACC CTGTCGTTCG GTTCCTCTGC AACCCGAACA ACCCGACCGG GATGCTGGCC
CCAAAGAGTA CGATGCTCGA TCATCTCGAT CACTGCAGCA GTGCGGGGGC GACGCTCTTC
CTCGATGAGG CCTTCATCGA TCTGGCGGCT CCGGACCAGA GCCTCGTCGA TCAGCAGAGC
CCCGATCTCT TTCTGCTCCG ATCCCTGACC AAGGCCTTCT CGGTGCCAGG ACTCCGGTTT
GGGTACGGAT TTGGGGACCC CGAACTGATC GAAGCGATGG AGGCCGTCCG CCCACCCTGG
TCGATCAATG CCTATGCAGA GCAGTTCGCC ATCGCTGCGT TCGGATCCTA TGACCTCCTG
GCGGTGTCAC GGAAGGCGAT CGCGCGGGAA CGGGAGTTCC TCTGTTCCGG TTTGGATGAT
CTCGGGATCG CTTATTGCCC TTCATCGGTC AACTACCTGC TGCTCGACCC TGGGGTACCG
GCACCCGGAC TGACCCGGTC CCTGCTTGCC CATGGCATCC TGGTGCGCGA CTGCACCTCA
TTTCATCTGC CTGATTCGGT TCGGATCGCC GTCCTCACCC GGGATGAGAA CATACGGCTC
CTCGCAGCGC TGAACGCATG CTTGGTCTGA
 
Protein sequence
MPVLHIYQGD NGRVEMKLGK RRQYRRAVHG GVLPDQSLPG ETIIDFSASI NPFPPEVAWD 
PASVPVHRYP DNRYSALKAV IAETFHRDPA EVTVGNGSAE LMRVFCQVAL SPGDCVRIDR
STFEEYAVSA EIAGAIVDEH AKNPVVRFLC NPNNPTGMLA PKSTMLDHLD HCSSAGATLF
LDEAFIDLAA PDQSLVDQQS PDLFLLRSLT KAFSVPGLRF GYGFGDPELI EAMEAVRPPW
SINAYAEQFA IAAFGSYDLL AVSRKAIARE REFLCSGLDD LGIAYCPSSV NYLLLDPGVP
APGLTRSLLA HGILVRDCTS FHLPDSVRIA VLTRDENIRL LAALNACLV