Gene Mboo_1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1200 
Symbol 
ID5411348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1214448 
End bp1215563 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content55% 
IMG OID640868426 
Productaminotransferase, class I and II 
Protein accessionYP_001404361 
Protein GI154150743 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.162477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0266801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAAC GGTTTTCATC GCGGGTCAGG GGCATTGAGA TCTCCGGGAT CCGGAAAATC 
TTTGAGGCTG CCGGGCCGGG ATCGATCAAT CTCGGCCTGG GTCAGCCGGA TTTTGATACG
CCGCAGCATA TCAAGGATGC TGCCATTGCG GCGATCCGGG AGGGAAAGAC AGGTTACACA
CCCAACACGG GGATACCCGA GCTCAGGGAA GCGCTCAGCA CCAAGTTCAG AAAAGAGAAC
AACGTGCAGT ACTCCACAGA TCAGATACTC GTTACTGCCG GTGCAAGTGA GGCCCTCCAT
ATTGTCATGC AGGCTCTTGT GAGCGACGGG GATCGTGTCC TTTGCGCTGA CCCGGGTTTT
GTTTCGTATG CAGCGCTTGC AACACTTGCC GGAGGCCGGC CGGTGGGTGT CCCGCTTGAT
GCAACTTTCC ATATCGATCT GGAAAAAGCA CAGGCTCTTA TGGATGGTGC CCGCCTTTTT
GTCCTCAACA CCCCGGCCAA CCCGACCGGT GCCGTGGAGA GCGCAGAGAC AATCCGGACA
CTCGTTGAAT ATGCGGGAGA TGCCGGGGTC ACCATCGTCA GTGATGAAGT ATACGAGCAT
TTTATCTATG GGAAAAAGCA CGTGAGTGCT GCACGGTTTG GCGACAATGT GATCACGATA
AATGCGGCAA GCAAGACCTA CGCGATGACC GGCTGGCGCC TCGGGTACCT TGCGGCTCCG
GCGGAAGTTG TCAGCCAGTG CCTCAAGGTG CACCAGTACT GCCAGGCATG TGCTACCTCA
ATTTCCCAGT ATGCTGCGCT TGCAGCGTAC ACCGGAGATC AGGCACCGGT GCAGCAGATG
AGGGATGAAT ACCATGCACG CCGTGACCTG CTCTGCAGGG GGCTTTCCGA TATCGGTTTC
TCGTTCCCGG TCCCGGAAGG GGCGTTTTAT GCGTTTGTGC CGATGAAACC GGCGCTGGTT
CAGAATATTA TCGAATCCGG GGTTATTCTC ACCCCTGGCT CAGCATTCGG TGCAAATGCA
CCTGATTATG CCCGGATCAG TTATGCTGCA TCGCGGGAGA ATCTTATGCA AGCTCTTGAT
AGGATCAAAA AGGCAACAGG AGAGTACCAT GGTTAA
 
Protein sequence
MMERFSSRVR GIEISGIRKI FEAAGPGSIN LGLGQPDFDT PQHIKDAAIA AIREGKTGYT 
PNTGIPELRE ALSTKFRKEN NVQYSTDQIL VTAGASEALH IVMQALVSDG DRVLCADPGF
VSYAALATLA GGRPVGVPLD ATFHIDLEKA QALMDGARLF VLNTPANPTG AVESAETIRT
LVEYAGDAGV TIVSDEVYEH FIYGKKHVSA ARFGDNVITI NAASKTYAMT GWRLGYLAAP
AEVVSQCLKV HQYCQACATS ISQYAALAAY TGDQAPVQQM RDEYHARRDL LCRGLSDIGF
SFPVPEGAFY AFVPMKPALV QNIIESGVIL TPGSAFGANA PDYARISYAA SRENLMQALD
RIKKATGEYH G