Gene Mboo_1996 details

Gene Information

Locus tag: Mboo_1996
Symbol:
ID: 5410420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2061505
End bp: 2062644
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 59%
IMG OID: 640869238
Product: aminotransferase, class I and II
Protein accession: YP_001405153
Protein GI: 154151535
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
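
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2061505
END_BP = 2062644
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons agree with the listed Gene Length and Protein Length values.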


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.974269
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCTTT CCTCTTCCCG TGCGGACTCC TTTACCGAAT CCGTGATCCG GGAGATGACC 
CGTCTTGCCA TCGCCCACAA GGCGATAAAC CTTGCCCAGG GGTTCCCCGA TTTCCCCTGC
CCGGACGAGC TCAAAAAGGC CGCCTGTGAA GCGATCAATG GTGACTACAA CCAGTACGCG
ATTACCTGGG GAGCGCAGGA TCTCCGGCAG GCGATTGCAC GGAAGGTGAA GGGCTACAAC
GGGATCGATG CAGACCCGGA GACGGAGATC ACGGTCACCT GCGGCTCCAC CGAAGCGATG
ATGGCATCCA TGATCGCACT CGTAAACCCG GGTGACGAGG TGATCGTGCC CGAACCCTTC
TACGAGAACT ACGGGCCCGA CGCGGTTATC TCCGGGGCCG TACCCCGCTA TGTCCCGCTC
GGTAACGGGC CGCTTGACGA GGAGATCTGG AAAGCAGCGT TCTCGAAAAA GACCCGGGCG
GTCATTATCA ATACCCCCAA CAACCCGACC GGAAAGGTCT TTTCAAGAAA CGAACTGCAG
TTTGTCGCCG ACCTCTGTGC CGAGCACGAC GTGATCGCGA TCACCGACGA GATTTACGAG
CACATCCTCT ATGACGGCCA CCGCCATGTC TCGATCGGGT CGCTTGCCGG CATGGAAGAC
CGCACGGTCA CCATCAACAG CCTCTCCAAG ACCTACAGCG TGACCGGCTG GCGGGTGGGG
TACACGATTG CCGATGCCCG GCTGACCGCA CGCATCCGTA AGATCCATGA TTTCCTCACG
GTCGGGGCAC CGGCACCCCT CCAGCATGCC GCGGTTGCCG CACTCGACCT TCCCTCCACC
TATTACAATG AGCTTGCCCG TGACTATGAC CGCCGGCGGA AGATCCTTTA CGATGGTCTC
AGGAAAGCGG GATTCTCCTG CCAGCTCCCT GATGGTGCGT ACTATATCTT CACGGATATT
GCGGGATTTG GGATGACCGA CGTCGCGTTC GCCCGCCACC TGATCGAATC CGTCGGTGTT
GCTGCCGTGC CGGGCAGTTC ATTCTGCCAT GAGGGTGGAG AGACGAAGAT CCGGTTTACC
TTCTCCAAAA AGGAAGAGAC TCTCCGTGAA GCGTGCCGGC GCCTCGAAAA CCTCGGATAA
 
Protein sequence
MMLSSSRADS FTESVIREMT RLAIAHKAIN LAQGFPDFPC PDELKKAACE AINGDYNQYA 
ITWGAQDLRQ AIARKVKGYN GIDADPETEI TVTCGSTEAM MASMIALVNP GDEVIVPEPF
YENYGPDAVI SGAVPRYVPL GNGPLDEEIW KAAFSKKTRA VIINTPNNPT GKVFSRNELQ
FVADLCAEHD VIAITDEIYE HILYDGHRHV SIGSLAGMED RTVTINSLSK TYSVTGWRVG
YTIADARLTA RIRKIHDFLT VGAPAPLQHA AVAALDLPST YYNELARDYD RRRKILYDGL
RKAGFSCQLP DGAYYIFTDI AGFGMTDVAF ARHLIESVGV AAVPGSSFCH EGGETKIRFT
FSKKEETLRE ACRRLENLG
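
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. As a minimal sketch, the helpers below strip the whitespace and compute the GC fraction of a nucleotide string, which could be compared against the GC content field. The names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the input contains only A, C, G, and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence, used only as sample input.
    sample = "ATGATGCTTT CCTCTTCCCG TGCGGACTCC"
    print(len(clean_sequence(sample)))
    print(round(gc_fraction(sample), 2))
```

To check the full record, paste the complete gene sequence into `sample`; the result is a fraction between 0 and 1, so multiply by 100 to compare with the percentage shown in the record.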