Gene Mboo_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0223 
Symbol 
ID5411823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp210826 
End bp211992 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content60% 
IMG OID640867437 
Producttryptophan synthase subunit beta 
Protein accessionYP_001403388 
Protein GI154149770 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGTA AAGGACGATT CGGAAAATAC GGCGGGCAGT ACGTGCCGGA AACCCTGATG 
AACGCACTTA TCGAGCTTGA ATGCGCATAC CGGGACGCGA CGCACGATCC GGCATTTGCA
CAGGAACTTG CAGCATACCA GTCAGAATAC GCCGGGCGCC CCACACCGCT TACGTTCTGT
GGGAACATCT CCCGGGACCT CGGCTTTAAG GTGTATCTCA AACGGGAGGA TCTGGTCCAT
GGCGGTTCTC ACAAGCTCAA CAACACGCTG GGCCAAGCGC TTCTTGCCAA ACGGATGAGA
AAGAAACGGC TGATTGCGGA AACCGGCGCC GGCCAGCACG GGGTCGCAAC CGCAATTGCC
GGGGCCGCGC TCGGCTTTAA GGTCGAGGTC TTCATGGGCG AGGTGGATAC AAAACGCCAG
GCCCTCAACG TCTTCCGGAT GGAATTGATG GGAGCAACGG TCCACCCGGT AACCTGCGGG
ACAAAGACGC TCAAGGACGC GACAAACGAA GCGCTCCGTG ACTGGGTCGC AAACGTAAAC
GACACGCATT ATCTGATCGG ATCGGTGGTC GGACCGCACC CGTTCCCGAC AATTGTCCGC
GACTTCCAGT CCGTGATCGG GCGCGAGGCC CGCCAACAGG TCATGCGAAA GGAGGGAAAG
ATGCCGGACG CGATTGTTGC CTGCGTGGGT GGCGGCTCAA ATGCGATCGG CATCTTCCAC
CCCTTCCTTG CCGATGATGT AGAGCTCATC GGCATCGAAG CGGCCGGCAA AGGCCTCGAT
ACCCCGGAAC ACTCCGCAAC GCTCTGTGCG GGAGATCCCG GCGTGCTCCA CGGCACGCTC
TCGTACCTGC TTCAGGACAA CAACGGCCAG GTGCTTCCTA CGCACAGCGT GGCGGCAGGT
CTCGATTACC CGGGTGTCGG CCCCGAGCAC GCGATGCTCA AAGATTCGCA CCGGGTTGCA
TACTACGCGG TAAAAGACCA CGAAGTGCTT GACGCCTTCC GGTACCTTTC GCGGACCGAA
GGGATCATCC CGGCGCTCGA ATCCTCACAC GCGGTGGCGT ACGTGCTGCA GAACTGTGAC
CGGTTCGATA AGGGCGATGT GGTGATCATC AATCTTTCAG GCCGGGGCGA CAAGGATGTC
GCAGGCATTG TACCGGAGGC GGCATGA
 
Protein sequence
MGRKGRFGKY GGQYVPETLM NALIELECAY RDATHDPAFA QELAAYQSEY AGRPTPLTFC 
GNISRDLGFK VYLKREDLVH GGSHKLNNTL GQALLAKRMR KKRLIAETGA GQHGVATAIA
GAALGFKVEV FMGEVDTKRQ ALNVFRMELM GATVHPVTCG TKTLKDATNE ALRDWVANVN
DTHYLIGSVV GPHPFPTIVR DFQSVIGREA RQQVMRKEGK MPDAIVACVG GGSNAIGIFH
PFLADDVELI GIEAAGKGLD TPEHSATLCA GDPGVLHGTL SYLLQDNNGQ VLPTHSVAAG
LDYPGVGPEH AMLKDSHRVA YYAVKDHEVL DAFRYLSRTE GIIPALESSH AVAYVLQNCD
RFDKGDVVII NLSGRGDKDV AGIVPEAA