Gene Mboo_2272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2272 
Symbol 
ID5411004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2341032 
End bp2342186 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content58% 
IMG OID640869524 
ProductCBS domain-containing protein 
Protein accessionYP_001405429 
Protein GI154151811 
COG category[R] General function prediction only 
COG ID[COG0517] FOG: CBS domain
[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.29511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGAT CTCTTCGAAT CGGGCGGCTT TTTGGGATCC CGATCATGCT CCACTGGACT 
TTTCTTTTGA TCATTCCGGT TTTTGCCTTC CTTATCGGCA GCCAGATCGG AGCTACCACC
GATTTGATCG GGGGGCTCTT TGGCATCGGG ATCGACTCCA CTATCATCTC CGGCGGCTAC
ATGCCCTGGA TCCTCGGGAC AATTGTTGCC CTCGGACTCT TTTTCGGAGT GTTTGTCCAC
GAGCTCGCCC ATTCGCTTGT GGCCCGGGTA AAAGGAATCC GGATGCAGAG CATCACACTC
CTGATGTTCG GGGGCGTTGC CCAGATGGAC GAGGGGGCCC CGGAACCAAG GACTGAGCTG
CCCATGGCGC TTGCCGGGCC GCTCACAAGC CTTGTCTTTG GCCTTGCCTG CTGCGGCCTT
GTGTACGTCA CCCCGGCACT CACACCTGCA CCCGCAATAC AGGGAGTGCT CATCTTTATT
TTCGGGTACG TGGGTGTGCT CAACATCATC CTCTTTGCGT TCAACCTGAT CCCCGCCTTT
CCCATGGATG GCGGCCGTGT GCTCCGGGCG GCTCTCGCAA CCCGGATGCC GCTTGATCGG
GCAACCCGGA TTGCGGCAAA CGTGGGCAAG GGATTTGCTA TCCTCTTTGG TATCGTCGGG
CTCTTTGGTA TCCCCGGGTA CATCGCTCCC TTCGATCCCT TCCTGATCCT GATCGCCCTC
TTTGTGTACC TGGGGGCAAG CCTCGAATCA TCGGCTGTGC AGTACAATGT CCTGCTCCGT
GATGTGACCG TGGGTGAGAT GATGAGCACC CCGGTGGTCT CGGTTCCCGC GAGCATGCAG
CTCGTCAAGG TTGTTGACAT GATGTACGCA AGCAAGCACC TTGGTTTTCC TGTCACCGAG
CGCGACACGC TCGTGGGCAT GGTGACTCTT GCGGACGTGA ACCGGACTTC TCCCATCGAC
CGCGAGGCAA TGCAGGTAAA AGATGTGATG ACCCGTGAGG TTGTCACCCT GCCCCCCACG
GCTTCGGTGA TCGATGCTCT CCGGATCATG TCTGCCCGGA ATATCGGCCG CATCCCGATC
CTGCAGGAAG ACCGGATTGT AGGGATCGTG ACCCGGACCG ATATCCTGAA AGTTACCGAA
TTAAAAAAGA TCTAA
 
Protein sequence
MDGSLRIGRL FGIPIMLHWT FLLIIPVFAF LIGSQIGATT DLIGGLFGIG IDSTIISGGY 
MPWILGTIVA LGLFFGVFVH ELAHSLVARV KGIRMQSITL LMFGGVAQMD EGAPEPRTEL
PMALAGPLTS LVFGLACCGL VYVTPALTPA PAIQGVLIFI FGYVGVLNII LFAFNLIPAF
PMDGGRVLRA ALATRMPLDR ATRIAANVGK GFAILFGIVG LFGIPGYIAP FDPFLILIAL
FVYLGASLES SAVQYNVLLR DVTVGEMMST PVVSVPASMQ LVKVVDMMYA SKHLGFPVTE
RDTLVGMVTL ADVNRTSPID REAMQVKDVM TREVVTLPPT ASVIDALRIM SARNIGRIPI
LQEDRIVGIV TRTDILKVTE LKKI