Gene Mboo_2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2040 
Symbol 
ID5411167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2115925 
End bp2117157 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content56% 
IMG OID640869282 
Producthypothetical protein 
Protein accessionYP_001405197 
Protein GI154151579 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.11269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.213161 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAAG GGGACATCTT CTTTAACCTC GCAGTACGAA GCGTCCGCAT CAATTTCCTG 
CGGTCGATGC TTGCCGCGAT TGGCATTGTG ATCGGCGTTG TTGCCATCTC CTCCATGGGG
ATGCTGGGCA CCAACATGCA GCTGGAAGTA AAAGACCAGC TCTCGGCAAG CGCAAATACC
ATTGTCATTA CCCCGGATGT GGTCCGTCTG GGGCCAACGG GTTTTGTCCC GGGCTCATCG
TCATCATCCT CAACCGGGAT CGATAAAGAC GATCTGGCAA AAATCACCAT GGCCACCGGG
TCGAACGGCA CGGTAATTCC CATCTACTCG ACCAATACCG AGTTTACGAT CAATTCAGTG
GCGGGAAGGG GTTCGGTGTA CGGGCTTAAC CCGCTGGATA TCCCCAAATT CCTCTCCTTA
AACCAGTCGT ATGGCAACGG GACCACCGAT ATCGGGGCAG GCGAGGTCCT TGTCGGGGCG
GAAATCGCGC AGAACTTCAA CCTTAAGGTC GGCACCCGCA TCAGGATCGG CTCGTTTAAT
TCCGCATCCC GGCCCGAGCT CCGCATTGCC GGCGTGCTCC AGCCCCGGGG GACCGTCGCC
GACGGTGTCT CAACCGATAA CGGGATTGTG GTGAACAATA ACTGGTATAC CAACCAGTAC
GGCGGGGAGG ACGAGTGGAG CCAGGTCAAT GTGATCGTCA ACGATGTGGA CAACATCAGC
GACATTGAAT CAATGATCAG TGCGAAAGTG AACACCAATG AAAAAACGCC GGTCATCCGG
ATCCGGGACG CCACCTCGCA GCTCGCCACC GAGACCTCAG CCCTGAGTAC CGTTACGACC
TTTATCATGG CCATCGGCGG GATCTCGCTT TTGGTCGCTG CCGTGAGCAT CTTCAACGTG
ATGATGATGT CGGTCTCGGA GCGGATCCAG GAGATCGGCA TCCTTCTCTC GATTGGGACC
GAGAAAGGGG AGGTACGGAG GATGTTTTTG TACGAAGCAT TTATCCTTGG ACTCCTTGGG
GCCGGGGTGG GCGGTGCATG CAGCCTTGCC ATCGGCTATA CCGTCGTAGA AGCGATGATC
GGGACAACCG CATATTTCTT CGAGCCTGCA AGTATCCTGT ATGTCCCGGC AGCCATGCTC
ATCGGCGTAG TGGTCTGTGT CATCTCGGGC ATGTACCCGG CCTGGCGGGC GTCCAACATG
GATCCCATTG ATGCGATACG GAGCGAAGAA TAA
 
Protein sequence
MIKGDIFFNL AVRSVRINFL RSMLAAIGIV IGVVAISSMG MLGTNMQLEV KDQLSASANT 
IVITPDVVRL GPTGFVPGSS SSSSTGIDKD DLAKITMATG SNGTVIPIYS TNTEFTINSV
AGRGSVYGLN PLDIPKFLSL NQSYGNGTTD IGAGEVLVGA EIAQNFNLKV GTRIRIGSFN
SASRPELRIA GVLQPRGTVA DGVSTDNGIV VNNNWYTNQY GGEDEWSQVN VIVNDVDNIS
DIESMISAKV NTNEKTPVIR IRDATSQLAT ETSALSTVTT FIMAIGGISL LVAAVSIFNV
MMMSVSERIQ EIGILLSIGT EKGEVRRMFL YEAFILGLLG AGVGGACSLA IGYTVVEAMI
GTTAYFFEPA SILYVPAAML IGVVVCVISG MYPAWRASNM DPIDAIRSEE