Gene Mboo_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1646 
Symbol 
ID5411191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1723097 
End bp1724287 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content53% 
IMG OID640868880 
Producthypothetical protein 
Protein accessionYP_001404806 
Protein GI154151188 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000328071 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGGATA TCTTCTTTGA TCTTTCGGTC CGGAGTGTCC GGCTCAACTT CCTGCGTTCC 
CTTCTTGCTT CTATTGGGAT CGTGATCGGG GTAGTTGCCA TCTCATCCAT GGGGATGCTC
GGAACCAACA TGCAGCTCTC GGTAAAAGAC CAGCTCTCGG CAGATGTCAA TACGGTCATG
CTCACTTCTG ATGTCGTGAA AGTAAGCAGT GGTCTCGGGG CTGCCCCGGT TTCAAGTACA
ATTGATGAAA GCACCCTCAA TGATATCAAA GGCTCCGCAG GACAGAATGA TGTAGTGCCG
ATCCATCATA CGAGTACGCA GTTTAGCGTT GGTGACCAGA ATGGTCGCGG ATCGATCTAT
GGGCTCGACC CCAGTGATGT CCCAAAATTC CTTACCATCG CACAGGGATC AAATATCCGG
GGCAGTGATG TCCTTGTGGG CGCAACCATT GCCAGCAATT TTAACCTTGT GGTTGGAAGC
ACGATCAAGA TCGGTGAGAA TTGTGCCACC GTTTCACGGC CGGTGGTCAG GGTTGCCGGG
ATTCTCCAGG CCCGGGGCAT TGCGGCCGAT GGCGTAAACG TGGACAATGG GATTGTGGTA
GACGATACCT GGTACACAGA TCACTTCGGA GGCCTGGACC AGTACGACCA GGTCAACGTG
ATAGTAGCTG ATGTTGATAC CATCAACCAG ACCGAAGCGG CGATCAACGC AAAGGTCAAC
CGGAACAGCG ATGTGGTCAG GGTCTCCGAT GCCAGTTCCC GGTTGTCGAC CATTTCAAGT
ACACTTGGTA CGATTACCAC GTTTATCATG GCGATCGGCG GTATTTCGCT TGTGGTCGCT
GCGGTAAGTA TCTTTAACGT CATGATGATG TCGGTCAAGG AACGGGTCCA GGAGATCGGG
ATTCTTCTTT CGATCGGAAC GGAGAAAGGT GAAGTCCGCA GGATGTTCCT GTACGAGGCA
TTAATCCTCG GTATCATTGG TGCGGTGGTG GGCGGAATCA TGAGCTTTAT CATCGGCTAC
TCGGTCGTGA GTGCCATGAT CGGCTCCACC CAGTACTTCT TTACCCCGGA CAGCCTGATC
TTCATCCCGT ACGGGATGAT CATCGGTGTG GTTGTCTGCG TGGCATCCGG TATGTATCCC
GCGTGGGCGG CTTCGAATAT GGACCCGATA GACGCCCTTC GGGCTGATTA A
 
Protein sequence
MKDIFFDLSV RSVRLNFLRS LLASIGIVIG VVAISSMGML GTNMQLSVKD QLSADVNTVM 
LTSDVVKVSS GLGAAPVSST IDESTLNDIK GSAGQNDVVP IHHTSTQFSV GDQNGRGSIY
GLDPSDVPKF LTIAQGSNIR GSDVLVGATI ASNFNLVVGS TIKIGENCAT VSRPVVRVAG
ILQARGIAAD GVNVDNGIVV DDTWYTDHFG GLDQYDQVNV IVADVDTINQ TEAAINAKVN
RNSDVVRVSD ASSRLSTISS TLGTITTFIM AIGGISLVVA AVSIFNVMMM SVKERVQEIG
ILLSIGTEKG EVRRMFLYEA LILGIIGAVV GGIMSFIIGY SVVSAMIGST QYFFTPDSLI
FIPYGMIIGV VVCVASGMYP AWAASNMDPI DALRAD