Gene Mboo_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1061 
Symbol 
ID5410897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1047960 
End bp1049270 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content52% 
IMG OID640868287 
ProductS-layer-like domain-containing protein 
Protein accessionYP_001404222 
Protein GI154150604 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.196734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACCTA CACTCATGAG AAGACATAAA GGACGGATTA TCATACTCCT CCTTGTTCTT 
GCGGCATTTC TTGTCTCGCC GGCGTTTGCG GGGACGAGAT ATTTCGAAGG AAGCCCGAAT
CTGACGGCAT ATGTTAGTGG CGCAAACCAG TTTGCACCAG GAAGTTCCAT CCAGATCCCG
GTGGTGATCA AGAACACGGG AATAAATACG TATTACGAGG TGGCATCAAA TATTGTCGAC
CGTGCGGATG TCCCCACAAC GGCGAAGTTT GTGACAGTTG CGATGGGTGC GGGAAATGCA
CCTGTAGTTA TCAAGACTGA CCCGCAGATG ATCGGTGACA TCGCAAGCCA GGACCAGCAG
ACCGCTACCT TTTCAGCTAC CGTCAATGCG GATGCAGCGG GTGGCACCTA TACCCTCCCG
CTCAACATCA CTTACCAGCA GTTTTCTCAT GTCGACCAGT ACGGGATGGA CACATTCCAG
TATTATTATG TCCCAATGAA CGTGACACTC ACCGTACCGC TGGTCATTAA ATCAGAGGTG
ATTCCTGAGG TGATTTCAGC GACCTCTGAC AACCTCGTCG CAGGAGCGGA CGGTTACGTG
AACCTGACGA TTAAAAACAT CGGGTCGTTT GACGGGACCA AGGCAACCGT CCAGATTGTC
CAGAACGATG ATAGTCCCGT CAGTCCGGTG GACAGCAACG TGTATATCGG GGATTTCCCG
GCCGGCAGCA CCGTTTCCTG CCAGTACAAG GTGGCAGTGG CAGACACGGC TCAGAACAAG
ACCTATCCCG TCGACGTTGT TGTGAACTAC CAGAACGACG AGGGTGATAT GGTACCCTCC
CAGTCCCAGA CCGTGGGCAT TGATGTAGGC AACAAGGTAA ATTTTGCCAT CCAGATCTCT
CCCATCGAGA TGAGCCCGGG AAGCAAACAC ACCATCCAGA TCGAATATCA GAATACCGGT
GATACCATGG TTTACAGCGC ACAGGCACGC ATCAGTGTAG CCGCACCCTT TACCAGTTCC
TCTGATGTCG CCTACCTGGG AGATCTCGCA CCGGGACAGA CCGCGGTTGC CACCTACCAG
ATCAGTGTTG CAAGCGATGC TACCCTCAAG GAGTACGGCC TTGATTCTGA GATCCGGTAC
AACAATGCCA TCGGCGATAC CTACGTCTCC GACCCCATGA AAGTCACCAT TGATGTACAG
AACCTCACCG GTCTTGAGGG CATCATCTCC AACCCGGTAT ATCTCTCCCT TATCGCTGCC
GTGATTATCG GCATCATTTA TGCTATCATC CATACCCGGA AGAAACACTA A
 
Protein sequence
MIPTLMRRHK GRIIILLLVL AAFLVSPAFA GTRYFEGSPN LTAYVSGANQ FAPGSSIQIP 
VVIKNTGINT YYEVASNIVD RADVPTTAKF VTVAMGAGNA PVVIKTDPQM IGDIASQDQQ
TATFSATVNA DAAGGTYTLP LNITYQQFSH VDQYGMDTFQ YYYVPMNVTL TVPLVIKSEV
IPEVISATSD NLVAGADGYV NLTIKNIGSF DGTKATVQIV QNDDSPVSPV DSNVYIGDFP
AGSTVSCQYK VAVADTAQNK TYPVDVVVNY QNDEGDMVPS QSQTVGIDVG NKVNFAIQIS
PIEMSPGSKH TIQIEYQNTG DTMVYSAQAR ISVAAPFTSS SDVAYLGDLA PGQTAVATYQ
ISVASDATLK EYGLDSEIRY NNAIGDTYVS DPMKVTIDVQ NLTGLEGIIS NPVYLSLIAA
VIIGIIYAII HTRKKH