Gene Mboo_1552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1552 
Symbol 
ID5411239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1621848 
End bp1622984 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID640868786 
ProductCBS domain-containing protein 
Protein accessionYP_001404712 
Protein GI154151094 
COG category[K] Transcription
[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.651546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCT CCTATAAGAT TGGACGGCTC TTTGGCATTC CTGTTCTGAT CCATTTTTCG 
TTTCTGATTG TTATCCCCCT TTTAGCGTGG ATCATCGGAA TCCAGATCGC ACTTACCACA
ACTACGATCC AGTATATTTT CCAGGTTCCC ATAGATACCA CCCTGATCAC GGCGGGATTC
ATGCCATGGA TCCTCGGCAC CATCGTCGCA CTCGGGCTTT TTTTGGGAGT TCTGGTCCAC
GAAATCGCCC ATTGCATTGT TGCCAGGAAA AAGGGTGTGA GGATCCAGAG TATCACCCTG
CTCATGTTCG GTGGTGTCTC CCGGATGGAG GAAGAAGGAG TACCGGATCC GAAAGTTGAG
CTCCCCATGG CACTGGTCGG GCCATTCACC AGCCTTCTCT TCGGTCTCGT ATGCGCCGGT
CTCGTCTACC TTGTCCCGGG AATGACCCCT TATCCCGCGA TCGCCGGCAT CCTCATTTTT
ATCTTCGGGT ACGTTGCCGT GCTTAACATC CTTCTCTTTG CATTTAACCT GATCCCGGCA
TTTCCCATGG ATGGCGGCAG GGTACTCCGG GCGGCACTGG CACAACGAAT GCCCGTCCAC
AAGGCGACCC GGATTGCTGC CAATATTGGT AAGGGGTTTG CCATTATTTT TGGCATCATC
GGCCTGTTAT TCTTCAACCC GTTCCTGATC CTTATCGCAC TTTTTGTCTA CATCGGGGCA
GGCTCGGAAG CAACCATGGA CCAGTTCACC TACCTGTTGC ACAACGTTAC GGCAGAGAGC
ACGATGAGTT CTCCGGTAAC GTCCGTGACA CCGGCCCTCT CACTTTCAAA GGTGGCAGAG
ATGATGCTTT CCACCAAGCA CCTAGGTTTC CCGGTTGTCG AACATGACAA ACTTGTAGGA
ATGATCACGC TTGTGGATGT GAACCGTATC TCACCGGCCG ATCGCGAGGC AAAGCAGGTC
CGCGACATCA TGACCCGCGA TCCGGTTACC CTTCCGCCAT CTGCACCGGT CATGGACGCC
TTAAGAATCA TGTCAGCCCG CAATATCGGG AGGATCCCCA TAGCTCAGGA CGGCAGGATC
ATCGGTATTG TCACCCGTTC CGATATCCTG AAAGTGGCCG AGCTCAAAAA GGCATGA
 
Protein sequence
MNGSYKIGRL FGIPVLIHFS FLIVIPLLAW IIGIQIALTT TTIQYIFQVP IDTTLITAGF 
MPWILGTIVA LGLFLGVLVH EIAHCIVARK KGVRIQSITL LMFGGVSRME EEGVPDPKVE
LPMALVGPFT SLLFGLVCAG LVYLVPGMTP YPAIAGILIF IFGYVAVLNI LLFAFNLIPA
FPMDGGRVLR AALAQRMPVH KATRIAANIG KGFAIIFGII GLLFFNPFLI LIALFVYIGA
GSEATMDQFT YLLHNVTAES TMSSPVTSVT PALSLSKVAE MMLSTKHLGF PVVEHDKLVG
MITLVDVNRI SPADREAKQV RDIMTRDPVT LPPSAPVMDA LRIMSARNIG RIPIAQDGRI
IGIVTRSDIL KVAELKKA