Gene Mboo_1272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1272 
Symbol 
ID5410292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1295956 
End bp1297464 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content59% 
IMG OID640868500 
Producthypothetical protein 
Protein accessionYP_001404433 
Protein GI154150815 
COG category[K] Transcription
[S] Function unknown 
COG ID[COG1900] Uncharacterized conserved protein
[COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.573905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.202624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAAAT CCATCGGGCA GATCAACGAG CGGATCCGCG ACGGCAGTGC CCATGTTGTC 
ACCGCGGAGG AGATGCCCCG CATCGTTGAC GAGCTGGGCG AGGAAGGGGC GCTCAGGGAA
GTTGACGTTG TCACCACCGG AACGTTTGGC GCGATGTGTT CGTCCGGCGC TTTTTTGAAC
TTCGGGCACT CGGAGCCACC GATCCGTATG GAGCGGATCT GGCTCAATGA TGTAGAAGCA
TACGGGGGCC TTGCCGCGGT GGACACCTAC ATTGGGGCAA CCCAGCAGTC AGAGACACTT
GACGAGCGGT ACGGCGGAGC GCACGTCATC GAAGACCTGG TATCGGGAAA ATCAGTTGAA
CTGCGGGCAA GCTCCCGGGG TACCGACTGC TACCCGCGCA GGACGATTAC CACCGAGCTC
CTGCTGGAAA ACCTCAACCA GGCGATCATG TGCAACCCGC GCAATGCCTA CCAGCGGTAC
AATGCGGCAA CCAATACCAC CGACAGGCTC CTGCATACCT ACATGGGAAC GCTCCTTCCC
AATACCGGCA ATATCACGTA CTCGGGCGCC GGGCTGCTCA ACCCGCTCAC CAACGATCCC
CACCTGCGGT TGATTGGAAG CGGTGTCCCA ATCTTCCTGT GTGGTGCGAA GGGTATGGTT
GTTGGGGAGG GGACCCAGCA TTCACCCGGT GGCGGTTTTG GCACGCTGAT GGTGACCGGC
AATCTCAAGG AAATGTCTCC CGAGTACCTC CGTGCTGCCA CCATGACCGG CTACGGTGTT
ACGCTCTACG TTGGAATTGG CGTACCGCTA CCGGTGCTTG ACCTTGACGT GGTCCGGGCA
ACTGCCGTCC GGGACGAGGA CATCCCGGTT GACCTGCTTG ACTATGGCGT GCCCAGCCGC
GCCCGCCCGA AGGTCAGGAG CGTCACCTAT GCCGAGCTCC GGTCCGGTTC CGTGGAGATC
AACGGGGAGC AGGTGCGCAC CTCGTCCCTG TCGAGTTTCC GCAGGGCGCG GCAGGTAGCA
GCAGAGCTCA AAAACTGGAT AGGAAAAGGA AAGATGGAGC TTGCGCTTCC CACCCGCCCC
ATCGATGCCA CAAAAAAAGT GCACCCCATG CATGAGACCA CGAGCGGCCC GCGTGTGCTC
GACATCATGG ACCGACAGGT GGTCAGTGTG AGCGAGGGTG AGGAGATCCA GACTGCAGCG
CAGAAACTGC TCAAAGGGGA GACTAACCAC CTCCCGGTCA TCGGCCGTGA TGGCAGGCTT
GCCGGGATCA TAACCACTTT TGATATCTCT AAAGCCGTGG CAAACCCCGG CAAGGCATCG
ACCGTGGGCG ATATTATGAA AAAGAAGGTG GTGACCACGA CAACTGATGA GGCGGTGGAT
GTTGCCGTAC GAAAGCTCGA ACAGAACAAC ATCAGTGCGC TGCCCGTCCT TGATGCCGAT
CGCCATGTGA TCGGGATGCT CACGGCAATA AATCTCGGGA AGCTCTTTGG CGGAAGGTGG
CTGAAATGA
 
Protein sequence
MHKSIGQINE RIRDGSAHVV TAEEMPRIVD ELGEEGALRE VDVVTTGTFG AMCSSGAFLN 
FGHSEPPIRM ERIWLNDVEA YGGLAAVDTY IGATQQSETL DERYGGAHVI EDLVSGKSVE
LRASSRGTDC YPRRTITTEL LLENLNQAIM CNPRNAYQRY NAATNTTDRL LHTYMGTLLP
NTGNITYSGA GLLNPLTNDP HLRLIGSGVP IFLCGAKGMV VGEGTQHSPG GGFGTLMVTG
NLKEMSPEYL RAATMTGYGV TLYVGIGVPL PVLDLDVVRA TAVRDEDIPV DLLDYGVPSR
ARPKVRSVTY AELRSGSVEI NGEQVRTSSL SSFRRARQVA AELKNWIGKG KMELALPTRP
IDATKKVHPM HETTSGPRVL DIMDRQVVSV SEGEEIQTAA QKLLKGETNH LPVIGRDGRL
AGIITTFDIS KAVANPGKAS TVGDIMKKKV VTTTTDEAVD VAVRKLEQNN ISALPVLDAD
RHVIGMLTAI NLGKLFGGRW LK