Gene Mboo_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0467 
Symbol 
ID5411102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp450708 
End bp452003 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content52% 
IMG OID640867679 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001403628 
Protein GI154150010 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.466167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.00940458 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGGACCA AAGCCGATCC AACTCCATTC GATCAGAGGC TCTTTCTTAT CTTCATTGTC 
ACGTTTGCCT CGATGACGGC ATTCGAGTTT ATCGGCCAGT ACCTCTATCC GTACGAGCCG
GACTGGCGCT CGAACCTGAT TATCAGCCTC TTTGCAAGTG GACTTGCGGT GATGCTTGCA
TACTTCCTCC TCAGCCCGTA CTATGAGAAT GTCAGTATCC TTGCAGAAGA AATAAAGAAC
CGGCACAGGG TGGAACGGGA CCTGCGGGAG CGTGAGGAGC GCCTCCGACG GACTTTTGAC
CAGTCACCAG TAGGAGCCGC ACTCTGTTCT CCCGATCTCC GGTTCATACG GGTAAATAAT
GCCCTCTGCA CAATTTCGGG ATATACCCCG GAAGAGCTCC TTACCCTGTC GATCCCTGCC
ATAACGGTTC AAAAAGAAAG TGCAGATCTT GCCGGTTGCG CTGAGGCCCT TATGTGTGGC
AGGGCTGATC TCGATGAGCG CGACCTGCAT CTGGTAAAAA AGAATGGCGA CAGGATATGG
GTGCGTCTTT CGGTCAGTCT TGTCCGGGAT GTGGAAGGTG CTCCACTCTA TTTCATCCCG
ATGTTTGTGG ACATTCATGA CCGCAAACTT GCGGAGGACG CTCTCCAGAA GACCAACAGG
AAACTCTCGA TACTCTCCTC CGTCACCCGC CATGACATAA AAAACCAGCT TACCGGCCTT
GGCGTTTATC TGCAAATGGT AAGAAACGAG GTCCCGGACA ACCCCATCCT GCAGGGATAC
ATCAGCAAAC TGGTGGCCTG CAGTGAAGCC ATCGACCGCC AGATCGAGTT CACGCGCTAT
TACGAGGAAC TGGGGACAGC AAATGCAGGA TGGTTCGATG TATACCAGGG GATACTTGAC
CAGGCTATCC AGCTCCCCCT CGAGGGAGTC ATGCTTGACC CGGGGAAAAA AGGTATCTAT
ATTTTCACCG ACCCGCTGAT AGGCAAGGTC TATTACAACC TCATCGAGAA CTCCATCAGG
CATGGCGTGA ATGTGAAAAC GATCATCTTT GATGCACATG AGACAAATAA CGGCCTTGTG
ATCCGGTACA CTGATGATGG CATCGGGATC CCGGACGCGG AGAAAGAAAA GATTTTCCTC
AAGGGATATG GGAAAAACAC CGGCCTTGGC CTTTTTCTGA TCCGCGAGAT CCTCGCAACT
ACCGGAATAA GCATTGCCGA GACAGGCATG ACCGGGAAGG GTGCACAATT CGAGATTCTT
GTCCCCCGGG GGGAATACCG GTTATCCCCG AAGTAA
 
Protein sequence
MGTKADPTPF DQRLFLIFIV TFASMTAFEF IGQYLYPYEP DWRSNLIISL FASGLAVMLA 
YFLLSPYYEN VSILAEEIKN RHRVERDLRE REERLRRTFD QSPVGAALCS PDLRFIRVNN
ALCTISGYTP EELLTLSIPA ITVQKESADL AGCAEALMCG RADLDERDLH LVKKNGDRIW
VRLSVSLVRD VEGAPLYFIP MFVDIHDRKL AEDALQKTNR KLSILSSVTR HDIKNQLTGL
GVYLQMVRNE VPDNPILQGY ISKLVACSEA IDRQIEFTRY YEELGTANAG WFDVYQGILD
QAIQLPLEGV MLDPGKKGIY IFTDPLIGKV YYNLIENSIR HGVNVKTIIF DAHETNNGLV
IRYTDDGIGI PDAEKEKIFL KGYGKNTGLG LFLIREILAT TGISIAETGM TGKGAQFEIL
VPRGEYRLSP K