Gene Mpal_2251 details

Gene Information

Locus tag: Mpal_2251
Symbol:
ID: 7272548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2398892
End bp: 2399947
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 57%
IMG OID: 643570863
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_002467267
Protein GI: 219852835
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID: [TIGR00229] PAS domain S-box
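
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2398892
END_BP = 2399947

# Assumption: exactly one terminal stop codon that is not counted in the protein.
STOP_CODONS = 1


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - STOP_CODONS


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1056, matches "Gene Length"
print(protein_length(length_bp))  # 351, matches "Protein Length"
```

Both results agree with the record: 1056 bp gives 352 codons, one of which is the stop codon.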


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.258726
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACT CCTCTGATGA TCACTGGGAT CACCTGCGTG AGCAGATTAT CGGACTTGGT 
GAACATTCCA CCCGTAAGAA CTATTACCCG GAACTGCAAG CACAACTTAC AGATCTGGAA
CGGTTCAGGG CCCTCCTCGA CCAGTCCAAT GATGCTATCC TGCTGATCCA ACTGCCTGTA
GGGAGAATAA CAGACCTGAA CTGGTCCGCC TGCGAACAGC TTGGGTACCG GCGGGACGAG
TTGCTCAATC TCCAGATCAG CGATCTGACA CCCCCCGGTA CCCCCGATCC CTCCAGACTG
GTAACTGCAG ACCTGCACCC AACGCAGACC CTGACCACCT GCCTGCTCAG GCAGGATGGA
AGCAAGATCT GCGTGGAGAT GACTATTCGG TTCACCAGTT TTGGCGGAAA TGACTATCTG
GTGGCCGTGG CCAGGGACAT CTCCGAAAGG AATCGGATTC TGGAGGCACT GACCGAGTCG
GAGAACCGGT ACAGGAGGGT CGTTCAGCAG GTCTCCGATG GGCTGATGAT CGTGGACCCG
GAGAACGAGG GGATTGTGGA GACGAATATG GCACTCCAGC AGTTGCTTGG ATATACAGCT
GGGGAACTGG TCCACCTCTC ACCTGTCGAT CTGGTGGCTG ACGGGTCGAC AACTCTGACC
GCAGATCCGT CCGCCACAGT CCCGGTCGAG GGGGAACTGA TCCGCAAGGA TAGAAGCCGG
GTTCCGGTGG AGGTCCGGAA GAACCCGATC ACCTATCCAG AAGGAAAGGT GGTCTTCTGC
TGGATGATCC GGGACATCCG GGGACGCCGG GAACTCGATC GGATCAAGCG GGAGGCCCTA
CAGCAGATCG AGCAGAACAT CGTGCAGTTT GCAACCCTCG GCGACCATAT CAGAAACCCG
CTGGCGGTGA TCGTCGGGCT CGCAGACCTG GAGGAGGGGA TATATACCAA ACAGATTCTC
AGGCAGGCAG AGGAGATCGA CAGGATCATC GACCGTCTCG ACCGAGGGTG GTTCGAATCA
GAGAAGGTCA GATCCTTCAT CAAGAAGTAC TATTGA
 
Protein sequence
MKNSSDDHWD HLREQIIGLG EHSTRKNYYP ELQAQLTDLE RFRALLDQSN DAILLIQLPV 
GRITDLNWSA CEQLGYRRDE LLNLQISDLT PPGTPDPSRL VTADLHPTQT LTTCLLRQDG
SKICVEMTIR FTSFGGNDYL VAVARDISER NRILEALTES ENRYRRVVQQ VSDGLMIVDP
ENEGIVETNM ALQQLLGYTA GELVHLSPVD LVADGSTTLT ADPSATVPVE GELIRKDRSR
VPVEVRKNPI TYPEGKVVFC WMIRDIRGRR ELDRIKREAL QQIEQNIVQF ATLGDHIRNP
LAVIVGLADL EEGIYTKQIL RQAEEIDRII DRLDRGWFES EKVRSFIKKY Y
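
The sequences above are laid out in space-separated blocks of 10 with 60 residues per line, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation that could be compared against the "GC content" field; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGAACT CCTCTGATGA TCACTGGGAT CACCTGCGTG AGCAGATTAT CGGACTTGGT"

print(len(clean_sequence(first_line)))  # 60 bases on a full line
print(round(gc_fraction(first_line), 3))
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage to compare with the reported GC content; `clean_sequence` works the same way on the protein sequence.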