Gene Information |
Locus tag | Mpal_2251 |
Symbol | |
ID | 7272548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2398892 |
End bp | 2399947 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643570863 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_002467267 |
Protein GI | 219852835 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
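The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2398892
end_bp = 2399947

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1056 351
```

Under these assumptions the result agrees with the listed Gene Length (1056 bp) and Protein Length (351 aa).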
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.258726 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAACT CCTCTGATGA TCACTGGGAT CACCTGCGTG AGCAGATTAT CGGACTTGGT GAACATTCCA CCCGTAAGAA CTATTACCCG GAACTGCAAG CACAACTTAC AGATCTGGAA CGGTTCAGGG CCCTCCTCGA CCAGTCCAAT GATGCTATCC TGCTGATCCA ACTGCCTGTA GGGAGAATAA CAGACCTGAA CTGGTCCGCC TGCGAACAGC TTGGGTACCG GCGGGACGAG TTGCTCAATC TCCAGATCAG CGATCTGACA CCCCCCGGTA CCCCCGATCC CTCCAGACTG GTAACTGCAG ACCTGCACCC AACGCAGACC CTGACCACCT GCCTGCTCAG GCAGGATGGA AGCAAGATCT GCGTGGAGAT GACTATTCGG TTCACCAGTT TTGGCGGAAA TGACTATCTG GTGGCCGTGG CCAGGGACAT CTCCGAAAGG AATCGGATTC TGGAGGCACT GACCGAGTCG GAGAACCGGT ACAGGAGGGT CGTTCAGCAG GTCTCCGATG GGCTGATGAT CGTGGACCCG GAGAACGAGG GGATTGTGGA GACGAATATG GCACTCCAGC AGTTGCTTGG ATATACAGCT GGGGAACTGG TCCACCTCTC ACCTGTCGAT CTGGTGGCTG ACGGGTCGAC AACTCTGACC GCAGATCCGT CCGCCACAGT CCCGGTCGAG GGGGAACTGA TCCGCAAGGA TAGAAGCCGG GTTCCGGTGG AGGTCCGGAA GAACCCGATC ACCTATCCAG AAGGAAAGGT GGTCTTCTGC TGGATGATCC GGGACATCCG GGGACGCCGG GAACTCGATC GGATCAAGCG GGAGGCCCTA CAGCAGATCG AGCAGAACAT CGTGCAGTTT GCAACCCTCG GCGACCATAT CAGAAACCCG CTGGCGGTGA TCGTCGGGCT CGCAGACCTG GAGGAGGGGA TATATACCAA ACAGATTCTC AGGCAGGCAG AGGAGATCGA CAGGATCATC GACCGTCTCG ACCGAGGGTG GTTCGAATCA GAGAAGGTCA GATCCTTCAT CAAGAAGTAC TATTGA
Protein sequence | MKNSSDDHWD HLREQIIGLG EHSTRKNYYP ELQAQLTDLE RFRALLDQSN DAILLIQLPV GRITDLNWSA CEQLGYRRDE LLNLQISDLT PPGTPDPSRL VTADLHPTQT LTTCLLRQDG SKICVEMTIR FTSFGGNDYL VAVARDISER NRILEALTES ENRYRRVVQQ VSDGLMIVDP ENEGIVETNM ALQQLLGYTA GELVHLSPVD LVADGSTTLT ADPSATVPVE GELIRKDRSR VPVEVRKNPI TYPEGKVVFC WMIRDIRGRR ELDRIKREAL QQIEQNIVQF ATLGDHIRNP LAVIVGLADL EEGIYTKQIL RQAEEIDRII DRLDRGWFES EKVRSFIKKY Y
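The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the pipeline that produced this record: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits) and translates the first 30 and last 12 nucleotides of the gene sequence. The `translate` helper and the constant names are hypothetical.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces that group the sequence into blocks of ten.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 nt and last 12 nt of the gene sequence listed above.
head = translate("ATGAAGAACT CCTCTGATGA TCACTGGGAT")
tail = translate("AAGTACTATTGA")

print(head)  # prints: MKNSSDDHWD
print(tail)  # prints: KYY*
```

The first ten residues match the start of the listed protein sequence, and the final codon TGA is a stop, consistent with the protein ending in `KYY`.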