Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1404 |
Symbol | |
ID | 8136732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1650922 |
End bp | 1652007 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869018 |
Product | signal transduction histidine kinase, nitrogen specific, NtrB |
Protein accession | YP_003021221 |
Protein GI | 253700032 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3852] Signal transduction histidine kinase, nitrogen specific |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.0000236023 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTGAGA AACTGGAGAG TTACTACGCC AACGTGATCG ACAGCGTGGG TGACGGCGTG ATCGTCCTGG ACAACGCCGG GGCCGTGACG CTGGTCAATC CCGCCGCCGA GGAACTGGCC GGCGTTTCGC GCCGCCAGGC GATGGGCGTT CTCTTCAGCG AGATCTTCAA AGGGGAAGGA CCTCTCAACG AGATGGTCGC CAAGACCGTG GAGACAGGGA TGTCGGTATC CGACCACGAG AACATCGTGG TCAAGCGGGG GGGGAAGCTG ATCCCCGTCG GCGCCAGCAC CTCGCCGCTT TTAAGCGCGA GCGGGGAGCG CATCGGCACC ATTTTACTTC TGCGGGACCT GACCAACGTG CGCGAGCTGG AGTCGGCCGT GCGCCAGGCG GACCGCCTTT CGGCGCTGGG CGGGCTGGCA GCCGGGCTCG CGCACGAGAT CAAGAACCCA CTGGGCGGGA TCAAGGGGGC GGCGCAGCTA CTGGAACTGG AATTTCCCGA CAACGAGGAC CTGCGCGAGT ACATCAGGGT GATGCTGAAG GAGGTACAGC GGGTCAACCT CATCGTGGAG GAACTCCTGG CGCTCGCTTC GCCGGGGCGT CTGAAGCTCT CCAAGGTGAA CCTGCACCGG GTCCTTTCCG ACATCGTCTT GTTGCAGAAA AACGCCAGCG AGGGGAAAGA GGTCTACCTC CAGCAGTACT TCGATCCTAG CATCCCTCCC ATCCTGGGGG ACGAGGCGCT TTTAACCCAG CTCTTCCTGA ACCTGATCAA GAACGCGCTG GAGGCGGTGG AGGCAGGCGG CGTGGTGAAG GTGACCAGCC GGGTGCTGTC GGACTACAGC ATGACCCAGC GGGGGGAGCG GCGGGCGCGC ATGGTGGCCA TCGACATAGC CGACAACGGT CCGGGTATCG AGGCCGAGGT GCTGGAGAAC ATGTTCACCC CGTTTTTCAC CACCAAGTCC CAGGGGACAG GGTTGGGTCT TGCCATCTGC CAGAAGATCG TCTCTGAGCA TCGGGGCATG ATCAAGGTGG ATTCCGACGC CAAGCGCGGC ACCGTCTTCA CCGTCATGCT GCCGCTGGTG CAATAA
|
Protein sequence | MSEKLESYYA NVIDSVGDGV IVLDNAGAVT LVNPAAEELA GVSRRQAMGV LFSEIFKGEG PLNEMVAKTV ETGMSVSDHE NIVVKRGGKL IPVGASTSPL LSASGERIGT ILLLRDLTNV RELESAVRQA DRLSALGGLA AGLAHEIKNP LGGIKGAAQL LELEFPDNED LREYIRVMLK EVQRVNLIVE ELLALASPGR LKLSKVNLHR VLSDIVLLQK NASEGKEVYL QQYFDPSIPP ILGDEALLTQ LFLNLIKNAL EAVEAGGVVK VTSRVLSDYS MTQRGERRAR MVAIDIADNG PGIEAEVLEN MFTPFFTTKS QGTGLGLAIC QKIVSEHRGM IKVDSDAKRG TVFTVMLPLV Q
|
| |