Gene Information |
Locus tag | GM21_1825 |
Symbol | |
ID | 8137156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2123721 |
End bp | 2125649 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869436 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003021636 |
Protein GI | 253700447 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.000986123 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTAAAAC CGCTTCGCGT GCTGATAGTG GAAGACTCGG AGGACGACGC GCTGCTGCTC GTCTTCGAGC TGCGTCGCGG CAACTATGCT CCGGTAACCA TGCGCGTGGA GAACTCCGAC TCTTTGCGCA AGGCGCTCAA GGAGGAAACC TGGGACCTGG TCATCTCCGA CTACGTCCTT CCCGGCTTCT CGGGGTTGGA CGCGCTGAAG CTGGTGCGTG CCTCGGGGAT GGACCTTCCC TTCATCATCG TTTCCGGGAA AATCGGCGAG GAAGACGCCG TAAAGGCCAT GAAGGAGGGG GCGAACGATT ACCTGATCAA GGGTAACACC TCGCGCCTCA TCCCTGCCAT CGAGCGGGAG ATGCAGGAGG CCGAGGTGAG GCGCAAGCGG CGCGAGGCCG AGTCGGCGCT GGTGAGAAGC GAGCGGCGCT ACAAGAGGCT GGTGTCGGCG GTCACCGATT ACATCTATAC GGTGATCATC CATAAAGGAG CGGTTGTTAA AACCTCGCAC GGACCGGGAT GTCTCTCGGT GACCGGCTAC AGCCACGAGG AGTACGTCGA CAACCCTTTT TTGTGGTACC AGATGATCTA CGAGGAGGAC CGGAGCGCGG TGACGAGCCT CACCGAGGAC CTGCGGGCGG GAAAAGACAT CCCTTCGCTG GAGCACCGGA TCCGGCACAA AGACGGCTCG CTGCGCTGGG TCATCAACAC CATCGTCCCA CGCTACAGCG AACAGGGCGA ACTGATCGCC TACGACGGCC TCATCTCGGA CATCTCGGAG CGAAAGCGCG CCGAGGAGTC GCTGCAGCTT CAAAGCGCGG CGTTGGAGGC GGCGGCCAAC GCCATAGTCA TCACGGACAG CAGCGGGGTC ATCATCTCGG TAAACGAGGC TTTCACCGGC ATGACCGGGT ACGGCCGCGA GGAGGCCCTA GGGCGCGATC TGAGTTTCCT GAAATCGGAG CGGCACACTT CGGAGTTTTA CCGCTGCCTC AGGGAAACCA TCAGCGCCGG CGAGGTCTGG CACGGAGAGA TGATCAACCG GCGCAAGGAC GGCACCCTCT ACCCCGAGGA GCAGACCATC ACGCCGGTTT TGGACGAAGA CGGGCGCATC AACCACTTCA TCTGCATCAA GCAGGACATC ACCGAGCGCA AGCAGGCAGA GCAGGCGCTG ATGCAAAACG CCAGCATGCT GAAGGAAATG GAGATCGCGA AGCAGATCCA GATGTCTTTG CTTCCGGTCA CCCCCCCCTG CCTCCCGGGC ATCGACTGCG CCGGGAACTG CTCCCCCGCC AACAACATCG GCGGCGACTA CTACGACATC CTCCCGCACG GAGAAGAACT CGACCTCGTC ATCGCCGACG TCTCGGGCCA CAGCGTCGGC GCCGCGCTGA TCATGGTCGA AACCCGCAGC GTACTTCGCG CGCAGCTTGC GACCTTGAAG GGGCCGGCCG AGATTGTGTC GGCCCTGAAC GAACTGCTGC ACGAGGACCT GAGTCGGGCC GAACTCTTCA TCACCATGTC GTACCTGAGT TACCACATCC CCACCGGCAC GCTCCGCTAC ACCAACGCCG GCCATCCCCC TCCCCTGCTC TACCGCCACC AAACCGACCA GTTTTTCGAG TTGGACGCCG AAGGGCTCAT CCTGGGGGTA CACCGGGAGG TGTTCTTCCA GGAGCCCTCG CTCCAGGTCC GGGACGGGGA CCTGCTGCTT CTCTATACCG ACGGCATCAC TGAAGCGGAA AACCGCGAGG GCGGGTTCTT CGGCATTGAT AGATTGCGGC AAGTAGTGGC GCGCGAGCAT 
ATGAAACCGG CAGCGGGAGT GATCGCCGCA GTCATGGAAT CCGTCAGGAC CTTCACCGGC TGCGACGCCT TCAACGACGA CATCTCTATG CTTCTTATCA AATTCGTCCC CGTAACCCCC CATTCTTAG
|
Protein sequence | MVKPLRVLIV EDSEDDALLL VFELRRGNYA PVTMRVENSD SLRKALKEET WDLVISDYVL PGFSGLDALK LVRASGMDLP FIIVSGKIGE EDAVKAMKEG ANDYLIKGNT SRLIPAIERE MQEAEVRRKR REAESALVRS ERRYKRLVSA VTDYIYTVII HKGAVVKTSH GPGCLSVTGY SHEEYVDNPF LWYQMIYEED RSAVTSLTED LRAGKDIPSL EHRIRHKDGS LRWVINTIVP RYSEQGELIA YDGLISDISE RKRAEESLQL QSAALEAAAN AIVITDSSGV IISVNEAFTG MTGYGREEAL GRDLSFLKSE RHTSEFYRCL RETISAGEVW HGEMINRRKD GTLYPEEQTI TPVLDEDGRI NHFICIKQDI TERKQAEQAL MQNASMLKEM EIAKQIQMSL LPVTPPCLPG IDCAGNCSPA NNIGGDYYDI LPHGEELDLV IADVSGHSVG AALIMVETRS VLRAQLATLK GPAEIVSALN ELLHEDLSRA ELFITMSYLS YHIPTGTLRY TNAGHPPPLL YRHQTDQFFE LDAEGLILGV HREVFFQEPS LQVRDGDLLL LYTDGITEAE NREGGFFGID RLRQVVAREH MKPAAGVIAA VMESVRTFTG CDAFNDDISM LLIKFVPVTP HS
|
| |
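The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the final codon is an untranslated stop codon, which agrees with the gene sequence ending in TAG. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2123721
end_bp = 2125649
gene_length_bp = 1929
protein_length_aa = 642

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not translated.
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, all three checks should agree with the listed values: 1929 bp corresponds to 643 codons, one of which is the stop codon.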