Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0312 |
Symbol | |
ID | 8135619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 383441 |
End bp | 386245 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644867929 |
Product | Sigma 54 interacting domain protein |
Protein accession | YP_003020151 |
Protein GI | 253698962 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 172 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTCAG AGGAACAGTT ACCCGTCATG GACCAGCAAC TGAAAAAACT AGGCGACCGG ATCGCCTCCT TCGGCAAGGA AAACCTGGAC GAGATGCTGC ACCTGGTCGG CGAAGGGTCG CGGCTCATCT CCGGGCAGGA GCGCGTGCGC ATCTACCTTG AGGATCTCAC CAAGGGCGCC CTCTCCTGCG CCTTCTCCTG CGGCGGCTTC GCCACCGAAA TAAGGCAGGA AACCTTCCCG ATCATCTCAG CCGAAGCCGC GGTCTCCCGC ACCTTCGTGA CGCAGCAGGT CGCGCAATAC CAGAACCCCG CCGAAACCGG CCTCCCGCTG GACCAGGAAT TCTCGCGACG GTTCCAGATC GAGGGGACGA CCATGCTCCC CATCACCAGC CAGGGCAAAT CGATCGGCGT CGCCTGCCTT GATGGCAGCA CGCTCTCCCA AGACAAGATC GACGAACTCA TCCCCTTTCT GGCGCGGGCG GGCGAACGTG TGGACCAGGC CAGGAAATAC CACCAGCAGT TGCTGTTGGC CCGGCGGGTC GAGCTCTACA AACGGCGCGA AGCCGCCGGA TTCATGGTGC GCTCGGCAGT AAACCTCATC GACGGGCTGA CGCTCGCCTC GGTGCTCGTC CCCGTCAAGG GGGGGATGGA AGTGCTGGCA AGCCACGCCC AGGACCCGGA CCTGAAATAC CTCTACGACA ACGTGGGAGG GATCGATCTC AAGCACGGCA CCTCGCTCGT CTCCCGCTAC GTCAACGAGG CCGGCGTAGT CACCGACCCG AGCCTTCTGA AGCCTATGTT CTTTCCCGAC CTTTTGGACC AGTCCATCCA ACGCCGCGCC CTCACCGAGG AGATGGGGCT GCGCACCCTG TACATGGTGC CGCGGGTAGA GCCGGAGACC AAGCGCATCA TCTGCCTGAT GAACTACTTC ACCCGCGACG TCACCAACTA CACCGATTTC GAGATGGGGC TTTTGCAGAC CCACGCCGAG ATGGTGGAGC GGGTCATCAA CGAGGTGGGC GGCGAGCACC TGGAGATCAA GGTGCTATCC GAGATATCCG ATCTGCTCCA GGAGCCGAAC GAAGGGCTGC ACGGCTTCCT CACCAAGGTC CTCTCCAAGG CGACGGAACT GATCGGCGCC GACACCGGGA GCATCGCCAT CGTCCAGGAG CGCGAGGGAG TGAAGTGGCT GGTGGTCGAA AACGAGGAAG GGACCATCGT CGGGGCGAAG AACAAGGAGT GGCTCAAGAA GAAGATCCCC CCCTTCAAGA TCGGCGGTGC CGACCTTCCC CCCGAGGAGC GTAGCCTTAC CGGTTACGTC GCCTGGACCA AGGAGCCCAA GATCATCGAG CAGGTGGCGG ACCAAAGCCG GCACGAGGGG TTCCACCGCC CGATGAACGA ACTGATACAA AGCGAGATGG CGGTTCCCGT GATCAGCGAC GACGAGGTGA TCGCCGTCAT CTGCCTCAAC TCCCTGCAAG ACGGGTACTT CACCGAAGAA CACAAGCGGA TCCTGCAGAT CATCGACCGG CTCACCTCGC GCCACATCTC CGACATCCAG CGCATAGAGC GGCTGCAGTC GGAGGTGAAC AAGCTGCAAA GCGACATCGC CTACAAGGAC CCGAAAGTCT CCTCCTACCG GCTGGGAAAC ATCATCGGCA ACAGCCGCAA GTCGCAGGAG ATCGTCGCCT TCATCAACAC CGTGGCGCAG CCTCTATCCA ACCGGATCGC GCTGTGGAGC AGGAACATCC TGCAGGAGGC GACCATAGGC CTTCCCTCCA TCCTGGTCCT TGGCCCCACC GGCGCGGGGA AGGAATTCTT CTTCAACAAC CTGTACAACA AGCTAAACGA GCTCTACCGG CAGCAGATCA ACCCGGACGG CGAGCTTCCG GTGAAGAAGA CCAACATCGC CGCCTACAGC GGAGACCTTA CCTACTCCGA GCTCTTCGGA CACATCAAGG GAGCCTTCAC CGGCGCCTAC AGCGACCGCA AGGGGATCAT CGAGGAAGCC GCCGGAGGGA TCGTCTTTCT CGACGAGATC GGCGACGCCG ACCCGAAGAC CCAGGTCCAG TTGCTTCGTT TCCTCGACAA CGGCGGGTTC GTGCGTCTGG GCGAGAACCG CGAGCGTATC AGCCGCGTGC TCTTAGTTGC CGCCACCAAC AAGGACCTGA GCAAAGAGAT CGCCATGGGG AACTTCCGCG AGGACCTCTT CCACCGCTTG ACCGAGCTTT CCGTCGTGGT GCCGTCACTG AACGAGCGGC GCGAGGACAT CCCCGACCTC TCCATCCACT TCCTGGGGAA GCTCTACCGC ACCTACCGAA GCCGCGAGGA AGAGGCGCGC GACGCCGAGC CGAGCCTCTC CAAAGACGCC AAAGACGCCT TGGTAAGCCA CAACTACAAG GGGAATATCC GTGAACTGCG CAGCATATTG TTGCGCGCCC TCTTCTTCAG GACCGGCAAG ATGGTCACCG GCGAGGACAT AAAGAAAGCC ATCCGGGACG GGATGCGGGA GCAGATGACG CCTGCGGCGG AAAGGCTATC CGAAGAAGTC GCCGGGGGGA TACTCGCCGA GATAGAAGCG GGGAGCGATT TTTGGGAAGC CGTGTACCAG CCCTACTCGC AAAGCCGAAT TTCCCGCGAC GTGGTGCGGC TGATAATAGA GAAGAGCCGT GTGGCCGCCG GCAAGAACAT GCCCGAAATA GCCAGGTACC TGAAAGCCAT CACCGGCGAT CCCCAGGAGG ATGAAGAAGA GAGGAAACGC TTCTTCCGGT TCAAGAACTT CCTTTACAAG ACGGTGAAGA TCTGA
|
Protein sequence | MYSEEQLPVM DQQLKKLGDR IASFGKENLD EMLHLVGEGS RLISGQERVR IYLEDLTKGA LSCAFSCGGF ATEIRQETFP IISAEAAVSR TFVTQQVAQY QNPAETGLPL DQEFSRRFQI EGTTMLPITS QGKSIGVACL DGSTLSQDKI DELIPFLARA GERVDQARKY HQQLLLARRV ELYKRREAAG FMVRSAVNLI DGLTLASVLV PVKGGMEVLA SHAQDPDLKY LYDNVGGIDL KHGTSLVSRY VNEAGVVTDP SLLKPMFFPD LLDQSIQRRA LTEEMGLRTL YMVPRVEPET KRIICLMNYF TRDVTNYTDF EMGLLQTHAE MVERVINEVG GEHLEIKVLS EISDLLQEPN EGLHGFLTKV LSKATELIGA DTGSIAIVQE REGVKWLVVE NEEGTIVGAK NKEWLKKKIP PFKIGGADLP PEERSLTGYV AWTKEPKIIE QVADQSRHEG FHRPMNELIQ SEMAVPVISD DEVIAVICLN SLQDGYFTEE HKRILQIIDR LTSRHISDIQ RIERLQSEVN KLQSDIAYKD PKVSSYRLGN IIGNSRKSQE IVAFINTVAQ PLSNRIALWS RNILQEATIG LPSILVLGPT GAGKEFFFNN LYNKLNELYR QQINPDGELP VKKTNIAAYS GDLTYSELFG HIKGAFTGAY SDRKGIIEEA AGGIVFLDEI GDADPKTQVQ LLRFLDNGGF VRLGENRERI SRVLLVAATN KDLSKEIAMG NFREDLFHRL TELSVVVPSL NERREDIPDL SIHFLGKLYR TYRSREEEAR DAEPSLSKDA KDALVSHNYK GNIRELRSIL LRALFFRTGK MVTGEDIKKA IRDGMREQMT PAAERLSEEV AGGILAEIEA GSDFWEAVYQ PYSQSRISRD VVRLIIEKSR VAAGKNMPEI ARYLKAITGD PQEDEEERKR FFRFKNFLYK TVKI
|
| |