Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1016 |
Symbol | |
ID | 8136338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1198934 |
End bp | 1200124 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644868628 |
Product | putative transcriptional regulator |
Protein accession | YP_003020836 |
Protein GI | 253699647 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 121 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTAA ATAAACCGCT GGACCAACTG ACGGAAGCAG ACTTCCAGGA ACTCATCGCG AATAAAGTCC CCGAGAGCAA AACCCTCGAC TATAAGGTCG ATCTAAAGTT TGGTGACCGG GATAAGCGGG AGTTCCTCGC CGACGTGTCG TCGTTCGCCA ACACTGCCGG CGGCCACCTG CTCATTGGTA TCAAAGAGGA GGGCGGCATC CCGACCGCTC TCCCGGGCAT CGACCTCGAC AACCCCGATA CGGAAAAGCT GAAACTGATT AACCTCATTC GGGACTGCAC TCAGCCGCGC ATCCCGGGGG TTGCCATCAC ATCGGTCCCT CTCCAGAATT CCCGTTACAT CCTTGCCATC CATATCCCGA AAAGCTGGGC AGTCCCGCAC GTAGTAAGCA TCGAGAAGCA TTGGCGCTTT TATGCGCGGC ATTCCGGCGG CAAGTATCAG CTAGACGTCC CAGAACTGCG CCAGGCGTTC CTCATGTCGG AGTCCCTCGC AGAAAAAATT CGCCAATTTC ACAGCGAACG AGTGGGCATG GTGATATCCG GCGAAGTACC ACTAAACCTT GCCAACGGGC CGAAGTTCAT TGCCCACATC ATACCGGTGG ACGCATTTGG GTCAGGACAA CAAGTGGATA TGTCGATGCT GATGGAGAGG GGCATCCACT TCAATCCGCT GGGCGCCTCT GGCTATAACC GTAGATACAA CCTGGACGGC TACCTTACGT ATGAAGAGGA AAGAACCCAG GATGCATCCC ATGCGTTGTC CTACACCCAG CTATTCCGTT CCGGCATTAT CGAATCGGTC TGTGTTGACA AGGACCACCT AAATGCCAAT GAACGCGATA GAGGCATCCC CATCACTTAC TATCAAGAGC AATTGCTGCG GTTCTTGTCC GCTTCCCTGC AATCATTGAA ACAACTCGAG GTGGAGCCAC CCTATTCAAT GATGGTCACC ATGGCCGGCG TGAAACACAG GTATTTGCAT TTCGGCAATA GGTACTTCTC GCTCAGGAAT CCCTATATTG ATAGGGATGT GCTGCAATTG CCCGACATCC TTATTCAGGA CGCTGACTTC GCCGGCGGAA AGACAATGCG ACCGATATTC GACGCCATCT GGAATGCCGG AGGGCTTGAA AGGTGCTTCG ATTACGACGA AGAAGGCAGG TGGAACGGCT ATGGCCAATA A
|
Protein sequence | MSLNKPLDQL TEADFQELIA NKVPESKTLD YKVDLKFGDR DKREFLADVS SFANTAGGHL LIGIKEEGGI PTALPGIDLD NPDTEKLKLI NLIRDCTQPR IPGVAITSVP LQNSRYILAI HIPKSWAVPH VVSIEKHWRF YARHSGGKYQ LDVPELRQAF LMSESLAEKI RQFHSERVGM VISGEVPLNL ANGPKFIAHI IPVDAFGSGQ QVDMSMLMER GIHFNPLGAS GYNRRYNLDG YLTYEEERTQ DASHALSYTQ LFRSGIIESV CVDKDHLNAN ERDRGIPITY YQEQLLRFLS ASLQSLKQLE VEPPYSMMVT MAGVKHRYLH FGNRYFSLRN PYIDRDVLQL PDILIQDADF AGGKTMRPIF DAIWNAGGLE RCFDYDEEGR WNGYGQ
|
| |