Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1805 |
Symbol | |
ID | 8137136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2101875 |
End bp | 2103326 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869417 |
Product | PAS modulated sigma54 specific transcriptional regulator, Fis family |
Protein accession | YP_003021617 |
Protein GI | 253700428 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.00407865 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGAAC CGGTTATGAA TACTGCTCCC TCGCTCGATC TCGAGGAGAT GGCGCGCCAG ATGCGGGCGT TTCAGGATCT GACCCGCGAG CTGGACGCGA TCATCGATTC GTCCTCGGAC GGGCTCTGGA TCTGCGACGC CGAGGCCCGG GTCATCCGCA TCAACCCTGC CTCGGAGCGC ATCAACAACA TAAAGGCCTC GGAAGTTGTC GGTAAGAACA TGCGGGAACT CCTCGATGAA GGTTTCATCG ACCGTTCGGC GGCACTTGAG GCGATCACGA CCAAGAAGGT GGTCAGCCAA CTGCAGAATA GGGAAGGGCG CAAGCTCATC TCGACGGGGA CCCCGGTTCT GGACGCGAAC GGCGAGGTGA TCCGGGTCGT GGTGAGCGAG CGGGACATCA CGGAAATCGA TAACTTGCAG CGCGAACTGG AAGAGCAGGA GGCGCTGCGG GATCAGTTCC GCAACCACAT GCTGGAACTT CAACAGGCGG ACGTGGCATC CAAGAGCGTC GTCGCCAGGA GCCCGCTGAT GGTGAACGCC CTGAAACAGG CGCTCAAGGT GAGCGCGGTG AACTCGACGG TGCTGATCCT CGGGGAGTCC GGCGTCGGCA AGGGGCTGAT AGCGGAGTTG ATACACAAGA ATTCCACCAG GGCGGACAAG CCGCTGATTG AGATAAACTG CGGCGCGATA CCGGAGTCGC TGATCGAGTC GGAACTCTTC GGCTATGAGA AGGGGGCCTT TACCGGCGCG CAGACTACCG GCAAACCGGG CTATCTGGAA CTCGCGGACG GCGGCATCCT GTTTCTGGAC GAGATCGCGG AGCTGCCGCA GTCGGCGCAG GTGAAACTGC TTCGCTTCCT CGAAAACGGG AAGGTGATCC GTTTGGGGGG GACCAAGGCC AGGCATCTGG ATGTGCGCAT TCTCGCGGCG ACGCACCGAA ATCTTGACGA AATGGTGCGG CAGGGGAGCT TCAGGCTGGA CCTTTATTAC CGGCTCAACG TGATCCCGAT CGGCGTCCCG GCTTTGCGCG AGCGGCGGGA CTGCATTCTG CCGCTGGTAA GACACTACCT GGAACTTTTC GGCGCCCGCG ACTCCATCCG CAAGCGACTG ACACGTGCCG CCTCCGATGC GCTCCTTGCC TATGACTACC CCGGAAACGT GCGGCAGTTG ATGAACATCT GCGAGCGGCT CGTGGTCATG GCGGAAACGG ACCTGATCGA CTTGAAGGAT CTCCCCGCCG AGATATCCGC CGGCATCGGC AAACCTGCCG CTGTGGCCGG GGTCTGGCAG GAGGATGTGC CGCTTCAGGA GACGCTGGAT CAGGTCGAGA AGGCCGTCCT GGAAAAGGCG CTGGCCAAGC ATCGCAACCA GACGCGCATG GCGGAGGTTC TTGGGGTGAA CCAGTCGACC ATCGCCAGGA AACTCAGGAA ATACAAGCTG AACGGCAATT GA
|
Protein sequence | MTEPVMNTAP SLDLEEMARQ MRAFQDLTRE LDAIIDSSSD GLWICDAEAR VIRINPASER INNIKASEVV GKNMRELLDE GFIDRSAALE AITTKKVVSQ LQNREGRKLI STGTPVLDAN GEVIRVVVSE RDITEIDNLQ RELEEQEALR DQFRNHMLEL QQADVASKSV VARSPLMVNA LKQALKVSAV NSTVLILGES GVGKGLIAEL IHKNSTRADK PLIEINCGAI PESLIESELF GYEKGAFTGA QTTGKPGYLE LADGGILFLD EIAELPQSAQ VKLLRFLENG KVIRLGGTKA RHLDVRILAA THRNLDEMVR QGSFRLDLYY RLNVIPIGVP ALRERRDCIL PLVRHYLELF GARDSIRKRL TRAASDALLA YDYPGNVRQL MNICERLVVM AETDLIDLKD LPAEISAGIG KPAAVAGVWQ EDVPLQETLD QVEKAVLEKA LAKHRNQTRM AEVLGVNQST IARKLRKYKL NGN
|
| |