Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2070 |
Symbol | |
ID | 8137406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2400102 |
End bp | 2402039 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644869685 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003021880 |
Protein GI | 253700691 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000000000000789609 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCAAG GTGACGACGC GGCCATGTCC ATGGACGCGG AAAAACTCGC AGCCATGGTG GAATCGTCCG GCGACGCGAT TATCGCAATG AACGTCGACG GCACCATCAC CAGTTGGAAT CCGGCGGCGA CAAAGCTCTA CGGCTATACC GCGAAGGAAG CTGCAGGCTG CGACATCTCC TTTCTGGAAT CGCCGGAGCG ACCCGGCGAG ATCTCGGCGG AACTGCAACA GGTGCAAAGG GATAGGCGAT CCCGGCACTT TGACCGCTAC CGCCGGCGCA AAGACGGCAG CCTGGTTTTC GTCTCGCTGA CGCTCTCCCC GATACTGGAC AAGCAAAATA CGCTGATCGG CCTTTCCTCC ATCGCCCGAG ATGCGGAGGA ACTGGTACAG CAACGTACCA AGGAACTGAT GCAGGCCATC CAGGCGCTGC AGGTTGAGAT CGCCGAGCGA CGCAGGGCAG AGGATGCGCT TACCCGGAGC GAGGATATGC TCCGCTTCGC TGCATTGGCG GCGGACATCG GCATGTGGCA CTACGATCTG TTGACAGGGG ACCTGGTCTG GAGCGACAGG TGCAAGGAGT TGTTCGGCTA CTCTCACGAC TTTCAGATGA CCTATGAAGC TTTCCTCGAC GCCGTCGCGG AAGAGGACCG GCAGGGGGTA GACCAAGCGG TTCAGAGATC CTTGCAGGAA AAATCCGAAT ACGCCGTAGA GCTGAGGGTG ATGCTGCAGG ACGGCCAGGT GCGTTGGGTA ATGAGCAAAG GGCACGCCTT CTACGATAGC CAAGGGAAAC CGTTGCGGAT GGCAGGCATA GCCTTGGACA TCACCCAGAG GAAAAAAACG GAAGCGGCGT TACTGCGGGC CAAGGAGGAA TGGGAAAGCA CTTTCAACAG CGTCCCCGAT CTCATCGCCA TACTGGACGA AAAGTGCCGC ATCGTCCGGG TCAACGAAGC CATGGCGCAG CGGGTCCATA TCAACCCCGA CGGATGTGTC GGGCTTTTTT GCTACCAGGT CCTTCACGGG GAGAATGCGC CGCCGCATTT CTGTCCCCAC GGCCAAAGCT TAATGGACAA TATGCAACAC ATTGCCGAGG TCTACGATCC CCACCTGACC GGAACCTTTC TGGTCAGCAC CACGCCGCTT GTGGCCGCCG ACGGCAAATC GATCGGAACG GTTCATGTCG CCAGGGACAT AACGGAGCGC AAGAGGGCGG AGGAGGAGAT CGCCCGGCTG AACGCCGATT TGGGGGCGCA TGTCGCCGAA CTGGAGGAGA GAAACCAGGA ATTGGACGCC TTCAACCGCA TGATATCCCA CGACCTGCGG CAGCCCTTGA ACATCATGTC CCTTGCGGGC CAGCACATCG ATATGCTGTG CAGCAGCGAT AACCCCGAGT GCCGGCAAAG CGTGCGGACG CTCGAACAGG CGGTATTGCG CATGAACGCC ATGATTGAGA CGCTGCTCTC CTTCTCGCGT TCCACGCATG GGGATCTGTT GCGCGAGGAT TTGGACATCA GCGAGACGGT GCAGGTGATA CTCGCTGAAT TATGTCTGGC CGAGCCTGTG CGCCGGATAA GGACCGTGAT AGAGGAAGGG GTCATGGTCA ATGCCGACCC CCGGCTGTTG CGGACCGCGC TGGAAAACCT TCTGGGAAAT GCCTGGAAAT ACACCGGCGG CCGTGAAGAG GGATACATCG AGTTCGGTGT GAGGGGAGGG GAGGTAGAAC CGGTCTACTT CATCAAGGAC AACGGGACTG GGTTCGACAT GGCCGATGCG GATAAGCTCT TCGTCCCGTT CCAGCGGCTG GCAGGGGCCG ACGCGTTCAA AGGATCGGGC ATCGGCCTGG CGACCGTGGA AAAGATCATC AAGCGGCACG GCGGAAGGAT CTGGGCAGAG GGGGAGCCGG ACAAGGGAGC CACCTTCTAC TTCACGCTTA AAAGTTGA
|
Protein sequence | MKQGDDAAMS MDAEKLAAMV ESSGDAIIAM NVDGTITSWN PAATKLYGYT AKEAAGCDIS FLESPERPGE ISAELQQVQR DRRSRHFDRY RRRKDGSLVF VSLTLSPILD KQNTLIGLSS IARDAEELVQ QRTKELMQAI QALQVEIAER RRAEDALTRS EDMLRFAALA ADIGMWHYDL LTGDLVWSDR CKELFGYSHD FQMTYEAFLD AVAEEDRQGV DQAVQRSLQE KSEYAVELRV MLQDGQVRWV MSKGHAFYDS QGKPLRMAGI ALDITQRKKT EAALLRAKEE WESTFNSVPD LIAILDEKCR IVRVNEAMAQ RVHINPDGCV GLFCYQVLHG ENAPPHFCPH GQSLMDNMQH IAEVYDPHLT GTFLVSTTPL VAADGKSIGT VHVARDITER KRAEEEIARL NADLGAHVAE LEERNQELDA FNRMISHDLR QPLNIMSLAG QHIDMLCSSD NPECRQSVRT LEQAVLRMNA MIETLLSFSR STHGDLLRED LDISETVQVI LAELCLAEPV RRIRTVIEEG VMVNADPRLL RTALENLLGN AWKYTGGREE GYIEFGVRGG EVEPVYFIKD NGTGFDMADA DKLFVPFQRL AGADAFKGSG IGLATVEKII KRHGGRIWAE GEPDKGATFY FTLKS
|
| |