Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2100 |
Symbol | |
ID | 8137436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2442238 |
End bp | 2444493 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869715 |
Product | GAF sensor signal transduction histidine kinase |
Protein accession | YP_003021910 |
Protein GI | 253700721 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.0262959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCTT CACGACTACG CGACATGCCG CTTTACAACA GCAGGATCAT CAACTCCTAC ATCCTGCTGA TAAAGCACAA GTACAGCCAC GTGGACATCT CCGAGCTGCT CCAGTACGCC GGGATGAAGG AATACGAGGT TGCCGACCAG GCGCACTGGT TCAGCCAGGA GCAGATCGAC AGGTTCCACG AGAAGCTGCA GCAGATGACC GGCAACGCGA GGATCTCGCG GGAGGCCGGG CGCTACGCCG CATCCCCGGA CGTTTTGGGG GCGATGAGGC AGTACATCCT GGGGCTCGTG GATGCCTCCA GCACCTTCGA CATCATCAGC AAGACCACCG CCAAGTTCAC CAGGTCCTCC AGCTACCAGT CGCGCTCCAT CGCCTCCAAC CAGGTCGAGA TAACCGTCAC CCCCTACGAG CCGGGGTTGG AGAAGCCGTA CCAGTGCGAA AACAGGATCG GCTTTTTCGA GGCGATCGTG CTTGTCTTCA ACCACAAGAT GACCGACCTG CAGTACTTCC CCGAGATAAG AAACCTGCCG ACCATCGAGC ATCCCGAATG TATGTTCAGG GGGGATCCGG TCTGCCGCTA CGTCGTCACC TGGGAGAAAA CCCTTTTCAC CTTCCTGAAA AAGATGAGGA ACATCCTGGC ACTCCCCCTG GCGGCGCTGA ACCTGGCGCT TTTGGCGGTG GGGGAGGCGG GGCTTTTGAC CTGGGTCTTC CCGTCGAGTC TCTGCGTACT GCTACTCGTC GCCCTGGCGA CGGAGACCAG CGAGAAAAAG GCGATCAAGG AGAGCCTTTG GAGCACCCGC GACTCCATCG AGAACCTCCT GGACCAGATC AACCTCAACT ACAACAACGC CATGCTCACC CACGAGATCG GGCAAAGGCT CGGCAACTAC ACCAGGATCG AGGAGATGCT CTTCGACGTG GCCCAGATCA TGAGGTACCG GCTGGAATAC GACCGCGGCA TGATCCTTTT GGCCGACGGC GGCAGAAAAC GCCTGGAGCT GCGCGCAAGC TACGGCTACA AGGACGAGGA ACTGGCCTGC CTCAACTCGC TCCCGTTCCT TCTGGAAAAC CGGGAGCAGG GGGACGTCTA CCTGGAATGC TTCCGGGGGC AGCGGCCGTT TCTGGTGAAC GACCTCTCCA TCGGCAGCGT GGGCGAAAAT ACACTCGCCT GCGCCCACAT GAGCGGCACA CGCGCCTTCA TCTGCTGCCC CATCGTTGCC GACGGCGCTT CCCTCGGGGT CCTGACCGTG GAAAACGTGC AGGTGAAAAG GCCGCTGCTG GAAAGGGACA TCAGCCTCAT CATGGGAACC GCCTCGGTGC TCGGCATCAG CATCCGCAAC TGCGAGCTGA TCGGGGCGCT GGAAACGGCC AACGAGGAGT TGGAACTGAG GGTGGCCAAG AGAACGGAAG ACCTGGAGAA AAGCCGCCAG AAGATGCAGC TGCAGCACGA GGAACTGGTG CGGACCTATT TCGAGCTGGA GGAGGAGACC GCCCAGCGGT TGAGCGCGCT GGAGGAGCTG GCGCGCAAGG AACGGATGCT TTTGCAGCAA AACCGGCTGG CGGCTCTCGG GGAGATGATC AGCAACATCG CGCACCAGTG GCGCCAGCCC CTGAACGAAC TGGGGCTGAT CGTCCAGGAA CTTCCAGTCA TGTACGACCG GGGGGACTTC AACAAGGAGT ACCTGCGCGA AAGCGTCTCC AAATTCATGA AGGTTTTGAG CCACACCTCG AAGACCATCG ACGACTTCAG GACCTTCTTC AAACCGGACC GGGAGATGGT TCCTTTCCGG GTCACCGAGG TGGTGGAAAA AGCGCTCTCC TTGGTCGGGG AGAGCTTGAA GCACCTGGAG ATACAGGTGA CGGTCCATTC TGTGGACGAC CCGGCGATCA TGGGGCACCC AAACGAGTTC TCCCAGGCGA TACTCAACAT CCTCTTCAAC GCCCGAGACG CCTTCAAAGA GCGCGGCATC TCTTCCCGGC AGATCGAGAT CCGCATCTTC CCGGAAGACG AAACCTGCGT AGTCACCATA GCCGACAACG CAGGCGGGAT CCCCGAGGAG ATCATGGACA AGATATTCGA CCCCTATTTC ACCACCCGCG GACCTGAACA GGGGACCGGG ATCGGCCTCT ACATGACCAA GATGATCGTC GAGAAAAACA TCCCCGGGAA ACTGTCGGTG CGCAACACGG AAAAGGGGGC GGAATTCAGG ATCGAGGCGA GCAAAGTTGC GCTCCGCCCG CATTGA
|
Protein sequence | MQPSRLRDMP LYNSRIINSY ILLIKHKYSH VDISELLQYA GMKEYEVADQ AHWFSQEQID RFHEKLQQMT GNARISREAG RYAASPDVLG AMRQYILGLV DASSTFDIIS KTTAKFTRSS SYQSRSIASN QVEITVTPYE PGLEKPYQCE NRIGFFEAIV LVFNHKMTDL QYFPEIRNLP TIEHPECMFR GDPVCRYVVT WEKTLFTFLK KMRNILALPL AALNLALLAV GEAGLLTWVF PSSLCVLLLV ALATETSEKK AIKESLWSTR DSIENLLDQI NLNYNNAMLT HEIGQRLGNY TRIEEMLFDV AQIMRYRLEY DRGMILLADG GRKRLELRAS YGYKDEELAC LNSLPFLLEN REQGDVYLEC FRGQRPFLVN DLSIGSVGEN TLACAHMSGT RAFICCPIVA DGASLGVLTV ENVQVKRPLL ERDISLIMGT ASVLGISIRN CELIGALETA NEELELRVAK RTEDLEKSRQ KMQLQHEELV RTYFELEEET AQRLSALEEL ARKERMLLQQ NRLAALGEMI SNIAHQWRQP LNELGLIVQE LPVMYDRGDF NKEYLRESVS KFMKVLSHTS KTIDDFRTFF KPDREMVPFR VTEVVEKALS LVGESLKHLE IQVTVHSVDD PAIMGHPNEF SQAILNILFN ARDAFKERGI SSRQIEIRIF PEDETCVVTI ADNAGGIPEE IMDKIFDPYF TTRGPEQGTG IGLYMTKMIV EKNIPGKLSV RNTEKGAEFR IEASKVALRP H
|
| |