Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0549 |
Symbol | |
ID | 8135860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 671045 |
End bp | 673183 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644868162 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003020381 |
Protein GI | 253699192 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.338238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATTAA TTAACCCCAA TGGAAATGAC CACCCCCCCG GTCCTTCCGA GAATAGCGTT ACCGCGACCG GTGCCGACGA GCCGGCCGGC AGGAAGGGAC GCAGGAACCT GAACAGGTGG CTCCTGGCCG GGCTCCTTCC GCTTGCAGCT TTGGCGCTGC AATGGCTTTT ATGGGAAAGA CTGCAGCCCT ACGTCTGGCT ATTCTTCTAC CCTGCCGTTT TCTTAAGTTC CTGGGTGGGA GGTAGGGTCG CAGGGCTTTT GGCTACGGCA TTTTCGGCTG TCGTCGTGTG GTATCTCTTC ATCCCCCCCC GCTACAGTTT CTCTCTGGTT CACCCCTCGA CGATCATCGC GCTGCTCATC TTCGCGGGCA TGGGCGTCCT ATTCTCCCTG TTCCACGAAC GGCTGAGAAG AGCGGGCCGG CAGACCCGGC TGGCGCTGGC GGAGGCTATC TTGGGACGGG AACACCTTGA GCATCTGATG GAGGAGCGGA CACGCGAACT GGTGAGCCTG GTCGACGAGT TGCGGCGCAA GGAGGCGGAC CTACGGAGGT CCCAGGAGCT GGCCAAGATC GGCAGCTGGA CCTACCAGGC AGACGGGAGG CTTGAGTGGT CGGACGAGCT TTACCGCATC TACGGCCTGG CTCGCGGGAA ATTCACACCG GACGTTCCAT CCTTTCTTGA GATCATCCAC CCGGATGACC GCGCTCACAT GCAGCATTGG GTTAAGGCCT GCCTGGGAGG GGAGAACCCC GGCGAGTTGG AGTTCCGGAT CGTTCGTCCC GACGGGAGCG TGCGCTTCAT CAGCGGCCAC GGCGCTCTGA GCCGCGATAG CGAGGGGCGA GTCACCGGCA TGTCGGGGAC CGGCCAGGAC ATAACCGAGC GGAAAATCGC TGAGTCCGCG CACCGGGAGA GCGAAGAGCT GCTGAAGCTT TTCATCGAAT ACGCGCCGGT GCCGCTTGCC ATGTTCGACC GCGAGATGCG TTACCTCTAC GCGAGCCGGC GCTGGCGCAG CGATTTCGGG TTGGGAGACC GTTCCCTCGT CAAGGTCAGT CACTACGCCA TATTCCCGGA GATCCCGCAG CACTGGAGGG AGCTGCACCG GCGCGGACTC GCCGGAGAGA TCCTGCGCGA GGAGGCCGAA GAATTCCGGC GCGCCGACGG CACGGTCCAG TGGCTGCGCT GGGAACTCCG CCCCTGGTAC GACGCCGGGG GGAAGGTGGG GGGAATCGTC ATCTTCAGCG AGGACATCTC CAACAGAAAA TGCGCCGAGG ACGCGCTGCA AAAACTCAAC GAGGAGCTGG AACTGCGCGT GGCCCAGCGG ACCGAGACGC TGGACGTGAT CTTAAGCGAG CGGGAAGCGC AGAATGCGGA GCTGCAGCGC GCCTATCACG AGCTCGAGGC GGAGACGGCG CGGCGTATCC GCATGGTGGA GGAACTGCGG CAAAAGGAGC AGTTGCTGAT CCATCAAAGC CGGCTCGCGG CCATGGGGGA GATGCTCGGT TACATCGCGC ACCAGTGGCG CCAGCCGCTG AACGTCCTCG GGCTGCACCT GCAGGTGCTG GGGCTTTCCT ACCAGCACGG GACCTTCAGC CGGGAACTCC TGGAGGAGAG CGTGGGAAAA GCGATGGGCA TCATAAGGCA CCTCTCCAGG ACCATCGACG ACTTCCGCGA CTTCCTGATC CTCAACAAGG AGAAGACCCT GTTCCAGGTC GACGAGGTGA TCGTAAAGAC GGTCGGGCTC ATCGAGGAGC ATCTCAAGAA GGCGGGGGTC CGCATCGAGG TCGCCTGCAC CAACCCGCCG GAGGTGAACG GCTTTCCCAA TGAATACAGC CACGTGATTC TGAACCTTTT GACCAATGCG AAGGACGCCT TTTTGGAGCG TCAGACGGAG CATCCGGTGA TCAGGGTGCA TTCGGGGTCC GAACAGGGGA AGACGGTGGT GACCATCGCC GACAACGCCG GGGGGATCCC GGAAGAGATA ATCGACAAGA TCTTCGACGC CTACTTCACC ACCAAGGGGT TGGGAAAAGG AAGCGGGGTC GGCCTGTTCA TGTCCAAGAT GATCATCGAG AAAAACATGG GGGGCAGCCT CACCGTTCGC AACGTCAACG GCGGCGCCGA ATTCAGGATC GAGATCTGA
|
Protein sequence | MQLINPNGND HPPGPSENSV TATGADEPAG RKGRRNLNRW LLAGLLPLAA LALQWLLWER LQPYVWLFFY PAVFLSSWVG GRVAGLLATA FSAVVVWYLF IPPRYSFSLV HPSTIIALLI FAGMGVLFSL FHERLRRAGR QTRLALAEAI LGREHLEHLM EERTRELVSL VDELRRKEAD LRRSQELAKI GSWTYQADGR LEWSDELYRI YGLARGKFTP DVPSFLEIIH PDDRAHMQHW VKACLGGENP GELEFRIVRP DGSVRFISGH GALSRDSEGR VTGMSGTGQD ITERKIAESA HRESEELLKL FIEYAPVPLA MFDREMRYLY ASRRWRSDFG LGDRSLVKVS HYAIFPEIPQ HWRELHRRGL AGEILREEAE EFRRADGTVQ WLRWELRPWY DAGGKVGGIV IFSEDISNRK CAEDALQKLN EELELRVAQR TETLDVILSE REAQNAELQR AYHELEAETA RRIRMVEELR QKEQLLIHQS RLAAMGEMLG YIAHQWRQPL NVLGLHLQVL GLSYQHGTFS RELLEESVGK AMGIIRHLSR TIDDFRDFLI LNKEKTLFQV DEVIVKTVGL IEEHLKKAGV RIEVACTNPP EVNGFPNEYS HVILNLLTNA KDAFLERQTE HPVIRVHSGS EQGKTVVTIA DNAGGIPEEI IDKIFDAYFT TKGLGKGSGV GLFMSKMIIE KNMGGSLTVR NVNGGAEFRI EI
|
| |