Gene Information |
Locus tag | GM21_0533 |
Symbol | |
ID | 8135844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 653380 |
End bp | 656373 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644868150 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003020369 |
Protein GI | 253699180 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
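The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 653380
END_BP = 656373
GENE_LENGTH_BP = 2994
PROTEIN_LENGTH_AA = 997


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)         # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under those assumptions, 656373 - 653380 + 1 gives 2994 bp, and 2994 / 3 - 1 gives 997 aa, so the reported fields agree.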
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 110 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGTC TTTCGTCTCT CTCAATCCGG TCACTTTTGC TGCTGATAAC ATGCGTTGTG GCATTGCCCG CGGCGGTGAT AATCCTGTAC TCGGGAATTG AATTTCGCAA TACCATGCTT GGAGAAGCCA GAAAAGAGAC AGTCAAATTT GCGGAGACTA TAGTAAACGA GCAGCGGAAT CTCGTTGTGG CCGCCGAACA GTTGATGACG GCATTGGCGC AACTTCCCGA GGTGAAGTCG CGCGACAGCG CGAAGGTAGA GTCGATTCTC AAAGAGTTGC TTAAGTTGAA CCCGATGTAC GCAAACATCA CCATTGCCGA CTGCGAAGGC AAAGTCTGGG GCACCGCAGT CCCGACAACC GTACCCCTGA ACATCTCCGA CCGCTATTTC TTCAAGAGCG CCCTCGCAAC CGGCAAACTC TCCTCCGGCG AATACATCGT CAGCCGTATC ACCACCAAAC CCGCCTTCAG CTTGGGCTAT CCGGTCAGAG ACGACGGCGG CGCAATCATC GGGGTTATCG GCGTCGCCTT CAATCTCGAG AACTACCGGG ATCTGTTGCA GCAGATGCGG CTGCCGTCCG GTTCCAGTTT CACCATCATC GATCATCGCG GCACCATCCT TTCCAGGGGA TTGACCCAGG GGAACTTTGC AGGGAAGGCT TACACGGTGG ATTCTTTTCG CAAAATGGTG GAGGGGCCGG ACGAAGGGGT AAGCATCAGG AAGGGACTGG CAGGCGACAC GAGGATCATC GCCTACCGAA AACTGTACCT CCCCGGGGAA AAGACCCCGT ACCTGTACGT CACCGCAGGT ATCCCCGTGG ACGCGGCAAC CCATAAAGCC AACCGCGCGC TCGTCCTGAG CGCCTTGCTG TTATCGTCAT TTCTGGCCCT TGCCTGCCTT TGCGCGGTGC TTATCGGCAA ACGCTGCATC GCGGACCGCT TACAGCTTCT GGAAGACGCC TCTAGGCGGG TCGCGACGGG GGACCTGCGC ATCCGCGTCT CCGAGGCGGT GACAGGAGGC GAACTCGGCA GTCTCGCCCA GACTCTGGAC GGCATGGCTG ATCAGTTACG TACCAGGACG GATGCCCTGG CTCACAGCAA GATGTTTATG AACACGATCA TCGAAACGGA ACCGGAATGC GTCAAACTCC TCGACAAAGA GGGGAGGGTG CAGATGATGA ACAGCGCCGG CCTGAAGATG ATAGAGGCCG ACTCGCTGTC CCAGGTCCAG GGGCAGTGCG TTTACCCGCT GATAGCTCCC GAACACCGGG ACGCGTTCAT TCAATTGACC CGGCGTGTCT TTGAGGGCGT TGCCGGCAAC CTGGTCTTCG AGGTGATCGG CCTCAAGGGG GGGCATGTCT GGCTCGACAG TCATGCCGTG CCTTTCCGTA ACGAACGGGG GGAGATTGTG TCGCTCCTTT CCATCACCCG GGACGTCACG GGGCTCAGGA AATCGGAGGA GGAACGCCGG GAGAATCTGC TGCTGTTCGA GTCTCTTATG CGGCACTCGC CGATGGGAAT CCGCATCTTT GACGGCGTCT CCGGCAAATG CATCCTGCTT AACCAAGCCA CCGCCGATAT TGCGGGTGGC GACATGAAAA CGATGCAGGA GCAGAGCTTC CGGGAATTAA AGTCCTGGCG GGAAAGCGAC CTGCTCGCCG CTGCGGAAAA GGTGCTAGCT GACGGCGTGG TTCGGGTAGT CGAGGCGGAT ATCCGCACGA GCTTCGGAAA ATCAGTTGTG ATGTCCTACA TCCTGTCCAG GCTCCTCATC AAGGACAAGC AGCATCTTCT CGTCGTCGGC 
CGGGACGTCA CTGACGAAAA ACGGCTGACT GAAGAGAAGA AAAAAATGGA AGCGCAGTTG CTGCATGTCC AGAAGCTTGA GAGTCTGGGG GTGCTTGCCG GCGGCATCGC CCACGACTTC AACAACATCT TGATGTCGGT CATGGGGAAC GCGGAGTTGG CGCTTTTGAC TCTTCCCCCC GAATCTCCCG CCCGAACCAA CCTGCGGAAC ATCGAGATCT CCTCGCAGCG CGCGGCTGAC TTGGCCAGGC AGATGCTGGC CTATTCAGGC AAAGGGAATT TCGTCATAGA GGAGATCGAT GTCAACAAGC TGATAAACGA GATGAACCAC ATGCTGGAGG TCTCCATCTC CAAAAAGGTG GATGTTCGAT TCAATCTCGA CAGTGGGCTG CCGCTGGTGT CGGTCGACGC GACCCAGATC CGGCAGGTCA TCATGAACCT GGTGATCAAC GCTTCCGAAG CGATAGGCGA CCGCAGCGGA GTGATATCGA TCTCCACCGG CGCCATGGAA TGCGATGCGG CGTTTCTCTC CAAGTTGTGG TTGAACGACG CGCTTAGGGA AGGAACCTAC CTCTACTTTG AGGTTGCCGA TGACGGGTGC GGCATGGATC CGGCCACCTT GGCCAAGATC TTTGATCCTT TCTTCACCAC CAAGTTCACG GGGCGGGGCC TCGGTATGGC CGCGGTCCTC GGCATCATAC GGGGGCACAA AGGGACTATC GAGGTTCACA GCGAGCCGGG CAAAGGCTCG AGATTCACCG TATTTTTGCC TGCTCTTCCT CCAGGCTCCG CACGCCCGGC GCAGGAGGCG GAGGCGGCTC AGCTGTCGCC TGGTTCCGGC ACCGTACTGC TGGTCGACGA CGAGGAGACC ATCCGCAACC TCGGCAACGA GATGCTCCGG ATCTTGGGAT ACCGCGTGCT CACCGCTGAA GACGGGGTGG TCGCTGTCGA GCTTTTCAAG GAGCACCGCG GCGACATCAC CTGCGTCATC CTTGACCAGA CCATGCCGAA CCTGGACGGG GAGCAGACCT TCCGCATCCT GCGCAGCATA GACCCGTCGA TCAAGGTGAT CATGTCCAGT GGTTTCAGTG AACAGGACAT CGCCGAGAGG TTTACCGGAA GAGGCCTGGC CGGTTTCATA CAGAAGCCGT ACAAGCTTGC CAGCTTAAGC CGGAAGCTTC AGGAACTGGG GTAA
|
Protein sequence | MLRLSSLSIR SLLLLITCVV ALPAAVIILY SGIEFRNTML GEARKETVKF AETIVNEQRN LVVAAEQLMT ALAQLPEVKS RDSAKVESIL KELLKLNPMY ANITIADCEG KVWGTAVPTT VPLNISDRYF FKSALATGKL SSGEYIVSRI TTKPAFSLGY PVRDDGGAII GVIGVAFNLE NYRDLLQQMR LPSGSSFTII DHRGTILSRG LTQGNFAGKA YTVDSFRKMV EGPDEGVSIR KGLAGDTRII AYRKLYLPGE KTPYLYVTAG IPVDAATHKA NRALVLSALL LSSFLALACL CAVLIGKRCI ADRLQLLEDA SRRVATGDLR IRVSEAVTGG ELGSLAQTLD GMADQLRTRT DALAHSKMFM NTIIETEPEC VKLLDKEGRV QMMNSAGLKM IEADSLSQVQ GQCVYPLIAP EHRDAFIQLT RRVFEGVAGN LVFEVIGLKG GHVWLDSHAV PFRNERGEIV SLLSITRDVT GLRKSEEERR ENLLLFESLM RHSPMGIRIF DGVSGKCILL NQATADIAGG DMKTMQEQSF RELKSWRESD LLAAAEKVLA DGVVRVVEAD IRTSFGKSVV MSYILSRLLI KDKQHLLVVG RDVTDEKRLT EEKKKMEAQL LHVQKLESLG VLAGGIAHDF NNILMSVMGN AELALLTLPP ESPARTNLRN IEISSQRAAD LARQMLAYSG KGNFVIEEID VNKLINEMNH MLEVSISKKV DVRFNLDSGL PLVSVDATQI RQVIMNLVIN ASEAIGDRSG VISISTGAME CDAAFLSKLW LNDALREGTY LYFEVADDGC GMDPATLAKI FDPFFTTKFT GRGLGMAAVL GIIRGHKGTI EVHSEPGKGS RFTVFLPALP PGSARPAQEA EAAQLSPGSG TVLLVDDEET IRNLGNEMLR ILGYRVLTAE DGVVAVELFK EHRGDITCVI LDQTMPNLDG EQTFRILRSI DPSIKVIMSS GFSEQDIAER FTGRGLAGFI QKPYKLASLS RKLQELG
|
| |
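The sequences above are displayed in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch, not a reference implementation: it strips the display whitespace, computes a GC fraction, and translates the first 30 bases of the gene sequence. The helper names are hypothetical. The codon table is built by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (its differences concern alternative start codons, which do not matter here because this gene starts with ATG).

```python
from itertools import product

# Codons enumerated over T, C, A, G with the third base varying fastest,
# paired with the amino acids of the standard code (assumed shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(display_text: str) -> str:
    """Remove display spaces and line breaks, then upper-case."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
head = clean_sequence("ATGTTACGTC TTTCGTCTCT CTCAATCCGG")
print(translate(head))  # MLRLSSLSIR
```

The translated fragment matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence would be one way to check the reported GC content.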