Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gmet_1083 |
Symbol | |
ID | 3741290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter metallireducens GS-15 |
Kingdom | Bacteria |
Replicon accession | NC_007517 |
Strand | - |
Start bp | 1205546 |
End bp | 1208506 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637778361 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_384048 |
Protein GI | 78222301 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0721424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.00000190438 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGAAAAA TCACAGTAAA CGTTATCGCG CTGAATATTC TCTTTGTCAT CGGCGCTACC GTGATCACCA CGTTCCTCTT CCTGGCGGCA ATGCGCGATG AATCGGAACG GCTGGCAAAG GTGGAGCAGG AACAGGGGAT CCAGACCTTC TGGAGGCTCC TCAGGGCCAA GGGGACCGAT TTCAGGATTG TCGACGGCAA GCTCATGGCG GGGGACTATG TCCTGAACGG CAATTTCGAA CTCCCTGACA TGATCCAGTC GATCTTTGGC GGCACCGCGA CGGTCTTCAT GGGTGATACG CGGGTTGCCA CCAACATCCG CAGGGAAGAC GGGACCCGTG CCGTCGGTAC CAAGTTGAAC GGACCCGCCT ACGACGCCAT CTTCAGGCAC AACAAACCCT TTCGCGGTGA GGCATTGATT TTCGGCATCC CCTATTTCAC CGCCTACGAC CCCATCAGAA ACAGCCGGGG CGAAATCATC GGCGTACTCT ACGTGGGTGC CAGGAAGAAC GAATTCCATT CGCGGTACGA GCACTATAAG GTCAATGTCA TCGCCGGCGC AGCGGTCATA TCCACTATCT TTACGGTTCT GGCCTTCGTG GTTCTCAGGG ACAGAAAACG CCACCTTGAG GCACTTCAGG ACAACGAAAC CAAGTACCGG ACCCTCTTCG AGAGTTCCTC AGAGGGAATT CTCCTGTTCG ATGGCGTTAT TTTTGACTGC AACGAACAGC TGTGTCGACT GTTTGGCAGC AGCCGCAATG AAATCATCGG GAGATCTCCT GTTCACTTCT CTCCGGAACT GCAGTCCGAC GGCACCCCCT CCGCCGAAAA AGCCCGACAC ATGCTTAACA CTACCCTGGG TGGGGAAGCC GGGATCTTCC CCTGGCAGCA CCGGAGACAG GACGGCACCT TGGTCGACAC CGAGGTTTCC CTGAAGGCGT TGACCATCCA GGGACGCACG GTTCTTCAGG CGGTGGTGCG AGACGTTACC GAATGGGAAA AGGCCGAGGA ATCGCTACGG ATGATCCGCC TACAGCAGCA GGCCATACTC GACAACATTC CCGACCTGGT CTGGCTCAAG GACATCGAAA GCAGATTCAT CACCGTCAAC GCGGCGTTTG CCCAGGCATG CGGCACCCTC CCCGCCGACC TGGTGGGGAA GACTGACCTG GACATCTGGC CCAGCGACCT GGCCACCCTC TATCGCGAGG ACGACGCCCG GGTCATAGCG TCGGGCAAGC AGGTCCGCAC CGAGGAGCCC CTCGCCGACG TCAACGGAAA GGGAGTCTGG ATCGAAACCA TCAAGATGCC CATTTATGAC GAGGCGGGAA CGGTCATCGG CACCACGGGC ATCGCCCGGG ACATTTCGGA GCGGCGGGAG GCGGAACTGA AGCTACGGGA GAACGAGGCC CGGCTCGCAA GGGCCCAGCA GATTGCCCAT GTGGGGAACT GGGAATGGGA TATTCTCAAC AACTCTGTTC AATTTTCCGA TGAACTTCTC CGGATATTCC GCATCCCCCC CGGCCAGCCC AACATGACCT ACGAAACATT TCTGGAAGCG GTCCATCCCG ACGACCGGCA AGCCGTAAAC GATACCATCA ATGCAACTCT GCACGAACAG GCACCCTACG GTAAAACGTA TCGCATTATC TGCCCTGACG GGGAGATCAG GCATCTCCGG GCAGAGGGAG AGGTCGAGTT CGATGCGGAA GGGGCTCCGG TGCGGATGCA GGGGGTCGTC AAGGACATTA CGGTAAGTAC CCTTGCCGAG GAGGCCCTGC GTGAGAGTGA AGCCCGCTTC CGGGAGATCT TCGAACAGAA TGAGGACGCC ATCATCCTGA TGGCTCGGGA AACCCTGGAC ATCATCGATG CCAACAGGGC CGCGGAGACA CTGATCGGCC GCGACAAGGA GTCACTCAAC TGGCTCGGCC CCTGGTCGTT CATCGTGCCC GACGACTACA ACCCGTTCAT CGCGGCCATC CCCCCCGTCG ATGATACGAC CCCCTTCCAC CTTAACCGCA TCGGCGTTGT CAGGTCCGAC GGAACCCGTC TCATCTCTTC GATCTGGGGC AAAATCATCC GGCTGCGGGA CACGGAGGTG GTCTACTGCT CCATCCGCGA CCTCACCCAG AGGATCCGCA TGGAGGAAGA GGCACAGATC ACCCATGCCC ACCTTATCCA CACCAACAAG ATGGCCTCCC TCGGCGTCCT CGTCTCCGGC ATAGCCCACG AGATAAACAA CCCCAACACC TTCATCCAGG GGAACGCATC GCTCATCGAA AGTTTCTGGC GCGACACCGT CCCGATCCTC GACCGCCATC GCACCGAAAA TGGTGATTTC ATCCTCGGCG GCCTCCCCGT CGCGGAGGTA GAGCGAATCT TCCCCCGCCT TCTCCACGGG GTAAAGGAGG GTTCGCGCCG CATCAGCGCC ATCGTCAACA ACCTCAAGGA CTTCGCCCGG GAGGATACCG CGAAGGCTTT CGTGCCGATC GCCGTCAACA ACATCGTTGA AAATGCAAAG ATGATTCTCA GCTACCAGAT TCACCGCTAC ACCGACCATT TCCGCATGGA ACTGGCCGAA GGCCTCCCGC TGGCCCGGGG GAAATTTCAG CAGATAGAAC AGGTGGTCAT CAACCTGATC ATGAACGCCC TGCAGGCGCT TCCCGGCAAG GACGCAGGGG TCACCGTCTC CACATCCGCC GACCCCGTTG CCTCGGTGGT CACCATCAGC GTACGCGATG AGGGAGAAGG GATGCAGTGG GAGGTTCTGG AGCGGATCAC CGAGCCCTTC TTCTCCACCA AGCTTGAACA GGGAGGGACC GGTCTCGGAC TCTCCATCTC CGCAGCCATC ATCAGAGAAC ACGACGGCAC GCTGACATTC GAATCGACTC CCGGCAAGGG AACCACCGCC ACGGTAACCC TGCCGCTCGC CTATCCCGCC GGAGAACGAA ACCATGCCTG A
|
Protein sequence | MRKITVNVIA LNILFVIGAT VITTFLFLAA MRDESERLAK VEQEQGIQTF WRLLRAKGTD FRIVDGKLMA GDYVLNGNFE LPDMIQSIFG GTATVFMGDT RVATNIRRED GTRAVGTKLN GPAYDAIFRH NKPFRGEALI FGIPYFTAYD PIRNSRGEII GVLYVGARKN EFHSRYEHYK VNVIAGAAVI STIFTVLAFV VLRDRKRHLE ALQDNETKYR TLFESSSEGI LLFDGVIFDC NEQLCRLFGS SRNEIIGRSP VHFSPELQSD GTPSAEKARH MLNTTLGGEA GIFPWQHRRQ DGTLVDTEVS LKALTIQGRT VLQAVVRDVT EWEKAEESLR MIRLQQQAIL DNIPDLVWLK DIESRFITVN AAFAQACGTL PADLVGKTDL DIWPSDLATL YREDDARVIA SGKQVRTEEP LADVNGKGVW IETIKMPIYD EAGTVIGTTG IARDISERRE AELKLRENEA RLARAQQIAH VGNWEWDILN NSVQFSDELL RIFRIPPGQP NMTYETFLEA VHPDDRQAVN DTINATLHEQ APYGKTYRII CPDGEIRHLR AEGEVEFDAE GAPVRMQGVV KDITVSTLAE EALRESEARF REIFEQNEDA IILMARETLD IIDANRAAET LIGRDKESLN WLGPWSFIVP DDYNPFIAAI PPVDDTTPFH LNRIGVVRSD GTRLISSIWG KIIRLRDTEV VYCSIRDLTQ RIRMEEEAQI THAHLIHTNK MASLGVLVSG IAHEINNPNT FIQGNASLIE SFWRDTVPIL DRHRTENGDF ILGGLPVAEV ERIFPRLLHG VKEGSRRISA IVNNLKDFAR EDTAKAFVPI AVNNIVENAK MILSYQIHRY TDHFRMELAE GLPLARGKFQ QIEQVVINLI MNALQALPGK DAGVTVSTSA DPVASVVTIS VRDEGEGMQW EVLERITEPF FSTKLEQGGT GLGLSISAAI IREHDGTLTF ESTPGKGTTA TVTLPLAYPA GERNHA
|
| |