Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2097 |
Symbol | |
ID | 8137433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2436518 |
End bp | 2440033 |
Gene Length | 3516 bp |
Protein Length | 1171 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869712 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003021907 |
Protein GI | 253700718 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.000000135263 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCTACG ACCAATCCGC ATCCAGGCTG TCGGCTGAAG GTATGGAGGA GGCGCGGGCG CACCTGGCGG CCATCGTCGA TTCCTCCGAC GACGCCATCG TCAGCAAGTC CCTGGAAGGA ATCGTCGCCA GCTGGAACCA TGGGGCCGAG AAGCTCTACG GCTACAGCGC CGCCGAGGCC ATCGGTCGCC ATATTTCCTT CCTGGTGCCT CCGGAGCGCC TCGACGAGCT GAGGCAGATA ACCGAGAAGA TCCTGCAAGG GGATCGGATC AAGAACCTGG AAACCGTCCG GCTTCACAAG GACGGCCGTG AAATCCCAGT TTCCATAACC CTCTCCCCCA TCTTGTGCGA GACAGACACC ATCATAGGGA TTTCCATGAT CGCCCGCGAC AATACCGAGC GCCTGAGGAT GGTGCGGCTT TTGCGGGAGA GCGAGGTGCG TTACCGGGTT CTGGTGGAGA TGGCCCCGGA CGCGGTCTTC GTGCACCAGG ACGGGCGTTT TGTCTACGCC AACTGCGCGG CGCTGGGCAT CTGCGGGGTG CAGAGCAGGG AAGAGCTGCA ACGGCACACC CTGTTCGATC TGGTGCATCC CGAGGACAGG GAGACGCTGC GGGATCGGAT CCACGAGCTG ATGGCGAACC AAGGCATAGC GCACCAGGAA TACCGGTTGG CCTGCCTGCA CGGCGAGGAG CTGGTCCTGG AGACCTCCTC CAGCCTCATC GAGTATCAGG GGGTCCCCTC CATACAGGTC ATCGCCCGCG ACGTGACGCT CAGGAAGCAG CAGGAACGCG AGCGAGAGCA GATGCTAAAG GAACTCGCCT TCCAGCAGAA CCGCTTCGAG ACCACGGTGA GGCAGTTGCC GGTCGGGGTG GTCATCGTCG AGGCCCCCTC GGGAAAGACT CTTTATCAAA ACGAGCGCGC ACGGCAGATA TTCGGTCATG ACATCGGCGC CGTTTCCGGC ATAGAACAGT ACCGACAGTG GGAGATGTTC CGCCTGGACG GCACCCCTCT TCCCGCGGAT GAGTACCCCG TCGCCCGTTC CCTGCTGCAA GGGGAAGCCT GTTGCGGCGA CGAGTACAAG ATCCGGCGGG CCGACGGCTC CTACGGCTAC ATCTTCGTCA ATGCGACCCC GTTACGTGAT GGGTCAGGGG CGATCGTTTC CGCCGTGGCG GCCTTTAGCG ACATCACGGA GAGAAAGCTT GCCGCCCAGG CACTCTTCGA GAGCGAGGAA CGCCTGAAGC TCGCGCTTGA GGCAGCCGGG ATGGGATCCT GCGACATGGA GGTGAAGACA GGCGCCGGAA TCTGTTCCCG GCGCTACTTC GCGCTTTTGG GCTACCCCGA GCCCGAGGGC GAATCCGCAC CGGCCACAAC CTCCATGTGG CTCGACCTGG TACACCGGGA CGACCTGGAA GAGGTGCAGC GTCGGCTGGA GCAGGCCCGC AGGGACAACA CCCTGTTTGG CTCGGAGCAC CGCATTGTCA AGGCCGGCAG CGAAGAGACG GTCTGGGTCA ACGTGATGGG GCGCTTCATC TGCGAGCAGA CGGAAGAGAG GTGCCGCTTC ATCGGCGTCA TCTTCGACAT CAGCGAACGC AAGGCTGCGG AGGAGGCGCT TTTGCGCAAC ATCCGGCGCT TTCGCCGCAT GGCGGACTCG ATGCCCCAGA TATTCTGGAC CGCGAACGCC GATGGGAGCG TGGACTACAT CAACGCCTAT TTTCAGGAGT ACGGCGGCAT CGACAGAACC GGTGTGGAGG AGGGGCGTGT CACCGCCGAG AGTCTACTTT CCGAGGTGGT CCATCCCGAC GACCATGACG GGCTGTGGAT CGCCTGGTGC CGGTCCATGG AGAGCGGCGA GCCGTTCCAG TACGAAAGCC GGGTGCGCCG CAAGGACGGG GTGTACCGCT GGTATCTAAG CCGTGCCCGT GCCGAGCTCG ACGAGCAGGG GAGGGTGGTC AAGTGGTACG GGACCGCCAC CGACATCGAC GAGCTGAGGG AGACGCAGGA AAAACTCGCG GCCAGCGAGA CCAGATTCCG CTGGCTTTAC GAATCGAACC TGATCGCCAT CTTCTACTGG AACCGCGACG GCGCCATCAC CGACGCGAAT CAGGCCTACT GCGACCTCGC CGGATATACC GCCGAGGAGT GCCGGTCCGG TGGGCTCAAT TGGCTGGACG TGACGGCCCC GGACTACCTG GACAGAGACG TAGCCGCCGT GGCCGAGAGC CAGATCCAGG GCATCTGTAA GTCCTACGAG AAGCTCTTCA TAAACCACAC TACCGGCGAG AAGGTGCCGG TACTGACCGC CATCGCCATT TCGGCCGGCT CAGACGAGGG GATCGGCTTC GCCGTGGACC TGACCGAGCT GAAGCGGGCC GAACAGGCGC AAAAGCACAG CGAATCGACG CTGAAACTCG CCGTCGAGAC CACCGGGCTC GGCATCTTCG ACCTCGATCT GAAAACCGGC AAGGGGGAAT GGTCCCCCAT CGCCAAGAAG CACTACGGCC TTCCTGCCGA CACGGAAGTA GGCCTCGCCA CGGTGATCGA GGGAGTCCAT CCGGAAGACC GCGAGAAGAT AGAGCGGATC GCGAGGGACG CCGCGAGCCC GGGAGGCGCG GGCATTTACA GCGCCGAATA CAGGACCGTG GGTGCTGTGG ACGGCAAGGT GCGCTGGATA AGCATGCGCG GCCGGGTCTT CTACGATGAG GAGGGGACCC CGCTGCGCCT GGTCGGCGCC TGCCTCAACG TAACCGACGT GGTGCAGGCC CAGGAGACGC TCAAAGAAGA GATGAGCGAA AGGCTGCGTG CGGTCGAGGA ACTGCGCCGC CAGGAGCAGC TGCTGATCAG GCAGGGAAGG CTTGCCGCCA TGGGAGAGAT GATCGCCAAC ATCGCCCACC AGTGGCGCCA ACCCCTGAAC ACGCTTGGGC TAATCATCCA GGAGCTGCCC ACCTACCAGG AGCGGAACCT TCTCACACGC GATTACCTGG AAGGAAGCGT CTCCCGCGCC ATGCAGGTGA TCAACTACAT GTCGCAGACC ATCGACGGGT TCCGCAACTT CTTCGGCCCG GACAAGGAGC ACCAGACGTT CCTGGCGAGC GAGGTGCTGG AAAAAACGGT CTCCATCCTC GACGCTGCCT TCGCCGAGCT GAACCTGGAG CTCGTGGTCC GGGTCGACCG CGAGGCGGTG GTGCAGGGGA TCCCCAACGA GTACTCGCAG GTGCTCCTCA ACATCCTGAT GAACGCGAAA GACGCGCTCC TGGAGCGCAA AGTCGAGCAT CCCAAGGTCG AGGTCAGGCT ATTCAAAGAG GGGGAGAAAG CGGTGGTCAC CATCACGGAC AACGCAGGGG GGATACCTCC GGAGATCATG GACAGGATTT TCGACCCGTA CTTCACCACC AAGGGACCGG ACAAAGGGAC CGGCATCGGC CTGTTCATGT CCAAGACCAT CATCGAGAAG AACATGAAGG GCTCGCTCAC GGTGATTAAC CAGCCGGAAG GGGCGCAATT CCGCATCGAG GTCTGA
|
Protein sequence | MAYDQSASRL SAEGMEEARA HLAAIVDSSD DAIVSKSLEG IVASWNHGAE KLYGYSAAEA IGRHISFLVP PERLDELRQI TEKILQGDRI KNLETVRLHK DGREIPVSIT LSPILCETDT IIGISMIARD NTERLRMVRL LRESEVRYRV LVEMAPDAVF VHQDGRFVYA NCAALGICGV QSREELQRHT LFDLVHPEDR ETLRDRIHEL MANQGIAHQE YRLACLHGEE LVLETSSSLI EYQGVPSIQV IARDVTLRKQ QEREREQMLK ELAFQQNRFE TTVRQLPVGV VIVEAPSGKT LYQNERARQI FGHDIGAVSG IEQYRQWEMF RLDGTPLPAD EYPVARSLLQ GEACCGDEYK IRRADGSYGY IFVNATPLRD GSGAIVSAVA AFSDITERKL AAQALFESEE RLKLALEAAG MGSCDMEVKT GAGICSRRYF ALLGYPEPEG ESAPATTSMW LDLVHRDDLE EVQRRLEQAR RDNTLFGSEH RIVKAGSEET VWVNVMGRFI CEQTEERCRF IGVIFDISER KAAEEALLRN IRRFRRMADS MPQIFWTANA DGSVDYINAY FQEYGGIDRT GVEEGRVTAE SLLSEVVHPD DHDGLWIAWC RSMESGEPFQ YESRVRRKDG VYRWYLSRAR AELDEQGRVV KWYGTATDID ELRETQEKLA ASETRFRWLY ESNLIAIFYW NRDGAITDAN QAYCDLAGYT AEECRSGGLN WLDVTAPDYL DRDVAAVAES QIQGICKSYE KLFINHTTGE KVPVLTAIAI SAGSDEGIGF AVDLTELKRA EQAQKHSEST LKLAVETTGL GIFDLDLKTG KGEWSPIAKK HYGLPADTEV GLATVIEGVH PEDREKIERI ARDAASPGGA GIYSAEYRTV GAVDGKVRWI SMRGRVFYDE EGTPLRLVGA CLNVTDVVQA QETLKEEMSE RLRAVEELRR QEQLLIRQGR LAAMGEMIAN IAHQWRQPLN TLGLIIQELP TYQERNLLTR DYLEGSVSRA MQVINYMSQT IDGFRNFFGP DKEHQTFLAS EVLEKTVSIL DAAFAELNLE LVVRVDREAV VQGIPNEYSQ VLLNILMNAK DALLERKVEH PKVEVRLFKE GEKAVVTITD NAGGIPPEIM DRIFDPYFTT KGPDKGTGIG LFMSKTIIEK NMKGSLTVIN QPEGAQFRIE V
|
| |