Gene Information |
Locus tag | GM21_0753 |
Symbol | |
ID | 8136068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 899495 |
End bp | 901309 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868370 |
Product | response regulator receiver protein |
Protein accession | YP_003020585 |
Protein GI | 253699396 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.00313494 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCTTG TCGGCAATTT GGAAGATCTC GGACTAGGTG AGATCCTGCA GATCGTCAGC CTCAGCAGGA GATCCGGCGT GCTCTCCCTC GAAAGCCGGG GGAGGGAGGC GCGGGTGATC TTCCGCAACG GTCAGGTGAT CCGCGCCACC TCGACCACCT TCCAGCAAAA CCTGGGGGAA GTGCTGATCC AGCAGGGGGT CATAGACCTG GCCATCCTGA AGCGCGCCCT GAGCATCCAG GCGGAGGAGG GGTACTCCCA GCTTCTGGGC GGGATCATGA TCGACCGCTT CGGGGTGAGT GCGGACGCCA TCGAGGTCGT GGTGCGGGAG CAGATCGAGA ACGTGGTCTA TTCCCTGTTC GCCTGGGCCG AGGGGACCTT CGAATTCGAG CTGCAGGAGG TGAGCGAGGC GGACAACGCC AAGCTCGACC CCGTACAGTT CATGCTGAAG CAGGGGTTGA ATCCGCAGTA CCTGGCCATG GAAGGCTCCC GCATCATCGA CGAGCAGCGT CACCGCGGCG AGTCGGGGGA AGAGTGGGAA GCGCCGGCCG CACCCGAGGA AACCCTCGAC CTCGCCTTCG ACCTGTTGCA GGACGCGCCG TCCCAGGCAA CAGCGTCCGC AGCGCCGATT CCCACGCCCG ATTTTCCCCC CCAGCAACCA GACGAGCGGC AAGAAGCGCA GCTTCTCTCC GCGGACCCTT TCGACAGCGC CTCGGCGCCC GCCGAGACCG CAGCCCCCAT CGAGGCAGCC GAGGCCGAAG AGCCTGCCGC GGAGAAGGCC CCGAACCGCC TGGTCCTCGT GGACGACGAT CCCGAGACGC TGGCGGCGCT GGCCCTGGTT TTGGAGCGAG AGGGTTATCT CGTGGACGCC ATGGAGAAGG GGGAAGACGC CCTGATCCGA GTCGACTCCC TCTACCGCGA AGGGTTGCAG CCGACCGTGG TGGTCGACCT GATCATGCCG AGGATGGACG GCACCGGCAT CCTGGGTGGG CTCGAGCTGA TGGAGCTTGT GCGCAACAAC TTCCCGGAGG TGCCGCTTCT GGGGCTTTCC GACTTCCACA ACGACGAGGC GCAGAAAAAG CTGCGCAGCA TGGGGATTCC CATCCTGATC AAGCCGCTCA AGGGGGAGCT TGAGGATGCG CTGGAGGTCT TCGCCCCAAG GCTCCTAAAG GCGCTCGGCA ACCTCACCGC GGTGGAGGAA CTCAGCTTCA GCAGCGTCAA CATAGGCGAC GAGTTGAGGC TGGAGCTGGG AGAAGAACCG GCGCTCGCGG CGCCGCATGG AAACCAGAGC ACCGGTATAT CGCAACTGCG CGGCATGCTT GAGGAGTTGA ACAACCCCCA GCTTGGCGGC GGGATCATCC TTTTGGTGCT GCGTTTCGCC GCCGAGTTCG TGAACCGCGC CGTGGTGCTG CTGGTCAAGA AGGAGTCCGT GCAGGGACTC GGGCAATTCG GGCTTCAGGA CAAGGACGGA AGTGCCGATT ACAGGATCAG GAACCTGAGC ATACCCAAGG GAGAACCCTC GCTCTTTACC GAGGTGCTGG AGACCCGGTT CCCCGTGAAG GGGGAGGTGG AGCCCAGCCA CTGGAGCGAC TACTTCCTTG AGCAGCTAGG TGGCGGGCGT CCCGCCGAGG TGTTCGTTGG GCCGATCGTA AGCGAGGGGA AGGTGGTAGC CGTGCTCTAC GGCGACAACC TCCCGGAGGC GAAGCCTGTA GGCGACACTG ATTCGCTGGA GATATTCCTG AGCCAGGCCG GCATCGCGAT GGAGAAAGCC CTGCTGCAGC GCAGGCTGCA AGACCAGGGA 
CGGGGAGAAA TTTGA
|
Protein sequence | MSLVGNLEDL GLGEILQIVS LSRRSGVLSL ESRGREARVI FRNGQVIRAT STTFQQNLGE VLIQQGVIDL AILKRALSIQ AEEGYSQLLG GIMIDRFGVS ADAIEVVVRE QIENVVYSLF AWAEGTFEFE LQEVSEADNA KLDPVQFMLK QGLNPQYLAM EGSRIIDEQR HRGESGEEWE APAAPEETLD LAFDLLQDAP SQATASAAPI PTPDFPPQQP DERQEAQLLS ADPFDSASAP AETAAPIEAA EAEEPAAEKA PNRLVLVDDD PETLAALALV LEREGYLVDA MEKGEDALIR VDSLYREGLQ PTVVVDLIMP RMDGTGILGG LELMELVRNN FPEVPLLGLS DFHNDEAQKK LRSMGIPILI KPLKGELEDA LEVFAPRLLK ALGNLTAVEE LSFSSVNIGD ELRLELGEEP ALAAPHGNQS TGISQLRGML EELNNPQLGG GIILLVLRFA AEFVNRAVVL LVKKESVQGL GQFGLQDKDG SADYRIRNLS IPKGEPSLFT EVLETRFPVK GEVEPSHWSD YFLEQLGGGR PAEVFVGPIV SEGKVVAVLY GDNLPEAKPV GDTDSLEIFL SQAGIAMEKA LLQRRLQDQG RGEI
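The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete, unspliced, and ends in a stop codon (consistent with the TGA at the end of the gene sequence).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete, unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of GM21_0753.
length = gene_length_bp(899495, 901309)
print(length, protein_length_aa(length))  # prints "1815 604"
```

Under these assumptions the computed values agree with the Gene Length (1815 bp) and Protein Length (604 aa) fields.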