Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1801 |
Symbol | |
ID | 8137132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2094441 |
End bp | 2096009 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869413 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003021613 |
Protein GI | 253700424 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.0233266 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATA TCGAGAAGGG GTTTAAGACG CTGTTCCAGA ACATGGAACT CCCTGCCTAC CTGCAAAGCG CGGACGGCTC CATGGTCGAT GTCAATCATG CGGCCTTATC GATGTTCGGC ATGACCAGGG AGGAGTTCCT TGCCGGTCGC GGCCCCTCGC CTTGCCGGCG CCTGACCGGT GAAAGCGGCG AGGAGCTTTC GCCCGACCGG CACCCCGCCG CTGTCGCCCT TGAGAGCGCA CGGGAGGTCC GGGATTTCAT CACCTCGGTG CAGCAGGAGG GAGAACCTCT TCCGCTCTGG GTCAACCTGA ACGCGATACC GGTGCTGGAT GAGGGAGGGG GCGCCCCCCA CGCCGTCCTG GTCACGCTGA GGGACATCTC GAAGCTCAGG CTCCTGGAGC AGGCTGTCGC CGCGACGGCC GCCGAGCGGG AACAGGAACA AAAGCAGTTG CAGATGCACC ACGCCCAGAA GCTGGAAAGC CTCGGAGTCC TCGCCGGCGG CATCGCCCAC GATTTCAACA ACATACTCAC CTCCATCATG GGAAACACCG AGCTTGCGCT GATGCAGCTC ACCCCCGGCG CCCCTGCCTG CGAAAACCTG CGCCGGGTGG AGCGGGCCTC CCACCGCGCC GCGGCCCTGT TGAAGCAGAT GCTTTTCTAC CTGGGTAAGG GCACCTTCTC CTCCGAACCG ATAGATCTGA ACCGGTTGGT GGAGGAGATG GCGGATATGC TGCAGGCTGC CGTTTCCAAG AAGGCGACGC TGCGCCTGGA GCTCTCCCGG CCGCTGGGGC TTTTCAGCGC CGACCCGGTC CAAGTGCGCC AAGTGGTGAT GAACCTGGTC CTGAACGCTT CGGAAGCGCT CGGAAACGAG GTGGGCAAGA TCAAGATCTC CACCGCGCAA AGGCACTACC GGCAGGAGGA GCTTGCGGAG TTCCGAGGCA GCGAGGAGCT TGCCCCAGGC CCCTACCTGA CCCTGTCGGT GAGCGACACG GGTTACGGCA TGGACAAGGA GACGAGGGCG CGGTTCTTCG ACGGCCTGTT CCCCGCTACC GGACGAGGGT TGGGTATGGC GGCCATCCTC GGCGTGGTCC GGGGGCTCAG AGGGGGGGTG CGGCTGCAAA GCGACGTGGG GAAAGGCTCC GCCTTCACGC TGCTGATCCC TGTGGACGCG GATGTGTTGA CGGCCGCCAA GCCCGCCGAG CGGGCCTCCG AACCGATGGG GAAGGGTCCC GTGCTTTTGG TCGACGACGA AGAGGAGGTG TGCCTGTTGG TGGGCGCCAT GCTGGAGCGG CTCGGGTACG AGGTGATCGC CGCACGCGAC GGCCATCAGG CTCTCGAGCT TTACCTGCAG CGCGACGACT ACGCCTTCGT CATGCTCGAC CTCACCATGC CGGTCATGGA CGGCGAGGAG ACCTACGAGC AGCTGCGCAG TATCGACCCG TCGGTGAGGG TGATCATCAC CAGCGGCTAC AGCGAAAACG AAGTGGCGCG CCGCTTCGAA GGAAAAGGGG TGAAGGGATT GCTGCAAAAG CCTTTCGACA TGGACGCGCT GCGCAGGGTT CTCAGGTAG
|
Protein sequence | MSDIEKGFKT LFQNMELPAY LQSADGSMVD VNHAALSMFG MTREEFLAGR GPSPCRRLTG ESGEELSPDR HPAAVALESA REVRDFITSV QQEGEPLPLW VNLNAIPVLD EGGGAPHAVL VTLRDISKLR LLEQAVAATA AEREQEQKQL QMHHAQKLES LGVLAGGIAH DFNNILTSIM GNTELALMQL TPGAPACENL RRVERASHRA AALLKQMLFY LGKGTFSSEP IDLNRLVEEM ADMLQAAVSK KATLRLELSR PLGLFSADPV QVRQVVMNLV LNASEALGNE VGKIKISTAQ RHYRQEELAE FRGSEELAPG PYLTLSVSDT GYGMDKETRA RFFDGLFPAT GRGLGMAAIL GVVRGLRGGV RLQSDVGKGS AFTLLIPVDA DVLTAAKPAE RASEPMGKGP VLLVDDEEEV CLLVGAMLER LGYEVIAARD GHQALELYLQ RDDYAFVMLD LTMPVMDGEE TYEQLRSIDP SVRVIITSGY SENEVARRFE GKGVKGLLQK PFDMDALRRV LR
|
| |