Gene GM21_3734 details

Gene Information

Locus tag: GM21_3734
Symbol:
ID: 8139108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4301448
End bp: 4303445
Gene Length: 1998 bp
Protein Length: 665 aa
Translation table: 11
GC content: 64%
IMG OID: 644871353
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003023511
Protein GI: 253702322
COG category: [T] Signal transduction mechanisms
COG ID: [COG2205] Osmosensitive K+ channel histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
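
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4301448
END_BP = 4303445
GENE_LENGTH_BP = 1998


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1998
print(coding_codons(GENE_LENGTH_BP))   # 665
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.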


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 124
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCACTTCG CACCGTTCCT ACTCATAGCC GTTTGCGAAC TGACGCTGCT CTTCATCGGG 
AGGAGCACCC CGCTGCCGGG GTGGCTGCAC CTCAGTGTCT CGGTCACGGT TGCCGCCGTG
CTCCTCTTTG TCGCCGCCCG GGCCTGGAAA CGTCAGCTCC GCGTGCAGGA AGGGCTGCAG
CGCGAGTTGG AGGGGATGCG GCAGGAAGCG GTGAAAAGCG CCCGCCGCTA CAAGAGCCTT
TTGGAGGGGG CGGGTAACGC TATCTTCATC TTCAGCGCCG ACACCGGTCT TTTGGAGGAA
GAAAACCGCG TGGGACGCGA GCTTTTGGGC TTCAACAAGG AGGAGCTCGA CTCCATCCAG
GCGCGCGACC TGTTGCATCC CGACGAGCAT GAGCGGTTCC GCAGCTTCCT CTATCAGGTG
AAGCGCAACG GGCGCGGGGA GCTGGACTCG GTGCGGATCA GGAAGAAGAA CGGCACCTTC
TTCCTGGCCG AGATCAACGC CCGGCTCATC GACCTGGGGG ACGAGCAGGT GGCCCACTGC
CTTATGCGCG ACATCACCAA GAAAAGGCGC ACCGAAAAAG AGATCTGGCA GCGAAACCGC
GAACTTTCCA TCCTCAACGA CATCCTCACC GGCATGAACC ACTCCGCGGA CCTGGAACAG
GAGCAGCATA GAACCCTGCT GGAGATCATG GAGCTCTTCG AGGCGGGGGG GGGCACGATG
CACCTGTACC GCGGCGACCA GAAAAGCGCG CGGCTTTGCG CCAGTTGCTG CGTCTCTGCT
GAGCTGGAGC AGCGCATCTC TCAAAGCCTC GCCCATGACC CGGAGCGCTT CCTGAAGGTG
AAGAGGCTGA AGGCGCCGCA GGTGAAGGAG CTGGGCGCCG CCGCCGAGGG GTGGCAATGC
TTCACCGCGG TCCCCATCAG CGCGCAGGAG CACCTGGTCG GCGTGATGCA CCTCATGCAT
CGCGAGCCCT GCCGCCACAG CGCAGAAGAG CTCCGCTTCC TGGAAAGCGT CGGCAAGCAG
ATGGGCAACA TCATCGAGAA AGGAAGGCTC TTCGCCGAGC TGAACTGGAA AAGCGGCGAA
CTGCTCCGCT CGCACCGCCT TTTGGAGAAA AGCAGCCACA ACCTCTCCGT TTCCGAGATC
CGGCTCAAGC AGAACCTGGC CCTGGTGGAG CAGGCCAACC TGGAGCAGAA CCGGCTGGAC
CGGATGAAGA ACCAGTTCCT GGGGATGGTC TCCCACGAGT TCAACACCCC GCTCACCAGC
ATCATCTCCG GCGTCGAGCA CCTTTTGCAA AACGGCTGGC ACAGCGAACA AGAAGCGCGC
TCCGTGCTGG AACTGGTGCG GGACGGCGGG CTCAGGCTGA AGGGGCTGGT CGCGGACCTT
TTGAAGCTGA TCAAGGTGGA GGCCAGGCGC GGCGCGCTGG AGACCTCGGC GGTGCACCTG
CGGCATTTCC TGAGCGAACT CTCGGCGCAG TTGCAGCCGC AGTTGGAAGA GCGGGGCCAG
CGCGTCACCC TGGCGGGGCT GGACGAGCTC CCCTACTTCG ACGCCGACCG CCACTACCTG
GAACGGGTCT TCGGCGAGCT TTTGCAAAAC GCCATCAGGT TCAGCCCGCA CAGGGGCGAG
ATTGTGGTTA CCGGGAAGGT GGTGGACCGT CCCGCGCTCC TGGAACGCAG GGAGACCCTG
GAGCGCTTCA ACCCCGAGTT CCTCAGGCGC TGCGGCGATC GCTGCTACTT GGAAGTGGAG
GTGCGGGACA GCGGCATCGG CATCCCGCCC GAGGAGCAGC AGGGGATCTT CGGGATCTTC
TACGAGGTCG GCGAGATAAA GCACCACTCC AGCGGCATGA GCCGCGAGCA GGGAAGAGGG
GCGGGGTTAG GTCTCGCCAT GGTGAAGGGG ATGGTGGAGG CCCACGGCGG GATGGTGTGG
GTGGAGAGTG CGGGCGGAAG CTCCTTCTTC CTCGTCCTCC CCCTGGAGCA GGAAGCGATC
CAGCCGGCCC TGTTCTGA
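
The gene sequence above is displayed in blocks of ten bases, so it has to be normalized before any computation. A minimal sketch, with hypothetical helper names, that strips the grouping and computes a GC fraction:

```python
def clean_sequence(text):
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last display lines of the gene sequence, copied from the record.
first_line = "TTGCACTTCG CACCGTTCCT ACTCATAGCC GTTTGCGAAC TGACGCTGCT CTTCATCGGG"
last_line = "CAGCCGGCCC TGTTCTGA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))   # 60
print(head[:3])    # TTG
print(tail[-3:])   # TGA
print(round(gc_fraction(head), 2))
```

Applied to the full sequence, `gc_fraction` gives a value that can be compared with the reported GC content of 64%. TTG is one of the alternative initiation codons in translation table 11, and TGA is a stop codon.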
 
Protein sequence
MHFAPFLLIA VCELTLLFIG RSTPLPGWLH LSVSVTVAAV LLFVAARAWK RQLRVQEGLQ 
RELEGMRQEA VKSARRYKSL LEGAGNAIFI FSADTGLLEE ENRVGRELLG FNKEELDSIQ
ARDLLHPDEH ERFRSFLYQV KRNGRGELDS VRIRKKNGTF FLAEINARLI DLGDEQVAHC
LMRDITKKRR TEKEIWQRNR ELSILNDILT GMNHSADLEQ EQHRTLLEIM ELFEAGGGTM
HLYRGDQKSA RLCASCCVSA ELEQRISQSL AHDPERFLKV KRLKAPQVKE LGAAAEGWQC
FTAVPISAQE HLVGVMHLMH REPCRHSAEE LRFLESVGKQ MGNIIEKGRL FAELNWKSGE
LLRSHRLLEK SSHNLSVSEI RLKQNLALVE QANLEQNRLD RMKNQFLGMV SHEFNTPLTS
IISGVEHLLQ NGWHSEQEAR SVLELVRDGG LRLKGLVADL LKLIKVEARR GALETSAVHL
RHFLSELSAQ LQPQLEERGQ RVTLAGLDEL PYFDADRHYL ERVFGELLQN AIRFSPHRGE
IVVTGKVVDR PALLERRETL ERFNPEFLRR CGDRCYLEVE VRDSGIGIPP EEQQGIFGIF
YEVGEIKHHS SGMSREQGRG AGLGLAMVKG MVEAHGGMVW VESAGGSSFF LVLPLEQEAI
QPALF
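
The relationship between the gene and protein sequences can be illustrated by translating a few codons. This is a sketch, not the annotation pipeline's method: it relies on translation table 11 using the standard codon assignments, assumes that an initiating TTG, GTG, or ATG is read as methionine, and uses hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# A subset of the initiation codons allowed by table 11 (assumption: enough here).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text):
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        # An initiation codon at position 0 is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate_cds("TTGCACTTCG CACCGTTCCT ACTCATAGCC"))  # MHFAPFLLIA
```

The output matches the first ten residues of the protein sequence. Translating the final display line of the gene, `CAGCCGGCCC TGTTCTGA`, gives `QPALF`, which matches the end of the protein.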