Gene GM21_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3123 
Symbol 
ID8138473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3625081 
End bp3626151 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content66% 
IMG OID644870727 
Producthistidine kinase 
Protein accessionYP_003022909 
Protein GI253701720 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones148 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGCGA TACGGGGTTA CCTGATCAGC CTGCTGCTCG TCGCCGGCGC CACGGCCATC 
TGCGAACTGG TCCGGCCGCA CCTGATACCG ACCAACATGG TGATGGTCTA CCTGCTGGCT
GTCGTTGCCG CCGCCGCCAA GCTCGGCCGG CGTCCCGCCA TAGCGACGGC CTTTTTCAGC
GTCCTCGCCT TCGACTTCTT CTTCGTCCCG CCCCGGCTCA CCTTCAGCGT CGCGGAGAAG
GAGTACCTGA TCACCTTCCT CGGCTTCTTC GTGGTAGGGG TGATGATAAG CTCGCTGGTG
GCTAAGGTCC GGGAGCAGTC ACTGGAACGG GAGCGGCTCT CGCACGAGGC GGAAAAGGCG
AGGATGCTCG AGGTCCGCGA GAACCTGGAG CGGGCGCTTT TGAACTCCAT CTCGCACGAC
CTAAGGACGC CGCTGGTTTC CATCAAGGGG GCGCTCTCGG CCTTGAAGGA ACAGGGGGGG
CGCCTCTCCC CCGAGGCGCG CCGGGATCTG CTGGATACGG CAAACGACGA GGCGGAGCGG
CTGAACCGTT TCGTCGGGAA CCTCCTCGAC CTGAGCCGCC TCGAGGCGGG GGCGCTCCGC
CCGAGGATCG AGCCTTGCGA CCTGCAGGAA CTCATCGGCT GCGCCATCTC GGCCATGGAG
AGCCGGCTTG GGGACCGCAA CGTGTCGGTG CAGTTGCCGC AGGGGCTCAC CCTGGTTCCC
TTGGACCTGG TGCTGATGAT CCAGGTCCTG GTGAACCTTC TGGACAACGC CAACAAGTAC
GCCCCCCCGG GAGGGAGCAT CGAGGTGGCG GCGCGCGTCA ACGGCGCCTG GCTCACCCTC
AGCGTCGCCG ACCGGGGACC CGGAGCGCCG GAAGCGGAGC TCTCCCACAT TTTCGATAAA
TTCCACCGGG TGCAGGTTCC CGAGAAAACC GGGGGGACCG GCCTCGGGCT TTCCATCTGC
AAGGGGATCG TGGAGGCTCA CGGGGGGAGG ATCGTGGCCA GGAACCGCCC GGAGGGGGGG
CTCGCGGTCG AGATCCTTTT ACCGTTGCAG CAAGGAGCAG AGGAACCATG A
 
Protein sequence
MVAIRGYLIS LLLVAGATAI CELVRPHLIP TNMVMVYLLA VVAAAAKLGR RPAIATAFFS 
VLAFDFFFVP PRLTFSVAEK EYLITFLGFF VVGVMISSLV AKVREQSLER ERLSHEAEKA
RMLEVRENLE RALLNSISHD LRTPLVSIKG ALSALKEQGG RLSPEARRDL LDTANDEAER
LNRFVGNLLD LSRLEAGALR PRIEPCDLQE LIGCAISAME SRLGDRNVSV QLPQGLTLVP
LDLVLMIQVL VNLLDNANKY APPGGSIEVA ARVNGAWLTL SVADRGPGAP EAELSHIFDK
FHRVQVPEKT GGTGLGLSIC KGIVEAHGGR IVARNRPEGG LAVEILLPLQ QGAEEP