Gene GM21_2061 details


Gene Information

Locus tag: GM21_2061
Symbol:
ID: 8137397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2390313
End bp: 2391551
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 62%
IMG OID: 644869676
Product: histidine kinase
Protein accession: YP_003021871
Protein GI: 253700682
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
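
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2390313
END_BP = 2391551
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```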


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.00000000000174417
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCAGAA AAGCGACGCT GATAATCGGA ATATTCGCAA CCTGTTCGCT GCTTTGGTTC 
ACCGTCTACA ACTACCGCCA GGCCGGCCCC ATCGCCGAAG AGAACCTGCG CGGACTCGCC
CTTACGCTTT CTTCCGCCAT CGAGAACCTC GCCGTCAACG ACCCGAGCCT TGCCGCCCTT
GCCAAGTTCC GCACCAACGA TATCGCCTAC TTCGCCTTGG TAGACCGCCA GGGTCTTTTC
CGCTTTCACT CAAACCCGGA CCTGATCGGT GCCTCCCTGC CAGGAGCCGG GCGCAAAGCT
GCGCCAAGCC TTGAGCGATC CGAATCGCGT GTCACCCTGG GTACGGGTGA GCAGGCGTTC
CTGGTCACCG CCCCCATCAA CCTCTCTGGC GAAACGCTCG CCCTTCATCT CACCCTGCAC
ACCTATCGCG CCGATGCCGT CGTGCGCAGC GCCAAGCTGA ACCTGATCAT AATGCTCGCG
CTGATCGCTG CAGGGTGGCT CCTTAGTCTC GGCCTGTTCC GCTATGCCCG CAGGGAGGAA
TTGCACCAGG CCGAAATGAC ACGCAAGGGG AACCTCGCCA AGCTGGGGGA GATGGGGGCG
CTCCTGGCAC ATGAGATCAG GAACCCTCTG GCCGGCATAA AGGGATTTGC CCAGGTGATC
GCCAAAAAGC CGCAGGAAGC GCGCAACGGC GCCTTCGCAG AGAACATCGT CATCGAGGTC
GTGCGGCTGG AGACGCTGGT GAACGATCTT TTGGCCTATG CCGCAGGCGA CGGAGCCCAA
CCGGCGCGGT TCGATCTGGG CCAGTTGATC GATCACGCCC TCTCGCTCCT GGCCCCTGAG
GCAGCCGAGC GCGCGGTGGC CGTTTCCTGC AGCAGCACCG GTTCGCTTTT CGTGATGGGA
AACAGGGACC GGATCGAGCA GGCGCTCCTC AACCTCGGTA AAAACGCCCT GCAAGCTATG
GGCGCAGGGG GGGTACTCGA AATAGCGTGC GCCACCGCCG GGGGCGAAGC GAGGATCAGC
ATCAAGGATA CCGGGCAGGG GATAGCGGAA GCAGACCTTC CCAAGGTCTT CGAACCATTC
TTCACCACCA AGGCCAGGGG TACCGGGCTG GGACTCGCCC TGTGCAGAAA GGTGATCGAA
GAGCACGGCG GGACCATCAG TTTGGAGAGC AGGGTAGGGG AGGGGACCAC GGTGACCTTC
GCGTTGCCCA TCCTCCCCGC GGACAAGGAG CAGACATGA
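
The gene sequence above is laid out in blocks of 10 bases, so it needs to be joined before any computation. The sketch below is a hedged illustration of how the block layout might be cleaned and how a GC fraction could be recomputed for comparison with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as it appears in the record.
FIRST_LINE = "ATGTTCAGAA AAGCGACGCT GATAATCGGA ATATTCGCAA CCTGTTCGCT GCTTTGGTTC "

cleaned = clean_sequence(FIRST_LINE)
assert len(cleaned) == 60          # six blocks of 10 bases
assert cleaned.startswith("ATG")   # the CDS begins with a start codon
```

Passing the full cleaned gene sequence to `gc_fraction` and rounding to a percentage would give a value to compare against the reported GC content.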
 
Protein sequence
MFRKATLIIG IFATCSLLWF TVYNYRQAGP IAEENLRGLA LTLSSAIENL AVNDPSLAAL 
AKFRTNDIAY FALVDRQGLF RFHSNPDLIG ASLPGAGRKA APSLERSESR VTLGTGEQAF
LVTAPINLSG ETLALHLTLH TYRADAVVRS AKLNLIIMLA LIAAGWLLSL GLFRYARREE
LHQAEMTRKG NLAKLGEMGA LLAHEIRNPL AGIKGFAQVI AKKPQEARNG AFAENIVIEV
VRLETLVNDL LAYAAGDGAQ PARFDLGQLI DHALSLLAPE AAERAVAVSC SSTGSLFVMG
NRDRIEQALL NLGKNALQAM GAGGVLEIAC ATAGGEARIS IKDTGQGIAE ADLPKVFEPF
FTTKARGTGL GLALCRKVIE EHGGTISLES RVGEGTTVTF ALPILPADKE QT