Gene GM21_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3902 
Symbol 
ID8139276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4487799 
End bp4489205 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content61% 
IMG OID644871519 
Producthistidine kinase 
Protein accessionYP_003023677 
Protein GI253702488 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.000116044 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAACTCA ATACCAAGCT GGTGATGATC ATGCTCACCA TGCTTATCGT GGCGACGGCG 
ATGCTCTTCG TCCTGAACCA GTTCAGCCAG AACGACCTGG TGGGGGAGAT CCAGGAGAGT
TCCACCGTGG TATCGAAAGC CATTCAGCTC AGCGTGGAAG ACCTGACCTC CGAGGTCGAA
TCATCGCGCC TGACCGAGTA CCTCCAGCAG GCGAAGAGCA AAGGGCTCAA CGAGATCAAC
ATCATCAACA ACGAGGGGGA GATCATCAAC TCCTCGGATC CCGCCCAAGT CGGCAAAAAA
CGCGAAATCA ACAAGCTGGA GAAGGGGCTT CGGGCCTCGC GCCGCGGCGG CGGCGGGGGG
CCGCTCAAGC CGTACGACCT GGTGGTGCCG GTCATCGTGG GCGACGAGCA GCTGGGTTAC
GTGCAGGTGA ACCTCCTTCT CGACAACATA CGCGACATCC AGCACGCCAA CTTCGTCAAT
CGTCTGGCCG CGACCACCAT GGTGTTCCTG ATGGGGATGA TACTGATCAT CTACCTGGCG
CGCCGCTACA CCTCGCCGAT ACACCGGCTC GCCACCGGCG TCAAGCACGT CTCCGGGGGG
GATCTGAGCG TCACCTTCCA GGTGGGGAGC GGCGACGAGA TCGGGGAACT GGCCGAGAAC
CTGAACGAGA TGGTGGAGAA GCTGAAGGAA AAGGAGCAAC TCGAAAAGCG GCTCTACGAG
GCGGAGCACC TCTCCAAGGT GGGGCAACTG GCCGCGGGGA TCGCGCACGA GATCAGGAAC
CCGCTCAATT ACATAAGCCT CGCCATCGAC CACCTGAAGA GCGAATCCCT CCCCTCCTGC
CCCGAAAAGG CCAAGGAGCT GGAGTCGATC GCCAACAACA TCAAGGAAGA GGTGCGCAAG
GCGAACTACA TGGTGCTCAA TTTCATGAAT TACGGCCGAC CCTTGAAGCT GCGGCTGCAG
CGGGTATGCT ACCCTGAGCT CGTGGACAAG GCGATGCAAC TCATGAAAGA TCGGCTCGAC
GAGAGGGGGA TCGAAGTGGT GCGGGACATA CCCGAGTACC TGCCGCCGAT GCTGGCGGAC
CCGGAGCTGA TGCGCAACTG CCTGTGCAAC TTCATCAGCA ACAGCACCCA GGCGATGCCG
GAGGGGGGGA AGTTCACCAT CGGCGCGAGC ATCGCCCCCG AAACCGGCGA GTTCCGCCTC
ACCTTCAGCG ACGAAGGGTC GGGGATCGAG CCGCAGGATC TGGAGAAGGT GTTTCAGCCC
TACTTCACCA CCAAGGAGGC GGGGATCGGC CTAGGACTCG CCATCACCGA ACGGATCGTG
AGGGAGCACG GCGGCGGCAT CGCGGTTCAG AGCACGAAAG GGGAAGGGAC CACCTTCTCG
GTCACCCTCC CGGCGGCAAC GGCATAA
 
Protein sequence
MKLNTKLVMI MLTMLIVATA MLFVLNQFSQ NDLVGEIQES STVVSKAIQL SVEDLTSEVE 
SSRLTEYLQQ AKSKGLNEIN IINNEGEIIN SSDPAQVGKK REINKLEKGL RASRRGGGGG
PLKPYDLVVP VIVGDEQLGY VQVNLLLDNI RDIQHANFVN RLAATTMVFL MGMILIIYLA
RRYTSPIHRL ATGVKHVSGG DLSVTFQVGS GDEIGELAEN LNEMVEKLKE KEQLEKRLYE
AEHLSKVGQL AAGIAHEIRN PLNYISLAID HLKSESLPSC PEKAKELESI ANNIKEEVRK
ANYMVLNFMN YGRPLKLRLQ RVCYPELVDK AMQLMKDRLD ERGIEVVRDI PEYLPPMLAD
PELMRNCLCN FISNSTQAMP EGGKFTIGAS IAPETGEFRL TFSDEGSGIE PQDLEKVFQP
YFTTKEAGIG LGLAITERIV REHGGGIAVQ STKGEGTTFS VTLPAATA