Gene GM21_0009 details

Gene Information

Locus tag: GM21_0009
Symbol:
ID: 8135308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 14435
End bp: 15964
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 65%
IMG OID: 644867626
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003019854
Protein GI: 253698665
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
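
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # both end points are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the stop codon encodes no residue
    return gene_len, protein_len


print(cds_lengths(14435, 15964))  # (1530, 509)
```

The result matches the listed values of 1530 bp and 509 aa.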


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value: 0.923259
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGG ATAAGATTCT CATCGTGGAC GACGAGGCGG ACATCGCCCT CATCCTCAAG 
CTGCAACTGG AGGACGCCGG CTACGAGACG GTCCGGGCCC GCGACGGGGT AGAGGCCCTG
GAGGCGGTGG CGGGGGAGCA GTTCGACCTC ATCATGCTCG ACATAAAGAT GCCTCGCATG
GACGGCCTCG AGGTGCTGAG CCGCCTGAAG GGGGACGAGA CCCCCGTGGT GATGATGACC
GCCCACGGCA GCGAGGACAT CGCCGTCGAC GCCATGAAGA AGGGGGCGCT CGACTACATC
TCGAAGCCTT TCTCCACCGA CGACATGCTG CAGAAGGTGG AGCGCGCCAT CGGCATCGAC
CGCACCCGTA AGGAAAACGC CAGGCTTTCG CAGCAGTTGG ACGAGGAGCG GCGCAAGATG
GAGGCGGTGC TGCAGGGAAT GGCCGACCTC CTGGTAGCAG TCGACCTGCA GGGGCGCATC
ATCACCTGGA GCCGGCAGGC GGCCAGGCTT CTTGCCGCGG AAGGGGAGAG CCTCTTGGGG
AAAACCCTGG AAGAGGTGCT CAAGGCCGAG GTTGCCGGGG GGGAACTCCC GGCCCGGACC
GTCTTGCGCA CGGCGGAACC CCGCCTCGAC GTCGGCTACC TGCTGAAGAT AGGCAGCCAG
GCGGTGCCGG TACTCTCCTC GGCGGCGCCG CTTTGGAACG CGAACGGAGA GCTCGCGGGG
AGCGTCGAGA TCATCAGGGA CATCTCGAAG CTCAAGGAGC TGGAGCAGGA AAAAGAGGAT
TTCGTGAGCA TGCTCTCCCA CGACCTCAAG TCCCCCATCA CCGCGGTGGT CGGCTCCATC
GACCTGGTGC GGGAGTCGAA GCTCGGCCCG GTGACGCCGG ACCAGTCCGA GTACCTGAAC
GCAGCCATAG AGAGCTGCGA GGAGATGGTG GAGATGATCG ACACCCTGCT CGGCATCCAC
AAATTCGAGG CGGGGAAGAT GAAGCTGAAT TTCCGGGAGG AAGACCCGGT GCTGCTCATC
AACCGGAGCG CGGCCAAATT CCAGACGCCG GCGCAGCGCG GCGCGATCAG GCTCTTCACG
ACCCTCCCCG CTTCCCTCCC CGCCATCTCG GCCGACCGCT CCTTGTTCAG CCGCATCCTG
GGAAACCTCC TCTCCAACGC GGTGAAGTTC ACCCCCGAGG GGGGAGAGAT AGAGGTGTCG
GCGGACCTGG TACAGGACCC GGCACCCGTG CTGGCGCATG TGCCGCAGGA GTGCTATCCC
GGGCAAGAGC TGCCCAGCGA GGGGGAATTC GTCAGGATCA GGGTGCGCGA CTCCGGCGTA
GGGATTCCGC AGGAATCGCT TGGCTCCATC TTCGACCGTT TCGTGCAGGC CAGAAACCGC
CGGCAGGGAA AGACCCGCGG CACCGGCCTG GGGCTTGCCT TCTGCAGGAA GGCGATGGAC
GCGCATGGCG GGTACATCTG GGCGAAGAGC GAGCCGGGCG AAGGCTCCGT CTTCACCGTC
TTGTTCCCGG CGCTTCCGGA GGAAGAGTAG
 
Protein sequence
MQKDKILIVD DEADIALILK LQLEDAGYET VRARDGVEAL EAVAGEQFDL IMLDIKMPRM 
DGLEVLSRLK GDETPVVMMT AHGSEDIAVD AMKKGALDYI SKPFSTDDML QKVERAIGID
RTRKENARLS QQLDEERRKM EAVLQGMADL LVAVDLQGRI ITWSRQAARL LAAEGESLLG
KTLEEVLKAE VAGGELPART VLRTAEPRLD VGYLLKIGSQ AVPVLSSAAP LWNANGELAG
SVEIIRDISK LKELEQEKED FVSMLSHDLK SPITAVVGSI DLVRESKLGP VTPDQSEYLN
AAIESCEEMV EMIDTLLGIH KFEAGKMKLN FREEDPVLLI NRSAAKFQTP AQRGAIRLFT
TLPASLPAIS ADRSLFSRIL GNLLSNAVKF TPEGGEIEVS ADLVQDPAPV LAHVPQECYP
GQELPSEGEF VRIRVRDSGV GIPQESLGSI FDRFVQARNR RQGKTRGTGL GLAFCRKAMD
AHGGYIWAKS EPGEGSVFTV LFPALPEEE
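
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, with a simple GC-percentage helper; the names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAAAAGG ATAAGATTCT CATCGTGGAC GACGAGGCGG ACATCGCCCT CATCCTCAAG"
last_line = "TTGTTCCCGG CGCTTCCGGA GGAAGAGTAG"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)
print(len(head), head[:3])  # 60 ATG (start codon)
print(tail[-3:])            # TAG (stop codon)
print(round(gc_percent(head), 1))  # GC percentage of the first 60 bases only
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content listed in the record.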