Gene GM21_1273 details

Gene Information

Locus tag: GM21_1273
Symbol:
ID: 8136599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1483114
End bp: 1486128
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 64%
IMG OID: 644868886
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003021091
Protein GI: 253699902
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
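
The coordinates and lengths in this record agree with one another, which a short arithmetic sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1483114
end_bp = 1486128

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3015
print(protein_length_aa)  # 1004
```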


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 124
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACC ATCGCCGGGG GCACCGCCTC CCGCTCGTGG CAGTCACCCT CGCCGTCATC 
GCCCTCCTCC TGGTCCCGAA AGGGGCGCCG GCACAGGAAA CTGCGACCAT CCAGTTGAAA
TGGCTGCATC ATTTCCAGTT CGCGGGGTAC TACGCGGCGC TGGAGAAAGG GTTCTACCGC
CGGGCCGGCC TGGATGTGAC CATCAAGGAG GGGGGGCCGA GAACCGAGGT GGAGGATGAG
GTCCTCTCGG GCAGGGCTGA CTTCGGCGTG GGAACCTCGG CGATCCTTTT GCGCCGCGCC
CGCGGCGAAG ACCTGGTGGT GCTGGGGCAG ATCTTCCAGC ATTCGGCCGC CGTCCTGATC
ACCCCCCGCA GCACCGGAAT CCGCTCCATC CCCGACATGG CGCGCCGCAG GTTCATGTAT
TCCAACCAGC ACGGCGACAT GCTGACGCTT TTGCTGCAAA ACGGCGTCAA CGAAAAAGAC
CTCGTGCAGG TCCCCCACAA CGGCGACCCC CGCGACCTCA TCGGCGGCAA GGCAGACGTC
ATGATGGGGT ACAGCTTCAA CGAGCCCTTC ATCCTGGAGC AGGAAGGGAT ACCCTACCTC
CTCTTCTCCC CGCTTACCTA CGGCATAGAT TTTTACGGCG ACAATTTCTT CACCACGCGG
GCCAACATCG AGGCAAGGCC GGAACTGGTC CGCGCCTTTC GCGAGGCGAC GCTGGAGGGG
TGGCGCTACG CGATGGCCAA CAAGTCCGAG GTGGTGGACC TGATCCTCGC CAAGTACTCC
CGGAAGAAAA GCCGCGACTG GCTCATGTTC GAGGCGAACC AGATGGAGAC CCTGATCCAG
CCCACCCTGG TCGAGTTGGG GTACCAGAAC CCGGAGCGCT GGCGGAACAT CGGGGAGTCG
TTCGCAAAAC TCGGCATGGT CCCGCAGAAT TTCAACACCA GCGGCGTGAG CTACGACCCC
GCTCCCGGCA AGTACTATCG CGTCATCCTT CAAATACTGC TGGTCTGCGG TTCCGTCATA
GCGGTGCTGG TCGTCATCGT GATGAAGTTC AGGCAGTTGA ATGGAACACT CAAGGCCCAG
GTGGCCGAGC GCCAGGCGGC GGAGGAGGCG CTCAGGGAGA GCGAGGAGCG GCTGCGCGTC
ATCTTCGAGA CCTCGCAGGC CGGCATCATC ATGGTCGACC CCAAAGGGAT CATCCGCTTC
GCGAACAAGA GGATGGCCGA GATGTTCGGC TGCCCTCACG ACAAGCTGAT CGGCTCCGAC
TACCGAAGCC ATCTCCATCC GGAACAGTGC GAGGTGGGGA GCCAACTCAT GGAGAAGCTG
ATCCGCGGAG AAATGGAGCA GGCCTGCACG GAACGCCGCT ACCTGTGCGG TGAGCAGGGG
GATTTCTGGG GCTACCTCTC CGGCAGGAGG CTGGAGGCCC CCGACGGCAA GCTGCAGGCG
CTGGTCGGGA TCATCTCCGA CATAACCGAC CGCATCAAAG CGGACGAGGC CCGGGGGAAG
GCGCTCATGC TGGTGGAGAC CCTCCTGGCC CACTCCCCGA TGGGGATCAC CGTCTTCGAC
GGGGAGAGCG GCGCCTGCAT CCTTTTGAAC CAGGCCGCCG CAGGGATCTC CGGCGGCACC
AGGGAAGCGC TTCTAGGGCG CGATTTCCGG GGGGTGCAAC CCTGGCGCGA AGCGGGGCTC
ATCGCGGCGG CGGAAAAGGT CCTCTCCGAC GGGATCCCCC GCCCCTTCGA GGCGGAACTC
CGCGGCTCCC TGGGAAAAGA CGTCATGCTG CGCTGCCACC TTTCCCGGTT CGACCTGGAG
GGAAGAGCGC ACCTGCTGGT GCTGGAGCAG GATGTCACCG AGGAGATGCG CCTGGAGCGG
GAGAACAAGC GGATCGAGGC GCAGATGCTG AACATGCAGA AGCTGGAGAG CCTCGGGGTG
CTGGCGGGGG GAATCGCCCA CGACTTCAAC AACATCCTTA CCGGGATAGT GGGTAACATA
AGCTTCGCCC AACTGGCGCT CCCCGCGGCC CACAAGGCGG CGGCGCCGCT TCTGAAGGCC
GAGAAGGCCT GCCAGAGAGC GGCGGAACTC GCCTCCCAGC TTTTAACCTT CGCCCGGGGG
GGGCAGCCGA TCAAGAAGGC GTTCTCCGTC AAGCCGCTGG TCGGGGAATC GCTCTCGCTG
GTTCTGCGCG GCACCAACGT CAAGGGGGTC ATCGACATCG CCGACGATCT TTGCGTCATC
GAGGCGGACG AGGGGCAGAT AAACCAGGCT TTCAACAACA TCATCATCAA CGCCGTGCAC
GCCATGCCGG GGGGGGGAAC CCTCACCATA GCGGGCGAGG ATGCCGTGAT GGAAGCAGGC
AACCGCTTCG GCCTCGCGCC GGGCCCCTAT GTCCGGCTGA GTTTCAGCGA CCAGGGATGC
GGCATCCCCG AAGCGGACAT AGAAAGGATC TTCGACCCGT ACTTCACCAC CAAGACCAGC
GGCAGCGGCC TGGGGCTTGC ATCGACCCAC TCCATCATCG CCAGGCACGG CGGCATGATC
CTCGTGGACT CGGTTCCGAA AAAAGGAAGC ACCTTCATTA TCTACCTCCC CTCCACCGGG
AATTCGGTGG CGGAAGAGGC GGGGCAGGAC AAGGCCGAGC GCTTGCACGG GGGGGGACGG
ATGGTTGCGG TGATGGACGA CGAGGAGATG ATCAGGGACC TTACCCGCGC CATGCTGGTC
GAGCTCGGTT ACCGTGTGGA GGTATGTTGC GACGGCGCCG AGGTGGTCGA ACTCTATCGA
GCCGCCTGCG CCCGCGGGGA ACGCTACTCA GCCGTAATCA TGGACCTCAC CGTCCCCGGG
GGGATGGGGG GCAAGGATGC GGCGCTCCGG ATCCTGGAGC TCGACCCGAA GGCGCGGCTG
ATCGTTTCCA GCGGCTACTC CAACGACCCC GTCATGTCCG AGCACGAAAG CTTCGGCTTC
TGCGCCACGC TGGTCAAGCC TTACACCGCC GACGACATCG CCAGGGTGCT GGGGGAGGCG
ATTAACGGCA ATTGA
 
Protein sequence
MNHHRRGHRL PLVAVTLAVI ALLLVPKGAP AQETATIQLK WLHHFQFAGY YAALEKGFYR 
RAGLDVTIKE GGPRTEVEDE VLSGRADFGV GTSAILLRRA RGEDLVVLGQ IFQHSAAVLI
TPRSTGIRSI PDMARRRFMY SNQHGDMLTL LLQNGVNEKD LVQVPHNGDP RDLIGGKADV
MMGYSFNEPF ILEQEGIPYL LFSPLTYGID FYGDNFFTTR ANIEARPELV RAFREATLEG
WRYAMANKSE VVDLILAKYS RKKSRDWLMF EANQMETLIQ PTLVELGYQN PERWRNIGES
FAKLGMVPQN FNTSGVSYDP APGKYYRVIL QILLVCGSVI AVLVVIVMKF RQLNGTLKAQ
VAERQAAEEA LRESEERLRV IFETSQAGII MVDPKGIIRF ANKRMAEMFG CPHDKLIGSD
YRSHLHPEQC EVGSQLMEKL IRGEMEQACT ERRYLCGEQG DFWGYLSGRR LEAPDGKLQA
LVGIISDITD RIKADEARGK ALMLVETLLA HSPMGITVFD GESGACILLN QAAAGISGGT
REALLGRDFR GVQPWREAGL IAAAEKVLSD GIPRPFEAEL RGSLGKDVML RCHLSRFDLE
GRAHLLVLEQ DVTEEMRLER ENKRIEAQML NMQKLESLGV LAGGIAHDFN NILTGIVGNI
SFAQLALPAA HKAAAPLLKA EKACQRAAEL ASQLLTFARG GQPIKKAFSV KPLVGESLSL
VLRGTNVKGV IDIADDLCVI EADEGQINQA FNNIIINAVH AMPGGGTLTI AGEDAVMEAG
NRFGLAPGPY VRLSFSDQGC GIPEADIERI FDPYFTTKTS GSGLGLASTH SIIARHGGMI
LVDSVPKKGS TFIIYLPSTG NSVAEEAGQD KAERLHGGGR MVAVMDDEEM IRDLTRAMLV
ELGYRVEVCC DGAEVVELYR AACARGERYS AVIMDLTVPG GMGGKDAALR ILELDPKARL
IVSSGYSNDP VMSEHESFGF CATLVKPYTA DDIARVLGEA INGN
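
The protein sequence can be reproduced from the gene sequence by reading it three bases at a time. The following is a minimal sketch, not the database's own method. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that does not matter here because the gene starts with ATG). The helper name `translate` and the variables `first_row` and `last_row` are hypothetical; the two test strings are the first and last rows of the gene sequence above.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the coding sequence
            break
        protein.append(residue)
    return "".join(protein)


first_row = "ATGAACCACC ATCGCCGGGG GCACCGCCTC CCGCTCGTGG CAGTCACCCT CGCCGTCATC"
last_row = "ATTAACGGCA ATTGA"

print(translate(first_row))  # MNHHRRGHRLPLVAVTLAVI
print(translate(last_row))   # INGN
```

Both outputs match the corresponding ends of the protein sequence listed above. Passing the whole gene sequence as one string to the same function should give the full translation.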