Gene GM21_1566 details


Gene Information

Locus tag: GM21_1566
Symbol:
ID: 8136897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1823304
End bp: 1826447
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 59%
IMG OID: 644869179
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003021379
Protein GI: 253700190
COG category: [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
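
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1823304, 1826447)
print(length, protein_length_aa(length))  # 3144 1047
```

With the values in this record, the sketch reproduces the listed gene length of 3144 bp and protein length of 1047 aa.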


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 95
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAAC GTCTGATCCT CTTCGCATGT CTGCTTGCCT CCGTCTACAC GGCGTCGGCC 
GCCGGTAATC CCCAGAAAAC GCTCGTTGTA GGCAGCGAGG AGAACTTTCC TCCCTTCTCC
ATCGGCAGTA CCGACGATAC TGCCGGCGGT TTCGCCGTGG AGCTTTGGAA GGAGGTTGCC
CGGGAGGCGG GGCTGAAATA TACCATCAAG GTCAGGCCGT GGGGTGAACT GCTCCAGGAT
TTCCGGGACG GGAAGGTCGA CGTCCTGATC AATATCGCCA CCTCCGAGGA GCGGCATCGC
TACACCGATT TCAGCGTCCC GCACGTCACC GTCAACGGCG CCATCTTCGT GCGCAAGGGG
GACTCCCTCA TAAGCTCCGA GGCGGACCTG GCGGGGAAGT CCATCATCGT GTTCAAATCC
GACCTGGCCC ATGAATACGC GGTCTCCAAA GGACGGCAAA AACAACTGGT GCTGGTCGAC
GACGCCCGTG AAGGGCTCAG GCTCCTTTCC TCCGGTAAGC ACGACGCCAT GCTGCTCAGC
AAGCTTGCCG GGATGCAGAC GCTGCGGGAA CTCGGCATCG ACAACGTCCG GGCCTTGGAC
ATCAAGGCCG GCTACGCGCA GAAGTTCACT TTCGGCGTGC ACAAAGGGGA CTCGGACCTC
CTCGCCGCGC TGAACGAGGG GCTGGCCCTG GCCAAGGCTG GCGGGGAGTA CGACAGGCTG
TACCAAAAAT GGTTCGGCGT GTACGAGACT AAGGAGCCGA CCTTTATAGA CATATTGAAG
TACCTGCTTC CCTTCGCGCT CTTCGTGGCT TGCCTTGCCG TCTATTTCTT CTACAGGCGA
AGCCGTGAAC GCAAAGAGTC CGAAGCTGCG TTGCGCGAAA GCAAGCTGCG CCTGGACACC
ATCCTCGACA ACGTCGGCGC CCACATCTTC ATAAAGGACA CCGAGTACCG CTACACCTAT
GTGAACCGCA AGGTCTGCGA GTTGTTCGGT CTACCCGAGG AAATGATCGT GGGCAAGAGC
GACGCCGCCT TCTTCTCCCC GGAATCGGTG GAGGAGATAA TCGTCAGCGA CCGACCGGTG
CTCGAGGAGG GGCTCACGGT AAAAAGGGAA GAAATCAACC TGGCGGCCGA GAACGAGCAG
CCTCACAGTT ATTGGACGGT GAAGCTCCCC CTCCGGGATG GAAGCGGCGC CATATACGGC
CTTTGCGGCA TATCGACCGA CATCACCGAA CTGAAGCTGC TCGAGGACGA GGTGAAGAAG
AAGAGCGCAC AGCTTGAGGA GGAACTGGCG GAGCGGGAGA TCGCGCAGGA ATCGCTGCAG
GAGCAGACTG CACTGCTCGA GGAGGAGGTC GAGGAACGCA GGAGGGCCGA GGTCGCGCTC
AAAAAGAGCG AGGAGACCGT TCGGCACAAA CTAAAGGCGA TCCTGGAACC GGAAGGGGAT
ATCGGCGCCC TGGAACTCTC GGATATCATC GACTGCGAAA TGCTCCAGGC CATGATGGAG
GATTTCTACA GGATCACGGG GATGTTGGGG GCAGTGGTCG ACGTCTCGGG CAAGGTGCTG
GTCGCGGTAG GGTGGCAGGA CATCTGCACC AAGTTCCACA GGGTCCATGC CGAGGCATGC
AAAAACTGCC TGGAAAGCGA CACCGTACTT ACCGAAGGGG TGGCCGCGGG AACCTACAAA
CTCTACCGCT GCAAGAACAA CATGTGGGAT ATGGTGACTC CCATCGAGGT GGGGGGGAAG
CATCTCGGCA ACGTCTTCAT CGGGCAGTTC TTCTACCAGG ACGACCCGCC CGATGTGGAG
TTTTTCCGCG AGCAGGCCAG GCGCTACGGG TTCGACGAGA CGCAGTACCT GGCCGCGCTG
GATCGGGTGC CTCGTTTCAC CAGGGAAGCG GTTGACGCGG GGATGAAATT CTACGCCGAA
CTGACCAAGG TGGTTTCCAC GCTCAGTTTC ACCTCGATAA AGTTGTCCAG GATGCTTGCC
GAGCGCATCC ACCTCGAAGA ACAGCTCCGG CAGTCCCAGA AGATGGAGGC GGTGGGACAA
CTGGCCGGCG GCGTCGCCCA CGACTTCAAC AACATCCTGC AGGTGATCAC CGGCTACGGC
AACATGCTCA AGATGAATGA AAACCTCGGC ATTAAACAAC AGGAATGGGT GGATCATATC
CTGTCATCGG CTGAACGTGC GGCGCAGCTT ACCAAGGGAT TGCTGGCCTT CAGTCGCAAG
CAGTTGATCT GCCTGGAACC CGTGGATCTG AACGAGGTCA TCGAGAACTT CAAGAAATTC
ATCCTCAGGG TCATCGGAGA GGACATACAC TTGAAGACGG TAACCTACGG GGACCGTCTC
ATCGTGCACG CCGACGTGGG GCAACTTGAG CAGGTGCTGA TCAACCTGGC GACCAATGCG
CGCGATTCCA TGCACAAAGG GGGAATACTG ACTATCGAGA CGGGGCTGCA GACGGTGGAA
GCACCCTTTG ACCACGAGTT CGGACGGGCG GAATCGGGAC GCTATGCTGT GGTAACCGTG
TCGGACAGCG GCAGCGGCAT GGATGAGGAG ACCCGCAAGA GGATCTTCGA GCCGTTCTAT
ACCACGAAGG AGGTCGGCAA GGGGACCGGC CTCGGCATGG CGATCGTCTA TGGCATCATC
AAGCAGCATC GCGGGATCAT CAACGTCTAC AGCGAGCCCG GGCACGGCAC CACCTTCAGG
ATTTACCTGC CGGTGCAGGA GGCCGAACTG GGCGGGGTGT ATCCCCAGGC CGTGCAGGTG
CAGGCGCCGA TGGGGGGAAG TGAAACGATC CTTTTGGCCG AGGATGATGC GGAAGTCCGC
AAGCTCGTGG TGTCCATCCT TTGCCAGTAC GGCTACGAGG TGATCCAGGC GGTGGACGGT
CAAGATGTCG TCGAGAAGTT CGCCGACAAT AGGGAAAAGG TAGCGATGAT TCTCATGGAC
ATGATCATGC CGAAAAAGAA CGGCAAGGAG GCCTACGAGG AGATCTCGCT GCTGCAGCCG
GGGGTGAAGG TGATGTACTC GAGCGGCTAT ACCGCGGACT TCATCCAGAA CAGAGGGGTG
TCGGAAGAGG GGATCGAACT GGTCATGAAA CCGGTGCAGC CGATGGAACT GCTGCGCAAG
ATCAGGGAGA TGCTGGACCG CTGA
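
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation such as the GC content reported in the gene information. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the total length. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above, used as sample input.
first_line = "ATGCTCAAAC GTCTGATCCT CTTCGCATGT CTGCTTGCCT CCGTCTACAC GGCGTCGGCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene sequence to the same two helpers gives a value that can be compared with the GC content listed for the gene.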
 
Protein sequence
MLKRLILFAC LLASVYTASA AGNPQKTLVV GSEENFPPFS IGSTDDTAGG FAVELWKEVA 
REAGLKYTIK VRPWGELLQD FRDGKVDVLI NIATSEERHR YTDFSVPHVT VNGAIFVRKG
DSLISSEADL AGKSIIVFKS DLAHEYAVSK GRQKQLVLVD DAREGLRLLS SGKHDAMLLS
KLAGMQTLRE LGIDNVRALD IKAGYAQKFT FGVHKGDSDL LAALNEGLAL AKAGGEYDRL
YQKWFGVYET KEPTFIDILK YLLPFALFVA CLAVYFFYRR SRERKESEAA LRESKLRLDT
ILDNVGAHIF IKDTEYRYTY VNRKVCELFG LPEEMIVGKS DAAFFSPESV EEIIVSDRPV
LEEGLTVKRE EINLAAENEQ PHSYWTVKLP LRDGSGAIYG LCGISTDITE LKLLEDEVKK
KSAQLEEELA EREIAQESLQ EQTALLEEEV EERRRAEVAL KKSEETVRHK LKAILEPEGD
IGALELSDII DCEMLQAMME DFYRITGMLG AVVDVSGKVL VAVGWQDICT KFHRVHAEAC
KNCLESDTVL TEGVAAGTYK LYRCKNNMWD MVTPIEVGGK HLGNVFIGQF FYQDDPPDVE
FFREQARRYG FDETQYLAAL DRVPRFTREA VDAGMKFYAE LTKVVSTLSF TSIKLSRMLA
ERIHLEEQLR QSQKMEAVGQ LAGGVAHDFN NILQVITGYG NMLKMNENLG IKQQEWVDHI
LSSAERAAQL TKGLLAFSRK QLICLEPVDL NEVIENFKKF ILRVIGEDIH LKTVTYGDRL
IVHADVGQLE QVLINLATNA RDSMHKGGIL TIETGLQTVE APFDHEFGRA ESGRYAVVTV
SDSGSGMDEE TRKRIFEPFY TTKEVGKGTG LGMAIVYGII KQHRGIINVY SEPGHGTTFR
IYLPVQEAEL GGVYPQAVQV QAPMGGSETI LLAEDDAEVR KLVVSILCQY GYEVIQAVDG
QDVVEKFADN REKVAMILMD MIMPKKNGKE AYEEISLLQP GVKVMYSSGY TADFIQNRGV
SEEGIELVMK PVQPMELLRK IREMLDR
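
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the reading frame begins at the first base. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits, which this sketch does not handle. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base up to the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed under "Gene sequence".
print(translate("ATGCTCAAAC GTCTGATCCT CTTCGCATGT"))  # MLKRLILFAC
```

The printed residues match the first block of the protein sequence, and the final codons of the gene (CTG GAC CGC TGA) translate to the closing residues LDR followed by a stop.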