Gene GM21_2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2001 
Symbol 
ID8137335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2319864 
End bp2322254 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content61% 
IMG OID644869614 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003021811 
Protein GI253700622 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value7.77001e-34 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCCGCCT CAGAAAATAA AGCAGTCCAT TTCACCGCGA ACTGCACGGT GCTGCTGGCT 
TTTTTCGTCG CGCTCTTTTT TCCTGCAGGT TACTTCGTCG TGAGTTACCA GTATGTGGTA
GGGAGCCTGG AGACCGAAGC CGAGATCAAC GCCAGAATTG TGGGCGGGTT GATCGACGAG
ACACCCGAAT CGCTGCATAC CCAGGCCCAC CGGCTGGAGG GGATGCTTCC GCAGAACCGG
AATTCGGGAC ACCCGGACTT CAGCAGGATC CTCGACGCGG ACGGCCATGT GCTTGCCACC
ATGGGCGAGC CACCTGCCGA ACCTGCGATC ACCAAGTCGC ACGACATCGT TTTCGCCGGC
AAGACCACCG GCAGCATGGA AATCAGCCGC TCGCTCAGGC CTATCTTGAC GAAGACGGCG
CTGGTGGCCG TTTTCGGCCT CTCAGTCGGC CTCGTGGTAT TCATACTTCT CCCCTTCCGC
GCCATCAACC GCGCCAACCG CAAGTTGCAG GATTCGTACA ATTTTCTTAC CAAAGTCATG
GAGAGCAGCG CCAACGCGGT GATCGTTCTC AATCCGGACG GCAAGATCGG CATGGTCAAC
GGCCGGTGCA CCGAGATGAG CGGCTACCGT CGCGAGGACC TGTACGGCTC GGAGATCGAC
CGGCTGCTCT GCCCGCAATC GTTCGACACG GTCCTTGCGC AACTGCGCCA GGTGTCGAGC
GGCGAGGCCG AGATCGTCAA GTTCGAGACC GACCTGCTCA GAAAGGACGG CAGCACCATA
GCCATCGCCT GCGGCGCGAC CCCGGTATGC CAGGAAGGGC GCGTCGCCAG CACGGTGCTC
TCGGTGGAGA ACATAACCGA GCGGAGGCGT TCGGTCGAAC AGCTAAAGGC CGCCAAGGAG
TACACCGAGA ACCTGATCCA GGCTGCTTGC GTGATGATAC TGGGACTCGA CCTGAAAGGG
AACGTGACCC TACTAAACCG CACCGCCGAA GAGGTGACAG GGTACGACGC CGACGAGCTG
ATCGGAAAGA ACTGGTTCGA GACGGTGATG GGGGCAGAGG CTTTCTACAA GATGTGCACC
ATCCCCGGGA TGGTTGGCGA AAGCGTGAAC CGCTGCGCCT TCGAGAACCA GATCGTCACC
AAGGAAGGCC TGGTACGCAC CATCTCCTGG CGCAACAGCG CCATCATCGA AAAGGACGCA
CGGCTCGGGA CGCTCTGCTT CGGGATCGAC ATCACCGAGC ACCGCAAGAT CGAGGCGCAA
CTCAGGCACT CGCAGAAGAT GGAATCCATC GGGCAGTTGG CCGGCGGCGT CGCCCATGAC
TTCAACAACA TGCTGAGCGT GATCATGGGG TACGCCCAGC TCTGCCAGAT AGAGGTGGAT
GAAAACAGCT CGCTTTGGCT CTACCTCGGG GAGATCGTCA AGGCGGGCGA GCGCTCGCGC
GACATGGTGA GGAAACTCCT CGCCTTCTCC CGAAAGGAGA TCATCTCCCC CAAGGCGGTG
GACCTGAACA TCCTCTGTAT CGAGACGGAA AAGACCTTAA GCCGGCTGAT CGGCGAAGAG
ATCAAGCTCA ACTTCATCCC TGCCACGACG CTCTGGACCG TGAAGATCGA CCCGTCGCAG
GTGGACCAGA TCCTGATGAA CCTAGCGGTC AACGCGCGCG ACGCCATGCC CGAAGGGGGG
CGCCTGACCA TCGGCACCGA GAACGCGACG GTGGACGAGG CATTCTGCGA TTACCGCCTC
GACGCGCGCC CCGGAGAGTA TGCCTGCCTC ACCGTGGCGG ACACCGGTTT CGGCATGGAC
AGGGAACTCA CCAAACGGAT CTTCGAGCCC TTCTTCACCA CCAAGGACGT CGGCAAGGGG
ACCGGCCTCG GGCTCGCCAC GGTTTACGGC ATCGTCACCC AGAATGGCGG TTTCCTCGAC
GTCGACAGCG AACCGGGAGA AGGAACCTCG TTCCGGATTT ACCTGCCGCG CCTGAAAGAC
GAGCCTGAGG AGCAGACAAA GCCGTGCGTC GACCACCCCA CAGGGACGGG GACGATTTTG
GTCGTCGAGG ACGACCCCAT GCTACGGACC ATGGCAACCC AGATGCTGGA AAAGATCGGC
TACCGCGTCA TAGAGGCCGG GAACCCTCAA GTCGCCCTTT CCATCTGTGC GGATCCGACC
ACGTCCATCG ACTTGGTGCT GACCGACGTG ATCATGCCCG AGATGAACGG ATTGGAGATG
GCGCGGGGGA TCGCTGCGCT GCGCCCCGAC ACCAAGGTGC TCTTTATGAC CGGTTACTCT
TCCGACGTCA TCGCGAGCCA CGGCATCATT CAACCAGGAC TGCACTACGT CGAGAAGCCG
TTCAACATGG AGGGGCTGCA TGCGAAGATC CTGGAGATCC AGGCGACGTG A
 
Protein sequence
MAASENKAVH FTANCTVLLA FFVALFFPAG YFVVSYQYVV GSLETEAEIN ARIVGGLIDE 
TPESLHTQAH RLEGMLPQNR NSGHPDFSRI LDADGHVLAT MGEPPAEPAI TKSHDIVFAG
KTTGSMEISR SLRPILTKTA LVAVFGLSVG LVVFILLPFR AINRANRKLQ DSYNFLTKVM
ESSANAVIVL NPDGKIGMVN GRCTEMSGYR REDLYGSEID RLLCPQSFDT VLAQLRQVSS
GEAEIVKFET DLLRKDGSTI AIACGATPVC QEGRVASTVL SVENITERRR SVEQLKAAKE
YTENLIQAAC VMILGLDLKG NVTLLNRTAE EVTGYDADEL IGKNWFETVM GAEAFYKMCT
IPGMVGESVN RCAFENQIVT KEGLVRTISW RNSAIIEKDA RLGTLCFGID ITEHRKIEAQ
LRHSQKMESI GQLAGGVAHD FNNMLSVIMG YAQLCQIEVD ENSSLWLYLG EIVKAGERSR
DMVRKLLAFS RKEIISPKAV DLNILCIETE KTLSRLIGEE IKLNFIPATT LWTVKIDPSQ
VDQILMNLAV NARDAMPEGG RLTIGTENAT VDEAFCDYRL DARPGEYACL TVADTGFGMD
RELTKRIFEP FFTTKDVGKG TGLGLATVYG IVTQNGGFLD VDSEPGEGTS FRIYLPRLKD
EPEEQTKPCV DHPTGTGTIL VVEDDPMLRT MATQMLEKIG YRVIEAGNPQ VALSICADPT
TSIDLVLTDV IMPEMNGLEM ARGIAALRPD TKVLFMTGYS SDVIASHGII QPGLHYVEKP
FNMEGLHAKI LEIQAT