Gene GM21_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3966 
Symbol 
ID8139340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4549526 
End bp4551559 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content62% 
IMG OID644871582 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003023740 
Protein GI253702551 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0000000186378 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAG CCTCTGACCC GCCCCAACCC GATCCCCAGG ACGCCCAGGA CGCCCTGGCG 
ACGCTGCGGC AAAAGCTCGC GGGACTCGGG GAATCCTCGA TGCGCAAGAG CTACTATCCC
GAACTGCAGG AGCGCCTTGA GGAACTGGAG CGGTTCAAGG CCTTGCTGGA TCACAGCAAC
GAAGCCATCA TCCTGATCGA AGTCTCCACC GGGCGCATCG TCGACCTGAA CGACTCGGCC
AGCCGCCAGA CCGGCTGGAG CCACGACGAG CTGCTGCAGC AATCCCTCTT CGACCTCTCC
AACCTTGAGC AAAATCCCGC CGCGCAAGCG CTGATCAGGT CCGCCGACGA CATGGGAGCC
AGCGGGATGC TGGTCGTCAC CGAGCTCCAC CGCAAAAACG GCGGACGCTT CCCCGCCGAA
ATCACTTTGA ACCGGATGCA ATTCCGCGAT AATTCCTACG TGCTGGCGGT CGCCCGCGAC
ATCACACAGA GGAAGGCGAT GGAGGAGGCG CTTAGGGAGA GCGAGGAATT TCTCAAGAAC
ATCGTTGATC ACATCCCTGC GGTGGTTTTC GCCAAGGAGG TGCAGGGGCT GCGCTTCGTC
ACCATCAACA AGGCGTGCCA GGAGGTGTTC GGCCTGAGCC GGGCGGAGGT GCTCGGCCGC
ACCAACTACG ACCTGTTTCC CAAGGAGCAG GCGGACTTCT TCACCAAGGT CGACCGGGAG
ACCCTCGCCA AGGGCGAGCT GGTGGAGGTC CCGGAGGAAA TCATCAGCAC CCCCAGCGGC
GACCGCATAC TGCGGGTCAA GAAGATTCCG CTCTTCGACA ACCAGGGAAA GGAGCGTTTC
CTGTTGGGGA TCGCCGAGGA CATCACCGAA CGGAAGCAAC TGGAAGAAAA GCTGCTGCAA
TCGCAGAAGA TGGAGGCGAT TGGGCAACTG GCCGGCGGGG TGGCGCACGA CTTCAACAAC
ATCCTGATGG TGATTCTCGG CTACGGGAGC ATTCTGCTGA ACGAGGGGGC GCTGCCGGCG
CGGCAAAAGG AGCAGGTGGA GCAGATCATG AACGCGGCGG ACAAGGCGGC GAAGCTCACC
TCGGACCTCC TCGCCTTCAG CCGAAAGCAG GTGATCAAGC CCGCCACCAT GAACCTGAAC
GACATCATCC TGCACGTGGA AAAGTTCCTC TCCCGCATCA TCGGCGAGGA CGTCCAACTG
AAGGCTCGGC TCACCCCGCG CGAACTGCAG GTCGACGTCG ACCGTGGGCA GATAGAGCAG
GTGCTGATCA ACCTCGCCAC CAACGCCCGG GACGCCATGC CCAAGGGGGG GCTGCTCACC
ATCGAGACCT CGTCGCTGCA GATCGACGAC GCCTTCGTCC AGGCCAACGG CATCGGCGCC
CCCGGCCCTT ACGCCGTCAT CTCCATCTCC GACACCGGCG TCGGCATGAA CGAACAGACC
CGCAGGAGGA TCTTCGAGCC GTTCTTCACC ACCAAGGAGA TGGGGAAGGG CACCGGCCTT
GGCATGTCCA TCGTCTACGG CATCATCAAG CAGCACAACG GCTTCGTGAA CGTCTACAGC
GAGCCGAAGA TAGGGACCAC CTTCCGCATC TACCTCCCCT TCAGCGAACA AAGCTCCGAG
GCGGCCCTGG ACCCCCAAGC CCCCGACAGC GCACCGGGGG GGGCGGAGAC CATACTGGTG
GTGGAGGACG AGCCGGATCT GCGCCTTCTC TTGCAGAACA TCCTTTCCGG AGCCGGGTAC
TGCGTCCTCT TGGCAGAAAA CGGGCAGGTG GCGGTCGAGC GGTACGCGGC CGGCGCAGGG
GGGGAGATAG CGCTGGTGCT GATGGACATG ATCATGCCGG GGATGAGCGG CAAGGAAGCC
TGCCGCGCCA TCCGCGCCAT CGACCCTGCG GCCAAGGTGC TTTACACCAG CGGCTACACC
ATGGACATCA TCAAGAGCCG CGATCTGTTG GAGGAAGGGA CCGAACTTCT CATGAAACCG
GTCCGCCCTC TGGAGCTTTT AAAGAAGGTG CGGGAGATGC TGGATAGGTT GTGA
 
Protein sequence
MKKASDPPQP DPQDAQDALA TLRQKLAGLG ESSMRKSYYP ELQERLEELE RFKALLDHSN 
EAIILIEVST GRIVDLNDSA SRQTGWSHDE LLQQSLFDLS NLEQNPAAQA LIRSADDMGA
SGMLVVTELH RKNGGRFPAE ITLNRMQFRD NSYVLAVARD ITQRKAMEEA LRESEEFLKN
IVDHIPAVVF AKEVQGLRFV TINKACQEVF GLSRAEVLGR TNYDLFPKEQ ADFFTKVDRE
TLAKGELVEV PEEIISTPSG DRILRVKKIP LFDNQGKERF LLGIAEDITE RKQLEEKLLQ
SQKMEAIGQL AGGVAHDFNN ILMVILGYGS ILLNEGALPA RQKEQVEQIM NAADKAAKLT
SDLLAFSRKQ VIKPATMNLN DIILHVEKFL SRIIGEDVQL KARLTPRELQ VDVDRGQIEQ
VLINLATNAR DAMPKGGLLT IETSSLQIDD AFVQANGIGA PGPYAVISIS DTGVGMNEQT
RRRIFEPFFT TKEMGKGTGL GMSIVYGIIK QHNGFVNVYS EPKIGTTFRI YLPFSEQSSE
AALDPQAPDS APGGAETILV VEDEPDLRLL LQNILSGAGY CVLLAENGQV AVERYAAGAG
GEIALVLMDM IMPGMSGKEA CRAIRAIDPA AKVLYTSGYT MDIIKSRDLL EEGTELLMKP
VRPLELLKKV REMLDRL