Gene GM21_0312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0312 
Symbol 
ID8135619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp383441 
End bp386245 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content61% 
IMG OID644867929 
ProductSigma 54 interacting domain protein 
Protein accessionYP_003020151 
Protein GI253698962 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones172 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTCAG AGGAACAGTT ACCCGTCATG GACCAGCAAC TGAAAAAACT AGGCGACCGG 
ATCGCCTCCT TCGGCAAGGA AAACCTGGAC GAGATGCTGC ACCTGGTCGG CGAAGGGTCG
CGGCTCATCT CCGGGCAGGA GCGCGTGCGC ATCTACCTTG AGGATCTCAC CAAGGGCGCC
CTCTCCTGCG CCTTCTCCTG CGGCGGCTTC GCCACCGAAA TAAGGCAGGA AACCTTCCCG
ATCATCTCAG CCGAAGCCGC GGTCTCCCGC ACCTTCGTGA CGCAGCAGGT CGCGCAATAC
CAGAACCCCG CCGAAACCGG CCTCCCGCTG GACCAGGAAT TCTCGCGACG GTTCCAGATC
GAGGGGACGA CCATGCTCCC CATCACCAGC CAGGGCAAAT CGATCGGCGT CGCCTGCCTT
GATGGCAGCA CGCTCTCCCA AGACAAGATC GACGAACTCA TCCCCTTTCT GGCGCGGGCG
GGCGAACGTG TGGACCAGGC CAGGAAATAC CACCAGCAGT TGCTGTTGGC CCGGCGGGTC
GAGCTCTACA AACGGCGCGA AGCCGCCGGA TTCATGGTGC GCTCGGCAGT AAACCTCATC
GACGGGCTGA CGCTCGCCTC GGTGCTCGTC CCCGTCAAGG GGGGGATGGA AGTGCTGGCA
AGCCACGCCC AGGACCCGGA CCTGAAATAC CTCTACGACA ACGTGGGAGG GATCGATCTC
AAGCACGGCA CCTCGCTCGT CTCCCGCTAC GTCAACGAGG CCGGCGTAGT CACCGACCCG
AGCCTTCTGA AGCCTATGTT CTTTCCCGAC CTTTTGGACC AGTCCATCCA ACGCCGCGCC
CTCACCGAGG AGATGGGGCT GCGCACCCTG TACATGGTGC CGCGGGTAGA GCCGGAGACC
AAGCGCATCA TCTGCCTGAT GAACTACTTC ACCCGCGACG TCACCAACTA CACCGATTTC
GAGATGGGGC TTTTGCAGAC CCACGCCGAG ATGGTGGAGC GGGTCATCAA CGAGGTGGGC
GGCGAGCACC TGGAGATCAA GGTGCTATCC GAGATATCCG ATCTGCTCCA GGAGCCGAAC
GAAGGGCTGC ACGGCTTCCT CACCAAGGTC CTCTCCAAGG CGACGGAACT GATCGGCGCC
GACACCGGGA GCATCGCCAT CGTCCAGGAG CGCGAGGGAG TGAAGTGGCT GGTGGTCGAA
AACGAGGAAG GGACCATCGT CGGGGCGAAG AACAAGGAGT GGCTCAAGAA GAAGATCCCC
CCCTTCAAGA TCGGCGGTGC CGACCTTCCC CCCGAGGAGC GTAGCCTTAC CGGTTACGTC
GCCTGGACCA AGGAGCCCAA GATCATCGAG CAGGTGGCGG ACCAAAGCCG GCACGAGGGG
TTCCACCGCC CGATGAACGA ACTGATACAA AGCGAGATGG CGGTTCCCGT GATCAGCGAC
GACGAGGTGA TCGCCGTCAT CTGCCTCAAC TCCCTGCAAG ACGGGTACTT CACCGAAGAA
CACAAGCGGA TCCTGCAGAT CATCGACCGG CTCACCTCGC GCCACATCTC CGACATCCAG
CGCATAGAGC GGCTGCAGTC GGAGGTGAAC AAGCTGCAAA GCGACATCGC CTACAAGGAC
CCGAAAGTCT CCTCCTACCG GCTGGGAAAC ATCATCGGCA ACAGCCGCAA GTCGCAGGAG
ATCGTCGCCT TCATCAACAC CGTGGCGCAG CCTCTATCCA ACCGGATCGC GCTGTGGAGC
AGGAACATCC TGCAGGAGGC GACCATAGGC CTTCCCTCCA TCCTGGTCCT TGGCCCCACC
GGCGCGGGGA AGGAATTCTT CTTCAACAAC CTGTACAACA AGCTAAACGA GCTCTACCGG
CAGCAGATCA ACCCGGACGG CGAGCTTCCG GTGAAGAAGA CCAACATCGC CGCCTACAGC
GGAGACCTTA CCTACTCCGA GCTCTTCGGA CACATCAAGG GAGCCTTCAC CGGCGCCTAC
AGCGACCGCA AGGGGATCAT CGAGGAAGCC GCCGGAGGGA TCGTCTTTCT CGACGAGATC
GGCGACGCCG ACCCGAAGAC CCAGGTCCAG TTGCTTCGTT TCCTCGACAA CGGCGGGTTC
GTGCGTCTGG GCGAGAACCG CGAGCGTATC AGCCGCGTGC TCTTAGTTGC CGCCACCAAC
AAGGACCTGA GCAAAGAGAT CGCCATGGGG AACTTCCGCG AGGACCTCTT CCACCGCTTG
ACCGAGCTTT CCGTCGTGGT GCCGTCACTG AACGAGCGGC GCGAGGACAT CCCCGACCTC
TCCATCCACT TCCTGGGGAA GCTCTACCGC ACCTACCGAA GCCGCGAGGA AGAGGCGCGC
GACGCCGAGC CGAGCCTCTC CAAAGACGCC AAAGACGCCT TGGTAAGCCA CAACTACAAG
GGGAATATCC GTGAACTGCG CAGCATATTG TTGCGCGCCC TCTTCTTCAG GACCGGCAAG
ATGGTCACCG GCGAGGACAT AAAGAAAGCC ATCCGGGACG GGATGCGGGA GCAGATGACG
CCTGCGGCGG AAAGGCTATC CGAAGAAGTC GCCGGGGGGA TACTCGCCGA GATAGAAGCG
GGGAGCGATT TTTGGGAAGC CGTGTACCAG CCCTACTCGC AAAGCCGAAT TTCCCGCGAC
GTGGTGCGGC TGATAATAGA GAAGAGCCGT GTGGCCGCCG GCAAGAACAT GCCCGAAATA
GCCAGGTACC TGAAAGCCAT CACCGGCGAT CCCCAGGAGG ATGAAGAAGA GAGGAAACGC
TTCTTCCGGT TCAAGAACTT CCTTTACAAG ACGGTGAAGA TCTGA
 
Protein sequence
MYSEEQLPVM DQQLKKLGDR IASFGKENLD EMLHLVGEGS RLISGQERVR IYLEDLTKGA 
LSCAFSCGGF ATEIRQETFP IISAEAAVSR TFVTQQVAQY QNPAETGLPL DQEFSRRFQI
EGTTMLPITS QGKSIGVACL DGSTLSQDKI DELIPFLARA GERVDQARKY HQQLLLARRV
ELYKRREAAG FMVRSAVNLI DGLTLASVLV PVKGGMEVLA SHAQDPDLKY LYDNVGGIDL
KHGTSLVSRY VNEAGVVTDP SLLKPMFFPD LLDQSIQRRA LTEEMGLRTL YMVPRVEPET
KRIICLMNYF TRDVTNYTDF EMGLLQTHAE MVERVINEVG GEHLEIKVLS EISDLLQEPN
EGLHGFLTKV LSKATELIGA DTGSIAIVQE REGVKWLVVE NEEGTIVGAK NKEWLKKKIP
PFKIGGADLP PEERSLTGYV AWTKEPKIIE QVADQSRHEG FHRPMNELIQ SEMAVPVISD
DEVIAVICLN SLQDGYFTEE HKRILQIIDR LTSRHISDIQ RIERLQSEVN KLQSDIAYKD
PKVSSYRLGN IIGNSRKSQE IVAFINTVAQ PLSNRIALWS RNILQEATIG LPSILVLGPT
GAGKEFFFNN LYNKLNELYR QQINPDGELP VKKTNIAAYS GDLTYSELFG HIKGAFTGAY
SDRKGIIEEA AGGIVFLDEI GDADPKTQVQ LLRFLDNGGF VRLGENRERI SRVLLVAATN
KDLSKEIAMG NFREDLFHRL TELSVVVPSL NERREDIPDL SIHFLGKLYR TYRSREEEAR
DAEPSLSKDA KDALVSHNYK GNIRELRSIL LRALFFRTGK MVTGEDIKKA IRDGMREQMT
PAAERLSEEV AGGILAEIEA GSDFWEAVYQ PYSQSRISRD VVRLIIEKSR VAAGKNMPEI
ARYLKAITGD PQEDEEERKR FFRFKNFLYK TVKI