Gene GM21_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1954 
Symbol 
ID8137288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2264289 
End bp2265656 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content61% 
IMG OID644869568 
Producttwo component, sigma54 specific, transcriptional regulator, Fis family 
Protein accessionYP_003021765 
Protein GI253700576 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones130 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAGG AAAAGATTCT TATCTGCGAC GACGAGGAAG GGATCCTCAT CTACCTGAAG 
AAGCTGCTTA AGACCCAGGG GTACCTGGTC GAGACCTTCG ACTGCGGAGC GGCGCTCTTG
CGCCGACTTA AAGATGGGGA CCCCTCCGAC GCGGACCTGC TTTTGCAGGA CGTAAGGATG
CCGGACATGG ACGGCATCAC CGTGCTCAAG GAGGTGAAGG CGCTTAGGCC CCCGCTTCCC
GTCGTCGTCA TGACCGCCTT CGGGACCATA GACGCCGCGG TGGAGACGAT CAAGATGGGT
GCCTACGACT ACGTCACCAA ACCTTTTCCC AAGGAGAAGA TCCTGAGCGT CATCAAGAAC
GCGCTGGAAA AGGAGCAGCT CTTGCAGGAG AACCGGGCGC TCAAGAGCGA GCTGGAAAAA
CCGATCCTGC AGGAATCGAT CATCTTCAAA AGCGCCGCCT TCCAGGAGAT CTACGACCTC
ACCCTGCAGG TCGCGGCCAG CGAGGCGAAC ATCCTGGTGC TGGGCGAGTC CGGGACCGGC
AAGGAGCTCA TCGCCGGCGC CATCCACTAC AACAGCTTAA GGCGCGACCG GCGCTTTCTC
TCCATCAACT GCGCGGCCTT GACCGACACG CTCCTGGAGA GCCAGCTCTT CGGGCACGTA
CGGGGCGCCT TCACCGGCGC GGTCGCGGCA CAGAAGGGAC TATTGGAGGA GGCCGACGGC
GGCACCCTCT TCATGGACGA GATCGGCGAC ATGACCCTCC CCATCCAGGC GAAGCTTCTG
CGCGTGATCC AGGAGCGCGA CTTCATCCCG GTGGGATCGA CCCGCCCCAA AAGCGCCGAC
ATCCGCTTCG TGGCCGCCAC CAACAAGAAC CTGGAACTGG AGGTGCGGGA GGGGCGTTTC
CGAGAGGACC TCTTCTACCG GCTGAACGTG ATCAACATCC CGCTGCCGCC GCTTAGGGAG
AGAAAGGACG ACGTGGAACC CCTGGCGCTG CACTTCCTGA AGAAGTACAG CCTGAAGATG
AAAAAGCAGG TTTCCACCCT CACCCCCGAG GCGCTGCAGC TCCTTTACGG CTACGACTGG
CCCGGCAACA TCAGGGAGCT GGAAAACGTC ATGGAGCGCG CCGTCATCCT GGCCCGGACC
CAGACGGTGA CCGCAAAGGA GCTTCCCATC TGGCGCAAAC AGCCGCAAAA GGTGGAAGCG
CCCCGCGAGG CTCAGTTCGT CTCGCTGGAA AACGTGGAGA AGGAGGCCAT CGTGCGGACC
CTCTCCGGCA CCGGGTTCCA CAAGAGCAGG TCGGCCGAGA TCCTGGGCAT CTCCAGAAAA
ACCCTGGACC GCAAGATCGT GGAATACCGC ATCACCATCC CCTCATGA
 
Protein sequence
MRQEKILICD DEEGILIYLK KLLKTQGYLV ETFDCGAALL RRLKDGDPSD ADLLLQDVRM 
PDMDGITVLK EVKALRPPLP VVVMTAFGTI DAAVETIKMG AYDYVTKPFP KEKILSVIKN
ALEKEQLLQE NRALKSELEK PILQESIIFK SAAFQEIYDL TLQVAASEAN ILVLGESGTG
KELIAGAIHY NSLRRDRRFL SINCAALTDT LLESQLFGHV RGAFTGAVAA QKGLLEEADG
GTLFMDEIGD MTLPIQAKLL RVIQERDFIP VGSTRPKSAD IRFVAATNKN LELEVREGRF
REDLFYRLNV INIPLPPLRE RKDDVEPLAL HFLKKYSLKM KKQVSTLTPE ALQLLYGYDW
PGNIRELENV MERAVILART QTVTAKELPI WRKQPQKVEA PREAQFVSLE NVEKEAIVRT
LSGTGFHKSR SAEILGISRK TLDRKIVEYR ITIPS