Gene GM21_1986 details

Gene Information

Locus tag: GM21_1986
Symbol:
ID: 8137320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2300001
End bp: 2301374
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 62%
IMG OID: 644869599
Product: two component, sigma54 specific, transcriptional regulator, Fis family
Protein accession: YP_003021796
Protein GI: 253700607
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
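
The length fields in this record follow from its coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive (an assumption, though the listed values are consistent with it) and that the unspliced CDS ends in a single stop codon, the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2300001, 2301374)
print(length_bp)                  # 1374
print(protein_length(length_bp))  # 457
```

Both results agree with the Gene Length and Protein Length fields listed in the record.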


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value: 0.546437
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTACAACG GCAGGGTGAT CATTTGCGAC GACGAGGTCG AGATTCTCAG GTATCTGAGT 
AAGATTCTGA CGGCCAAGGG GCTCGACCTG GAGGTTTTCA GTTCCGGCAC GTCGCTCATG
AGCTACCTGG AGAACCGCGG AGTGGACGCC GCGGACCTGC TGCTGCTCGA CGTCAGGATG
CCTGACCTGG ACGGCCTGGA GATACTGCAG CGGGTGAAGA AGATGAAGGT CGAGGTTCCG
GTGGTGATGA TGACCGCCTA CGCCTCGATC AACTCGGCGA TAGAGGCGAT GAAGCTTGGC
GCCTACGATT ACGTCACCAA GCCCTTCCCC AAAGAGAAGA TCTTCGGGAT CCTGGAGAAG
GTCCTGGAGC GCAAGGAACT CCTGAAGGAG AACTCGGCAC TCAAGGACGA GTTGGGGATA
CCCTCCCCTT CCACTCCGCT CATCTTCTCC TCCGAGAGCT TCAGGCGGGT TCACGAAATG
GCGATGCTGG TGGCCCAGAG CGAGTCCAAC GTCGTCATAC TGGGGGAATC TGGCACCGGC
AAGGAGTTGG TGGCGAGCCT GATCCACAAC AACAGCCCCA GGGCAAAGGA GCGCTTTCTT
ACCATCAACT GCGCGGCGCT CTCGGACACG CTTTTGGAAA GCCAGCTCTT CGGCCACGTG
CGCGGAGCCT TCACCGGCGC GGTGAACGCC CAGAAGGGGC TACTGGACGC AGCCAACCAC
GGCACCCTGT TTCTGGACGA GGTAGGGGAC ATGAGCCCGG CCATCCAGGC GAAGCTTCTG
CGCGTGCTGC AGGAGGGGGA TTTCATACCG GTCGGCGACA CCAAGGCGAG AAGCGTAGAC
ATCAGGTTCG TGGCCGCGAC CAACAAGGAC CTGGAGGAAG AAGTGCGCCT AAAAAGATTC
AGGGAGGACC TCTACTTCCG CCTCAACGTG ATCTCCCTGC ACTTGCCGCC GCTTAGGGAG
CGCGGCGACG ACATCGAGCT CTTGGCCCGG CACTTCCTGA CCAAGTACGC GGCCCGCATG
AAGCGCGAGA TCACTGACTT CACCCGCGAG GCGATGCAGC TCCTCACCTC CTACAACTGG
CCCGGCAACA TCCGGGAGCT GGAAAACGCC ATCGAGCGCG CGGCCATCCT GACCCGCGGC
CACACCATCA CCGCGGAGAC CCTGCCGGTT TGGCGCAACC CTCAGCCGGC GGAGGCGAAG
ACTGGGGTGA GCCGGCTGGT CTCGCTCGAG ACTGTGGAGC GCGAGCACAT CCTGCACGTC
CTCAAGGCCT CAGGGAACAA CAAGAGCCGC GCTGCCAAGA TCCTCGAGAT CGCCAGGCGC
ACGCTGGACC GCAAGATCGA GGAATACGGC CTGGAAGACG GGGGCTCCCG CTGA
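
The gene sequence is printed in blocks of ten bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is given below. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed figure is not the whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted listing into one uppercase string without whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Sample input: the first line of the gene sequence only.
first_line = "ATGTACAACG GCAGGGTGAT CATTTGCGAC GACGAGGTCG AGATTCTCAG GTATCTGAGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content field.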
 
Protein sequence
MYNGRVIICD DEVEILRYLS KILTAKGLDL EVFSSGTSLM SYLENRGVDA ADLLLLDVRM 
PDLDGLEILQ RVKKMKVEVP VVMMTAYASI NSAIEAMKLG AYDYVTKPFP KEKIFGILEK
VLERKELLKE NSALKDELGI PSPSTPLIFS SESFRRVHEM AMLVAQSESN VVILGESGTG
KELVASLIHN NSPRAKERFL TINCAALSDT LLESQLFGHV RGAFTGAVNA QKGLLDAANH
GTLFLDEVGD MSPAIQAKLL RVLQEGDFIP VGDTKARSVD IRFVAATNKD LEEEVRLKRF
REDLYFRLNV ISLHLPPLRE RGDDIELLAR HFLTKYAARM KREITDFTRE AMQLLTSYNW
PGNIRELENA IERAAILTRG HTITAETLPV WRNPQPAEAK TGVSRLVSLE TVEREHILHV
LKASGNNKSR AAKILEIARR TLDRKIEEYG LEDGGSR
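
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases, using the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons). The `translate` helper is illustrative, handles only unambiguous A/C/G/T input, and is not the method used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate unambiguous DNA in frame, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTACAACG GCAGGGTGAT CATTTGCGAC"))  # MYNGRVIICD
```

The output matches the first ten residues of the protein sequence listed in the record.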