Gene GM21_2869 details


Gene Information

Locus tag: GM21_2869
Symbol:
ID: 8138212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3340219
End bp: 3342165
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 57%
IMG OID: 644870470
Product: transcriptional regulator, NifA subfamily, Fis Family
Protein accession: YP_003022659
Protein GI: 253701470
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
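The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the coordinates are 1-based and inclusive (the record does not say so, but that convention reproduces the listed 1947 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for GM21_2869.
length_bp = gene_length(3340219, 3342165)   # 1947
length_aa = protein_length(length_bp)       # 648
```

Both results match the Gene Length (1947 bp) and Protein Length (648 aa) fields of the record.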


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value: 0.990966
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTTGT CGAGCGATCC TGCTGCGGAG TTTTTCCTGG GCAAATTCAG TAATCTGGAA 
CAGCTGTTCG GTCAGGCCCA GTTCGGTGTC TCCATCATGG ACTGCGACCT GCGCTACGTC
TACGTCAATG ACCGGATGGC GTCGATCAAC GCTTTGCCCC CCAAAGAGCA CCTGGGGAAG
ACGGTAAGGG GGGTTGACCC CGCCGTCGCC ATGGTGGTCG AGCCCTTGCT GAACCGAACC
ATCCACGAGG ATCAATCCTT CGTGGACCTT GAGATACGGA TTTCGAAACC GGGGGAACCC
CAGCAACGAT CTTGGTTGAT ATGCTCGTAT CCCCTCAAGG AAAGCGACGG TTCCGTGACC
CGCTGCGGGC TGGTGGTCCA GGACATAACC GAGCAGAAGA TAAGGCAGAG CCTGCAGGCT
GAGCGGCTGC AGGTAGAGGC GTTCCTCTCC AACCTCTCCT CCGCTTTTAT CAACGTGTCG
GTGAGCGATG TGGATCGCAA GATAGAAAAA GGTTTGCAAA AGGTCGTCAA TTTCCTCGGC
TTCGAAAGAA GCAGCATCTG GCACTTCACA CCTGATCGTA GCAGTTTGCG CATCACCCAT
TCCTACGCGC TGCCCGGCGT CAAACAACCT CCCGCAGAAC TGATGAACCT GATCCCGGTC
TGGATCGACA TGATCATGGT GGGAGAGATC TTCCGTATTT CCGACGTTGA GGAATTGCCG
GACAGGTTTT GGAGGGAAAA GCAGTATTGC AGGGACCAGG GCGGCATCAA ATCCATCATG
TTCATCCCCG CCAGCGTCGG CGGCACCATA GTCGGCGCCA TCACCTTCGT TTCCTACAGC
ATCAAGAAAG AGTGGCCCGA TGAGCTGACT CAGAGACTGC GCCTATTATG GGAGATTTTC
GCCAACGCCC TCGAGCGCAA AAGGGCCGAT CAGAAAATAC AGAACGCCTT GGCTGAGATA
CGGCAGCTAA AGGACCGTCT TGAGGCGGAA AACGTCTATC TGCGCGACCA GATCGACGTG
GAATACAAGC ACGAGGAGAT CATCGGCAAG TCGGTGGCCG TTCGCAACGT GCTGCAGCAG
ATTCAACAGG TGGCACCTAC CGATTCCACC GTCCTCATCC TGGGCGAAAC CGGCACCGGG
AAGGAACTGA TAGCGAGGGC CATCCACAAC GCCGGCCAGC GCAAGGCCCG CGCCATGATC
AAGGTGAACT GCGCCGCGTT GCCGGCAGCT CTCATCGAGG CTGAGCTGTT CGGTCATGAA
AAAGGGGCCT ATACCGGAGC CGTATCCTCC CAGATCGGGC GCTTCGAGGC CGCTAACGGC
TCGTCCATCT TCCTGGACGA GATAGGGGAG CTCCCCCTGG AGCTTCAGTC GAAGCTGCTG
CGGGTCATCC AAGAAGGGCA GTTCGAGCGG CTGGGAAATC CAAAGCCGGT CAAGGTGGAC
GTGAGGGTGA TCGCCGCGAC CAACGTGAGC CTGGCGCAGG CAGTAAAGGA AGGGAAATTC
CGGCAGGATC TCTATTACAG GCTTAACGTC TTCCCCATCT TTGTCCCTCC GCTGCGTGAC
CGGCATGAGG ACATACCCCT CTTGGTGTGG GCGATGGTGG AGGAGTTCTC CAAGGTTTTC
GGCAAGACTA TCGAGCGCAT CCCCAGAAGA AATATGGAGG TCATGGAGCG CTACAGTTGG
CCGGGAAACA TAAGAGAGCT GAGAAACCTG GTGGAGCGGG CCATGATCCT CAGTAACGGC
GGCACCCTGG TGGTGGACAT TCAGGACGGA TCCGCCGCGA CCGCCAACGA GCCGATGACG
ATGGAAGCCG CGGAACGAAA CCATATAGCC CTTGTATTGG AAATGACGGG CTGGCGCATT
CGCGGAAAGG GTGGAGCCGC AGAAATTCTC GACATGAAAC CCACGACGCT GCATTCCCGG
ATCAAGAAGC TCGGCGTCAG CAAATAG
 
Protein sequence
MALSSDPAAE FFLGKFSNLE QLFGQAQFGV SIMDCDLRYV YVNDRMASIN ALPPKEHLGK 
TVRGVDPAVA MVVEPLLNRT IHEDQSFVDL EIRISKPGEP QQRSWLICSY PLKESDGSVT
RCGLVVQDIT EQKIRQSLQA ERLQVEAFLS NLSSAFINVS VSDVDRKIEK GLQKVVNFLG
FERSSIWHFT PDRSSLRITH SYALPGVKQP PAELMNLIPV WIDMIMVGEI FRISDVEELP
DRFWREKQYC RDQGGIKSIM FIPASVGGTI VGAITFVSYS IKKEWPDELT QRLRLLWEIF
ANALERKRAD QKIQNALAEI RQLKDRLEAE NVYLRDQIDV EYKHEEIIGK SVAVRNVLQQ
IQQVAPTDST VLILGETGTG KELIARAIHN AGQRKARAMI KVNCAALPAA LIEAELFGHE
KGAYTGAVSS QIGRFEAANG SSIFLDEIGE LPLELQSKLL RVIQEGQFER LGNPKPVKVD
VRVIAATNVS LAQAVKEGKF RQDLYYRLNV FPIFVPPLRD RHEDIPLLVW AMVEEFSKVF
GKTIERIPRR NMEVMERYSW PGNIRELRNL VERAMILSNG GTLVVDIQDG SAATANEPMT
MEAAERNHIA LVLEMTGWRI RGKGGAAEIL DMKPTTLHSR IKKLGVSK
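
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is one way to do that in plain Python, then compute GC content and translate the CDS. It assumes the amino-acid assignments of translation table 11 match the standard code, so the compact 64-letter table used here applies. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are illustrative and do not come from any particular library.

```python
from itertools import product

# Codon table built in TCAG order. Translation table 11 (as listed in the
# record) is assumed to share these amino-acid assignments with the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGGCCTTGT CGAGCGATCC TGCTGCGGAG TTTTTCCTGG GCAAATTCAG TAATCTGGAA"
peptide = translate(clean_sequence(first_line))  # "MALSSDPAAEFFLGKFSNLE"
```

The translated peptide agrees with the first 20 residues of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence gives a value to compare against the GC content field of the record.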