Gene GM21_1405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1405 
Symbol 
ID8136733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1652219 
End bp1653655 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content61% 
IMG OID644869019 
Producttwo component, sigma54 specific, transcriptional regulator, Fis family 
Protein accessionYP_003021222 
Protein GI253700033 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0000117702 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCATCA ACAATATTTT GGTTGCCGAC GACGAAGAGA GCATGCGCTG GGTCCTTTCC 
AAGGCGCTCA GGAAAAAAGG TTTCAACGTG GAGCTCGCCC GCGACGGCAA CGAGGCTCTC
AGGCTGATCA AGGAGAACGT CTACGACCTG GCGCTTTTGG ACATCAAGAT GCCGGGGATG
ACGGGACTCG AACTGCTCGA CCGGGTGAAG GAGGAGCGCA GCGACCTTTT CGTCGTGATC
ATGACCGCCG AGGCGAGCAT GAAAAACGCG GTAGAGGCGA TGAAGCGCGG CGCCTACGAT
TACCTGACCA AGCCCTTCGA CCTCGGGGTG ATCGACGCGG TGGTCGAGAA GGTGAGCCGG
GCGCGCGAGA TGACCAGCCA GGTCTCGCTC CTCAAGGAGG AGCTCAAGGA CCGCTACCAA
CTGGAGAAGA CCATCATCGG CAACTCCCCC GCCATGAGGG ATGTCTACAA GACCATCGGC
AAGGTGGCTC CCAGCGACGT GACGGTGCTG GTGCAGGGCG AATCGGGGAC CGGTAAGGAA
CTGATCGCCC GCGCCATCCA CTACAACTCC AAGCGTCTGG GGAAGCCGTT CATACCGCTC
AACTGCGCCG CCATCCCCAA GGAGCTTCTG GAGAGCGAGC TCTTCGGTTT CGAGAAGGGG
GCCTTCACCG GCGCCACGGA GAGGAAGCTC GGCAAGTTCG AGCAGGCAAA CGGCGGCACC
ATCTTCCTGG ACGAGATAGG AGACATGCCA CTCGATCTGC AGGCCAAGAT CCTGAGGGTG
CTGCAGGAGC GCGAGATCAC CAGGACCGGC GGGAACCAGA GCATCCAGGT CGACGTCAGG
GTGGTGGCCG CGACCAACCA GGACCTCGAG GAGAGCGTGC GCAACAAGCA GTTCCGCGAG
GACCTCTACT ACCGGTTGAA CGTGATCCCG ATCCATCTGG TGCCGCTCAG AGACAGGAAG
GAGGACGTGC CGCTTCTGGT GCAGTACTTC CTCACCAAGA TCTGCGCCGA GCTGGAGGTC
CCCCTAAAAC GCTGCTCCGC CGATGCCATC AAGCTCCTGA CCAACTACAG CTGGCCCGGC
AACATCCGCG AGCTGGAAAA CACCATCAAG AGGGGCGTAA TCCTCTCGCC GGACCCGCTC
TTGGTCGCCT CCGACTTCCC GGGGCTGCGC AGCCGCCAGA GCGAGGGCAA AGGAGAGGAG
CTTTCCCTGG AGGGGATCGT CGATCTGAAG CTGCGGGGCT GCTTCAGCAA CATGGAGAAG
ATGGAGAGCG GCGACGTGCA CGCCATGGTG CTGGAGCAGG TGGAGCGGCC GCTGATCCGC
TTCGTCCTGG AGAAGACGCG AGGCAACCAG GTTCGCGCCG CCGACATTCT CGGCATCAAC
CGCAACACCC TCAGAAAGAA GATAACCGAG CTAGGCATCG AGCTGCGCAA GGAATAG
 
Protein sequence
MSINNILVAD DEESMRWVLS KALRKKGFNV ELARDGNEAL RLIKENVYDL ALLDIKMPGM 
TGLELLDRVK EERSDLFVVI MTAEASMKNA VEAMKRGAYD YLTKPFDLGV IDAVVEKVSR
AREMTSQVSL LKEELKDRYQ LEKTIIGNSP AMRDVYKTIG KVAPSDVTVL VQGESGTGKE
LIARAIHYNS KRLGKPFIPL NCAAIPKELL ESELFGFEKG AFTGATERKL GKFEQANGGT
IFLDEIGDMP LDLQAKILRV LQEREITRTG GNQSIQVDVR VVAATNQDLE ESVRNKQFRE
DLYYRLNVIP IHLVPLRDRK EDVPLLVQYF LTKICAELEV PLKRCSADAI KLLTNYSWPG
NIRELENTIK RGVILSPDPL LVASDFPGLR SRQSEGKGEE LSLEGIVDLK LRGCFSNMEK
MESGDVHAMV LEQVERPLIR FVLEKTRGNQ VRAADILGIN RNTLRKKITE LGIELRKE