Gene GM21_3098 details


Gene Information

Locus tag: GM21_3098
Symbol:
ID: 8138448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3588775
End bp: 3590250
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 60%
IMG OID: 644870702
Product: two component, sigma54 specific, transcriptional regulator, Fis family
Protein accession: YP_003022884
Protein GI: 253701695
COG category: [T] Signal transduction mechanisms
COG ID: [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
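
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3588775, End bp 3590250
length_bp = gene_length(3588775, 3590250)
print(length_bp)                  # 1476
print(protein_length(length_bp))  # 491
```

Both results agree with the listed Gene Length (1476 bp) and Protein Length (491 aa).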


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 133
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAACA AGAACCGCAT TTTAGTGGTC GACGACGAAA AGCTGATCTC GTGGTCTCTG 
GCAACCATGC TGAAAAAAGA GGGGTACGAC GTGGAAACCG CGGCCACCGG CAACGAGGCT
ATCAACAGGT TCGCGGAGTT TCGCCCCCAG CTGGTGCTAT TGGACGTCTG CCTCCCCGAC
GTGAACGGAC TCGAGCTGCT CAAGCGCTTC AAGAGCGTCA ATGAAGATCT CTACGTGATC
ATGATCACCG CCTACGCCCA CGCCGATTCC GCGGTGCAGG CGCTGCAGGA AGGGGCGGAG
GACTACTTCG GCAAACCGTT CAACCTGGAC GCCGTGAAGC ACGTGGTGAA CAAGGCGTTC
GAAAAGACCC AGCTCAAGAA AGAGGTGGAC TACTTCCGCG GGGAGCTGCG CAAGAAGTCG
GACCAGGACA AGCTGGTCGG CAACAGCCAG AAGATGATCG AGGTCTTCAA GATGATCAAG
GTCTGCGCCG ATGCCGACGC GAAGACGGTG CTGGTAACCG GCGAGAGCGG CACCGGCAAA
GAGCTCGTGG CGAAGGCGCT CCACATGCAC AGTGGGCGCT CCGAGGCCCC CTTCATCGAG
GTGAACTGCG CCGCGATTCC CGAGAACCTC CTGGAGAACG AGCTGTTCGG CCACGAAAAG
GGGGCCTACA CCGACGCCTC GAAACGCCAC AAGGGGGTCT TCGAGATGGC GGAGGGGGGG
ACGGTCTTCC TCGACGAGAT CGGGGACATG CCGTTTCTCA TGCAGGCCAA GATCCTCAAG
GTGATAGAGA GCAAGCGCTT CCGCCGCCTG GGAGGGCAGG AAGACGTCGA GGCGAACGTC
AGAATCATCA CGGCGACGCA CCAAAACCTG CAGCAGATGG TGAAGGAAGG GAAGTTCCGC
TCCGACCTCT TTTTCCGGCT CAACGTGATG AACATCGCGC TCCCGCCTTT GCGCGACAGA
AAGGAAGACG TCCCGGCCCT GATCCAGTAC TTCATCAAGA CCCTGAACGA CGAGTACGGC
AGAAGCGTCG AGGGGGCCTG CGAGGACACC ATGGAGTACC TGACAGGCTA CGACTGGCCC
GGCAACGTGC GCGAACTGCG CAACTGCATA GAGCGGCTCA TGATGCTGGA AAAGGAGAAG
ATGCTGGGGA GCGAGCACCT GAGCGCGGAG ATCACCCAGA GAAGCAGGCA GGGGAACCAG
ATGATGAGGG CCAACAGCAA CAACGAATTC GCCGGAGAGC ATATACTGCT CCCCCCGGAG
GGGATATCGC TGGAGGAGCT GGAAAAGCTG ATCATACAGC TTGCCCTTAA GAAATCGGGG
GGCAACCAGA CCAAGGCGGC CAAGTATCTG AAGACCAGCA GGGACACCTT GCGCTACCGG
ATGAAGAAGT TCGGGCTGGG TGAAAACGGC AGGGAGGAAG GGCGCGGCGA CAGCGAGGAA
CCGGAGGGGG AGCAGATGGT CCCATACGAC GCTTGA
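
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below, with hypothetical helper names, shows one way to clean such a display and compute a GC percentage that could be compared against the GC content field. It assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(display_text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# The first displayed line of the gene sequence, copied from the record
first_line = "ATGATCAACA AGAACCGCAT TTTAGTGGTC GACGACGAAA AGCTGATCTC GTGGTCTCTG "
bases = clean_sequence(first_line)
print(len(bases))         # 60 bases per displayed line
print(gc_percent(bases))  # GC percentage of this line only, not of the whole gene
```

Passing all displayed lines of the gene sequence to `clean_sequence` should yield a string of 1476 bases, matching the Gene Length field.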
 
Protein sequence
MINKNRILVV DDEKLISWSL ATMLKKEGYD VETAATGNEA INRFAEFRPQ LVLLDVCLPD 
VNGLELLKRF KSVNEDLYVI MITAYAHADS AVQALQEGAE DYFGKPFNLD AVKHVVNKAF
EKTQLKKEVD YFRGELRKKS DQDKLVGNSQ KMIEVFKMIK VCADADAKTV LVTGESGTGK
ELVAKALHMH SGRSEAPFIE VNCAAIPENL LENELFGHEK GAYTDASKRH KGVFEMAEGG
TVFLDEIGDM PFLMQAKILK VIESKRFRRL GGQEDVEANV RIITATHQNL QQMVKEGKFR
SDLFFRLNVM NIALPPLRDR KEDVPALIQY FIKTLNDEYG RSVEGACEDT MEYLTGYDWP
GNVRELRNCI ERLMMLEKEK MLGSEHLSAE ITQRSRQGNQ MMRANSNNEF AGEHILLPPE
GISLEELEKL IIQLALKKSG GNQTKAAKYL KTSRDTLRYR MKKFGLGENG REEGRGDSEE
PEGEQMVPYD A
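
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own method: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in their permitted start codons, which this example ignores). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with one-letter amino acid codes.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned, upper-case coding sequence up to the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon terminates translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and last 12 bases of the gene sequence in this record
print(translate("ATGATCAACAAGAACCGCATT"))  # MINKNRI
print(translate("TACGACGCTTGA"))           # YDA
```

The two fragments match the start (`MINKNRI...`) and the end (`...YD A`) of the listed protein sequence, with the final `TGA` codon acting as the stop.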