Gene GM21_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3903 
Symbol 
ID8139277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4489255 
End bp4490622 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content62% 
IMG OID644871520 
Producttwo component, sigma54 specific, transcriptional regulator, Fis family 
Protein accessionYP_003023678 
Protein GI253702489 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.000574971 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGGCA GCATCCTGAT AGTGGACGAC GAGAAGGGGC AGCGGGACAT ACTCACTGCC 
ATACTCACCA AGATGGGGTA CAAGATTCAG ACCGCCTCGG GCGGTGAAGA GGCGCTGCAG
CAACTGCAGC ACAAAGAATT CGACCTCCTC CTCACCGACC TCAAGATGCA GGGAATTTCA
GGGATGGAAC TCATGGAGCG GGTGCTGGCC GACGACCCCT CCCAGTGCGT GGTGATGATG
ACCGCCCACG GCACCATCGA CTCGGCAGTG GAGGCGATGA AGAAGGGGGC CTTCGACTAC
CTCGAAAAGC CCCTGGAGCG CGAGGACCTG ATCCTCACCG TGCAGCGGGC CTTCGAGCGG
ATCGGGCTTT TGAAGGAAAA CAAGGCGCTG CACAAAAAGC TCGCGGAAAC GAAAAGGCTT
CCCAACATGA TCGGGGAGCA CCCGAAGATG CACGAGGTGG TACGGATCAT CAACAAGATA
GCCCCCACCT CCACGACGGT CCTCATCTAC GGCGAGTCGG GGACCGGCAA GGAACTCGTG
GCGCGCGCCA TCCACGACGG GAGCCCGCGC AGGGACAAGG CCTTCTTCGC CATCAACTGC
GCGGCGATTC CCGACTCGCT TATGGAAAGC GAGCTCTTCG GGCACGAGAA GGGGGCCTTC
ACCGGCGCGG GCACCCGGGA AATCGGTCTT TTCGAGGCAG CCGAGGGGGG GACCGTCTTT
CTCGACGAGA TCGGCGAGAT GAACATCGCC ATGCAGGCGA AGCTCTTGCG CGCCATACAG
GAGAAGGAGA TCCGCAGGGT CGGGGGGAAG GTGAACATCC CCGTCGACGT CAGGATCATC
TCCGCCACCA ACAAGGACCT GGAACAGGAG ACCCGGCGCG GCAACTTCCG CGAGGACCTC
TTCTACCGGC TCAACGTCAT CAGGATCGCG CTTCCCCCCC TGAGGGAGCG CGGCAACGAC
ATCGTGACGC TGGCCGATTT CTTCCTGAAG AAGTACAGCG CCTCCTGCGG CATCCCCCTG
AAGGGAATCG CCAAACCGGC GCTGAAGATC CTGCTCGACT ACAGCTGGCC CGGCAACGTG
CGGCAACTGG AATCGGTCAT CGAGCGCGGC GTGCTCATGG CCGAGAGCGA ATACATCCAG
CCCGAGGACC TGCCGGCCGA GGTGCACCAC GAGGCCTCCC CCGCCGGGAG CCTCCCCTTC
GACTTCCCGG CAGAGGGGAT ATCCATCGAC AACCTGGAAC GGGACCTGAT CGTCAAGGCG
ATGCAGAGAA CCGACTGGGT CATCGCCAAA GCGGCACCAC TACTCGGTAT GAGCTACAAG
ACGCTGCAAT ACCGGTTGGA AAAGTTCGAC ATCGGCAAAC CGGAGTAG
 
Protein sequence
MIGSILIVDD EKGQRDILTA ILTKMGYKIQ TASGGEEALQ QLQHKEFDLL LTDLKMQGIS 
GMELMERVLA DDPSQCVVMM TAHGTIDSAV EAMKKGAFDY LEKPLEREDL ILTVQRAFER
IGLLKENKAL HKKLAETKRL PNMIGEHPKM HEVVRIINKI APTSTTVLIY GESGTGKELV
ARAIHDGSPR RDKAFFAINC AAIPDSLMES ELFGHEKGAF TGAGTREIGL FEAAEGGTVF
LDEIGEMNIA MQAKLLRAIQ EKEIRRVGGK VNIPVDVRII SATNKDLEQE TRRGNFREDL
FYRLNVIRIA LPPLRERGND IVTLADFFLK KYSASCGIPL KGIAKPALKI LLDYSWPGNV
RQLESVIERG VLMAESEYIQ PEDLPAEVHH EASPAGSLPF DFPAEGISID NLERDLIVKA
MQRTDWVIAK AAPLLGMSYK TLQYRLEKFD IGKPE