Gene GM21_2855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2855 
Symbol 
ID8138198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3321829 
End bp3323577 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content62% 
IMG OID644870456 
Productsigma54 specific transcriptional regulator, Fis family 
Protein accessionYP_003022645 
Protein GI253701456 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones154 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGT CGCAGAAAAA GAAGAACATC TCCGGCGAGG CGACGAAGAA CAAGGTGCGC 
TCCCTGGTCC ATTTCTGCTC CGACACCGGC AATATCTGGC TGCACGAGCA CCGCATGCTG
CTGATTCACG CGGAGGCGCA AGGGGCGATG CGCCGGGAAC TGATCGACAC ACTCGGCATG
GACCGGGCGC GGGCGCTGCT CATGCGCATG GGGTTCGCTT CCGGGGCCCA GGACGCCGAG
GTGTTCAAAA GACAGCTGGA GGGGGTGTCG GACCAGGAGG CGTTCCTGAT CGGGACACAA
CTGCACACCC TGGAGGGAAT CGCCAAGGTA ACCACCCTGG AGCTGGAGGT GGACCGTGCG
GCCGGCAAAT ATTCCGGCCA GTTCATGTGG GAGAACTCCT GGGAAGGTAA CTGGCACAAG
AAGCATTACG GAATCCACAC CGCCCCGACC TGCTGGAGCC AGTTGGGCTA TGCCTGCGGC
TACACCTCAA CCGTCATGGG GCGCCCGATC CTCTACAAGG AGGTGGAGTG CGTCGGCAAG
GGAGACAGGT ATTGCCGCAC CGTGGGAAAA CCGCTTGAGG AGTGGGAGGA CGCTGCCGAA
TTCAGGAGGA TATTCCATCC CGACCCCATC GTCGACGAGC TGATGGAACT GCAGACCCAG
GTGGTGGAAC TGCGCGCCGC CTTCAACGAG AAGGAGAAGC TCCCCGCCGA CGTGGTGGGC
AAGTCCCAGG CCTTCATCAC AGCCTTCGGT CTGTTGAAGC AGGCGGCGGG AAGCCAGATC
ACCGTCCTTT TGCAGGGGGA GACGGGGGTG GGCAAGGAGG TGTTCGCGCG CACCCTGCAC
GAGAAAAGCA GCAGGAGCAA GGAGTCTTTC ATCGCCGTGA ACTGCGCGGC CATCCCCCAC
GACCTGGTGG AGTCGGAGCT CTTCGGCGTG GAAAAGGGAG CCTACACAGG CGCCCTCACC
TCGCGCCCCG GGCGTTTCGA GCGGGCGAGC GGCGGCACCT TGTTCCTGGA CGAGATCGGC
GATCTGCCGC TGCCGGCGCA GGCAAAACTT CTGCGCGTGC TGCAGGAAGG CGAAATCGAG
CGACTGGGAG ACCACAAGGT CCGCAAGGTT GACGTAAGGC TCGTGGCGGC GACCAACATC
GACCTGAAAC AGCTCGTGCA AGAAGGGAAA TTCCGCTCCG ACCTCTACTA CCGCCTGAAC
GCGTTCCTGG TCAAGATCCC TTCTTTAAGG GATCGCAAGG AGGACATCCT GCTCCTTGCC
GATCGCTTCG TGGAGAAGTA CGCGGCGATA CAGGGAAAGA AGCTTAAGGG ATTCACCGAC
AAGGCCAAAC GGGCACTTTT GGCGTACCAG TGGCCGGGAA ACATCCGGGA ATTGCAGAAC
ATGGTGGAGC GGGGGGTGAT CCTGGCCCAG CCCGGCGCCC GGATAGAGCT GGACCAGATG
TTCTCCTCGA CCGCTGAGGA AGGAAGCGTG GAGTACGGCG TGAGCAGCAC CGGCAGCCTC
GACATCAACC GGGACTCCGC GGGAAAGGAG CTCTGCGAGG CGGTTTTAGA CGGCGGCCTG
ACGCTTGAGC AGGTGGAGGG GATGCTGATC AGGGCCGCTG TGGAGAAGGA GGGGGGAAAC
CTCGCCGCTT CCGCGAGGGC GCTGGGTCTC ACCCGCCCCC AGCTTGCCTA CCGCCTGGGA
AGCCTGCAGC AAAAAGGGGA CACGTCTTTC CGCCTCCAGG ATCCTTTCGC CAACCCCGAC
CTCTACTGA
 
Protein sequence
MSKSQKKKNI SGEATKNKVR SLVHFCSDTG NIWLHEHRML LIHAEAQGAM RRELIDTLGM 
DRARALLMRM GFASGAQDAE VFKRQLEGVS DQEAFLIGTQ LHTLEGIAKV TTLELEVDRA
AGKYSGQFMW ENSWEGNWHK KHYGIHTAPT CWSQLGYACG YTSTVMGRPI LYKEVECVGK
GDRYCRTVGK PLEEWEDAAE FRRIFHPDPI VDELMELQTQ VVELRAAFNE KEKLPADVVG
KSQAFITAFG LLKQAAGSQI TVLLQGETGV GKEVFARTLH EKSSRSKESF IAVNCAAIPH
DLVESELFGV EKGAYTGALT SRPGRFERAS GGTLFLDEIG DLPLPAQAKL LRVLQEGEIE
RLGDHKVRKV DVRLVAATNI DLKQLVQEGK FRSDLYYRLN AFLVKIPSLR DRKEDILLLA
DRFVEKYAAI QGKKLKGFTD KAKRALLAYQ WPGNIRELQN MVERGVILAQ PGARIELDQM
FSSTAEEGSV EYGVSSTGSL DINRDSAGKE LCEAVLDGGL TLEQVEGMLI RAAVEKEGGN
LAASARALGL TRPQLAYRLG SLQQKGDTSF RLQDPFANPD LY