Gene GM21_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1038 
Symbol 
ID8136360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1220608 
End bp1222014 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content65% 
IMG OID644868649 
Productputative two component, sigma54 specific, transcriptional regulator, Fis family 
Protein accessionYP_003020857 
Protein GI253699668 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCAA CGCCGTATCC CAGCTTCAGC ATATTGCTGG TCGACGACGA GCCGGCCTGG 
CTCAAGTCCT TCTCGCTCAC CTTGAAGAGC CGGGCGGGGA TCAACAACGT GCTCATCTGC
CAGGACAGCC GCGAGGCGAT GGGGCTCTTG GACCGGGGGG GAGTGGGGCT CGTGCTCCTC
GACCTGACCA TGCCGCATAT CTCCGGCGAG ACGCTTTTGC AGCAGATCGC TGAGAGCCAC
CCGGAGATCA TGACCATCAT CGTGAGCGGG ATGAACCAGC TCGACACCGC GGTGCGCTGC
ATGAAGCTCG GAGCCTTCGA CTACATCGTG AAGACCGACG AGGAGGACCG GCTGGTCGGG
GGGGCGATGC GGGCGATCAG GATGCTGGAG CTGCACCAGG AGTTCCGTGC CATGAGCGAC
CGGATGATCT CGCGCGAGCT GAAGCACCCG GAGGCGTTCG CCGACATCGT CACCAGCGAC
CGCGGCATGC ACGACCTTTT CAACTACGTG GAGGCGGTGT CGCCCAGCCA TCAGCCGCTT
TTGATCACCG GCGAGAGTGG CGTTGGAAAG GAGCTGATCG CCCGCGCCGT GCATGCCTTG
AGCGGCTGCC AGGGGCCGCT GGTCGCCGTC AACGTAGCCG GGCTCGACGA TACCGTCTTT
GCCGACACTC TTTTCGGGCA CGTGCGCGGC GCCTATACCG GCGCGGACCA GGCGCGCCCC
GGGATGATCG AGCAGGCCGG AAACGGAACG CTTTTCCTCG ACGAGATCGG GGACCTGAGC
ATCGCCTCGC AGGTGAAGCT ATTGCGCCTG CTGCAGGAAG GAGAGTATTT CCAGCTGGGG
AGCGACCGGC CCAAGCGGAT GAACGCGCGC ATCGTGGTCG CGACGCACCG GGATCTGGCC
GCCAAGGAGG CCGCCGGGAC TTTCAGGCGC GACCTTTATT ACCGTCTTTG CACGCACCGC
ATCCAGATTC CGCCGCTCAG GGAGCGGACC TCCGACATCC CGCTTTTGCT CGACTATTTC
CTTGAGCAGG CGGCGCAGTC GCTGGGGAAG AAGAAGCCGA CCCCCCCAAA GGAGTTGGCC
CAGATACTCG CCACCTACAG CTTCCCCGGC AACGTGCGCG AATTGCGCGG CATGGTCTAC
AACGCGGTCA GTCTGCACAA GGAGAGGATC CTTTCCATGG ACAGCTTCCT GAAGGCGATC
GGCCGCTCCC GCGTGGAGCA GCCGCTTCCT GCCCTGCACG AAAACCCGTT CCAGTCCTTC
GAGCGCCTGC CCACCTTCGC CGAGGCGGCC GAACTGCTGG TGGAGGAGGC GGTCTCGCGC
GCCAACGGCG TCCAGGCCAT CGCCGCGAGG CTTTTGGGGA TCTCCGCCCC CGCGCTCAAC
AAGCGCCTCA AGCTCAAGCG GAAGTAG
 
Protein sequence
MKSTPYPSFS ILLVDDEPAW LKSFSLTLKS RAGINNVLIC QDSREAMGLL DRGGVGLVLL 
DLTMPHISGE TLLQQIAESH PEIMTIIVSG MNQLDTAVRC MKLGAFDYIV KTDEEDRLVG
GAMRAIRMLE LHQEFRAMSD RMISRELKHP EAFADIVTSD RGMHDLFNYV EAVSPSHQPL
LITGESGVGK ELIARAVHAL SGCQGPLVAV NVAGLDDTVF ADTLFGHVRG AYTGADQARP
GMIEQAGNGT LFLDEIGDLS IASQVKLLRL LQEGEYFQLG SDRPKRMNAR IVVATHRDLA
AKEAAGTFRR DLYYRLCTHR IQIPPLRERT SDIPLLLDYF LEQAAQSLGK KKPTPPKELA
QILATYSFPG NVRELRGMVY NAVSLHKERI LSMDSFLKAI GRSRVEQPLP ALHENPFQSF
ERLPTFAEAA ELLVEEAVSR ANGVQAIAAR LLGISAPALN KRLKLKRK