Gene GM21_1805 details


Gene Information

Locus tag: GM21_1805
Symbol:
ID: 8137136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2101875
End bp: 2103326
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 61%
IMG OID: 644869417
Product: PAS modulated sigma54 specific transcriptional regulator, Fis family
Protein accession: YP_003021617
Protein GI: 253700428
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID: [TIGR00229] PAS domain S-box
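
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2101875, 2103326)   # Start bp and End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths match the listed Gene Length (1452 bp) and Protein Length (483 aa).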


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.00407865
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTGAAC CGGTTATGAA TACTGCTCCC TCGCTCGATC TCGAGGAGAT GGCGCGCCAG 
ATGCGGGCGT TTCAGGATCT GACCCGCGAG CTGGACGCGA TCATCGATTC GTCCTCGGAC
GGGCTCTGGA TCTGCGACGC CGAGGCCCGG GTCATCCGCA TCAACCCTGC CTCGGAGCGC
ATCAACAACA TAAAGGCCTC GGAAGTTGTC GGTAAGAACA TGCGGGAACT CCTCGATGAA
GGTTTCATCG ACCGTTCGGC GGCACTTGAG GCGATCACGA CCAAGAAGGT GGTCAGCCAA
CTGCAGAATA GGGAAGGGCG CAAGCTCATC TCGACGGGGA CCCCGGTTCT GGACGCGAAC
GGCGAGGTGA TCCGGGTCGT GGTGAGCGAG CGGGACATCA CGGAAATCGA TAACTTGCAG
CGCGAACTGG AAGAGCAGGA GGCGCTGCGG GATCAGTTCC GCAACCACAT GCTGGAACTT
CAACAGGCGG ACGTGGCATC CAAGAGCGTC GTCGCCAGGA GCCCGCTGAT GGTGAACGCC
CTGAAACAGG CGCTCAAGGT GAGCGCGGTG AACTCGACGG TGCTGATCCT CGGGGAGTCC
GGCGTCGGCA AGGGGCTGAT AGCGGAGTTG ATACACAAGA ATTCCACCAG GGCGGACAAG
CCGCTGATTG AGATAAACTG CGGCGCGATA CCGGAGTCGC TGATCGAGTC GGAACTCTTC
GGCTATGAGA AGGGGGCCTT TACCGGCGCG CAGACTACCG GCAAACCGGG CTATCTGGAA
CTCGCGGACG GCGGCATCCT GTTTCTGGAC GAGATCGCGG AGCTGCCGCA GTCGGCGCAG
GTGAAACTGC TTCGCTTCCT CGAAAACGGG AAGGTGATCC GTTTGGGGGG GACCAAGGCC
AGGCATCTGG ATGTGCGCAT TCTCGCGGCG ACGCACCGAA ATCTTGACGA AATGGTGCGG
CAGGGGAGCT TCAGGCTGGA CCTTTATTAC CGGCTCAACG TGATCCCGAT CGGCGTCCCG
GCTTTGCGCG AGCGGCGGGA CTGCATTCTG CCGCTGGTAA GACACTACCT GGAACTTTTC
GGCGCCCGCG ACTCCATCCG CAAGCGACTG ACACGTGCCG CCTCCGATGC GCTCCTTGCC
TATGACTACC CCGGAAACGT GCGGCAGTTG ATGAACATCT GCGAGCGGCT CGTGGTCATG
GCGGAAACGG ACCTGATCGA CTTGAAGGAT CTCCCCGCCG AGATATCCGC CGGCATCGGC
AAACCTGCCG CTGTGGCCGG GGTCTGGCAG GAGGATGTGC CGCTTCAGGA GACGCTGGAT
CAGGTCGAGA AGGCCGTCCT GGAAAAGGCG CTGGCCAAGC ATCGCAACCA GACGCGCATG
GCGGAGGTTC TTGGGGTGAA CCAGTCGACC ATCGCCAGGA AACTCAGGAA ATACAAGCTG
AACGGCAATT GA
 
Protein sequence
MTEPVMNTAP SLDLEEMARQ MRAFQDLTRE LDAIIDSSSD GLWICDAEAR VIRINPASER 
INNIKASEVV GKNMRELLDE GFIDRSAALE AITTKKVVSQ LQNREGRKLI STGTPVLDAN
GEVIRVVVSE RDITEIDNLQ RELEEQEALR DQFRNHMLEL QQADVASKSV VARSPLMVNA
LKQALKVSAV NSTVLILGES GVGKGLIAEL IHKNSTRADK PLIEINCGAI PESLIESELF
GYEKGAFTGA QTTGKPGYLE LADGGILFLD EIAELPQSAQ VKLLRFLENG KVIRLGGTKA
RHLDVRILAA THRNLDEMVR QGSFRLDLYY RLNVIPIGVP ALRERRDCIL PLVRHYLELF
GARDSIRKRL TRAASDALLA YDYPGNVRQL MNICERLVVM AETDLIDLKD LPAEISAGIG
KPAAVAGVWQ EDVPLQETLD QVEKAVLEKA LAKHRNQTRM AEVLGVNQST IARKLRKYKL
NGN
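
The gene sequence listed above is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch, not the database's own method: the helper names are hypothetical. It strips the formatting, computes the GC fraction, and checks that a cleaned sequence has a whole number of codons and ends in a stop codon of translation table 11.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_like_cds(seq: str) -> bool:
    """True if seq has a whole number of codons and ends in a stop codon."""
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Usage: paste the full gene sequence, blocks and line breaks included.
raw = """ATGACTGAAC CGGTTATGAA
AACGGCAATT GA"""          # placeholder fragment, not the complete gene
seq = clean_sequence(raw)
print(len(seq), round(gc_fraction(seq), 2), ends_like_cds(seq))
```

Run on the complete 1452 bp sequence, the GC fraction can be compared with the GC content field in the Gene Information section.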