Gene GM21_1825 details


Gene Information

Locus tag: GM21_1825
Symbol:
ID: 8137156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2123721
End bp: 2125649
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 61%
IMG OID: 644869436
Product: putative PAS/PAC sensor protein
Protein accession: YP_003021636
Protein GI: 253700447
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID: [TIGR00229] PAS domain S-box
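
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2123721
end_bp = 2125649

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1929 bp
print(protein_length, "aa")   # 642 aa
```

Under those assumptions the computed values match the reported Gene Length and Protein Length.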


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.000986123
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTAAAAC CGCTTCGCGT GCTGATAGTG GAAGACTCGG AGGACGACGC GCTGCTGCTC 
GTCTTCGAGC TGCGTCGCGG CAACTATGCT CCGGTAACCA TGCGCGTGGA GAACTCCGAC
TCTTTGCGCA AGGCGCTCAA GGAGGAAACC TGGGACCTGG TCATCTCCGA CTACGTCCTT
CCCGGCTTCT CGGGGTTGGA CGCGCTGAAG CTGGTGCGTG CCTCGGGGAT GGACCTTCCC
TTCATCATCG TTTCCGGGAA AATCGGCGAG GAAGACGCCG TAAAGGCCAT GAAGGAGGGG
GCGAACGATT ACCTGATCAA GGGTAACACC TCGCGCCTCA TCCCTGCCAT CGAGCGGGAG
ATGCAGGAGG CCGAGGTGAG GCGCAAGCGG CGCGAGGCCG AGTCGGCGCT GGTGAGAAGC
GAGCGGCGCT ACAAGAGGCT GGTGTCGGCG GTCACCGATT ACATCTATAC GGTGATCATC
CATAAAGGAG CGGTTGTTAA AACCTCGCAC GGACCGGGAT GTCTCTCGGT GACCGGCTAC
AGCCACGAGG AGTACGTCGA CAACCCTTTT TTGTGGTACC AGATGATCTA CGAGGAGGAC
CGGAGCGCGG TGACGAGCCT CACCGAGGAC CTGCGGGCGG GAAAAGACAT CCCTTCGCTG
GAGCACCGGA TCCGGCACAA AGACGGCTCG CTGCGCTGGG TCATCAACAC CATCGTCCCA
CGCTACAGCG AACAGGGCGA ACTGATCGCC TACGACGGCC TCATCTCGGA CATCTCGGAG
CGAAAGCGCG CCGAGGAGTC GCTGCAGCTT CAAAGCGCGG CGTTGGAGGC GGCGGCCAAC
GCCATAGTCA TCACGGACAG CAGCGGGGTC ATCATCTCGG TAAACGAGGC TTTCACCGGC
ATGACCGGGT ACGGCCGCGA GGAGGCCCTA GGGCGCGATC TGAGTTTCCT GAAATCGGAG
CGGCACACTT CGGAGTTTTA CCGCTGCCTC AGGGAAACCA TCAGCGCCGG CGAGGTCTGG
CACGGAGAGA TGATCAACCG GCGCAAGGAC GGCACCCTCT ACCCCGAGGA GCAGACCATC
ACGCCGGTTT TGGACGAAGA CGGGCGCATC AACCACTTCA TCTGCATCAA GCAGGACATC
ACCGAGCGCA AGCAGGCAGA GCAGGCGCTG ATGCAAAACG CCAGCATGCT GAAGGAAATG
GAGATCGCGA AGCAGATCCA GATGTCTTTG CTTCCGGTCA CCCCCCCCTG CCTCCCGGGC
ATCGACTGCG CCGGGAACTG CTCCCCCGCC AACAACATCG GCGGCGACTA CTACGACATC
CTCCCGCACG GAGAAGAACT CGACCTCGTC ATCGCCGACG TCTCGGGCCA CAGCGTCGGC
GCCGCGCTGA TCATGGTCGA AACCCGCAGC GTACTTCGCG CGCAGCTTGC GACCTTGAAG
GGGCCGGCCG AGATTGTGTC GGCCCTGAAC GAACTGCTGC ACGAGGACCT GAGTCGGGCC
GAACTCTTCA TCACCATGTC GTACCTGAGT TACCACATCC CCACCGGCAC GCTCCGCTAC
ACCAACGCCG GCCATCCCCC TCCCCTGCTC TACCGCCACC AAACCGACCA GTTTTTCGAG
TTGGACGCCG AAGGGCTCAT CCTGGGGGTA CACCGGGAGG TGTTCTTCCA GGAGCCCTCG
CTCCAGGTCC GGGACGGGGA CCTGCTGCTT CTCTATACCG ACGGCATCAC TGAAGCGGAA
AACCGCGAGG GCGGGTTCTT CGGCATTGAT AGATTGCGGC AAGTAGTGGC GCGCGAGCAT
ATGAAACCGG CAGCGGGAGT GATCGCCGCA GTCATGGAAT CCGTCAGGAC CTTCACCGGC
TGCGACGCCT TCAACGACGA CATCTCTATG CTTCTTATCA AATTCGTCCC CGTAACCCCC
CATTCTTAG
 
Protein sequence
MVKPLRVLIV EDSEDDALLL VFELRRGNYA PVTMRVENSD SLRKALKEET WDLVISDYVL 
PGFSGLDALK LVRASGMDLP FIIVSGKIGE EDAVKAMKEG ANDYLIKGNT SRLIPAIERE
MQEAEVRRKR REAESALVRS ERRYKRLVSA VTDYIYTVII HKGAVVKTSH GPGCLSVTGY
SHEEYVDNPF LWYQMIYEED RSAVTSLTED LRAGKDIPSL EHRIRHKDGS LRWVINTIVP
RYSEQGELIA YDGLISDISE RKRAEESLQL QSAALEAAAN AIVITDSSGV IISVNEAFTG
MTGYGREEAL GRDLSFLKSE RHTSEFYRCL RETISAGEVW HGEMINRRKD GTLYPEEQTI
TPVLDEDGRI NHFICIKQDI TERKQAEQAL MQNASMLKEM EIAKQIQMSL LPVTPPCLPG
IDCAGNCSPA NNIGGDYYDI LPHGEELDLV IADVSGHSVG AALIMVETRS VLRAQLATLK
GPAEIVSALN ELLHEDLSRA ELFITMSYLS YHIPTGTLRY TNAGHPPPLL YRHQTDQFFE
LDAEGLILGV HREVFFQEPS LQVRDGDLLL LYTDGITEAE NREGGFFGID RLRQVVAREH
MKPAAGVIAA VMESVRTFTG CDAFNDDISM LLIKFVPVTP HS
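
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal, dependency-free illustration and not the tool used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in which codons may act as start codons), and since this gene begins with ATG, no start-codon handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table for translation table 11 (same codon-to-amino-acid
# assignments as the standard code), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna, to_stop=True):
    """Translate a DNA string; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
first_bases = "ATGGTAAAAC CGCTTCGCGT GCTGATAGTG"
last_bases = "CATTCTTAG"

print(translate(first_bases))  # MVKPLRVLIV
print(translate(last_bases))   # HS
```

The two fragments translate to the first ten and last two residues of the protein sequence shown above; passing the full gene sequence to `translate` works the same way.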