Gene GM21_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0452 
Symbol 
ID8135761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp545432 
End bp548299 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content60% 
IMG OID644868070 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020290 
Protein GI253699101 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAAA AGTTAGTCAA AAACGGAGGG CTTGCCCTCC TCATCCTTGC CTTCACCTGT 
CTGTGCACGT CCATCTTCTA TTTCTCCTAC CAAAAGGCAA AACAATCGGC CATCAACAGG
CTGAACGACG AGCAGTTCAC ACATGCCAAG CAGGCGGCCA GGGGGATCGA AGAACACTTC
GTCACCTGGA CCGGCATCCT CACCTCGCTT GCGAAGCTCG ACTCCGTCGT CGCCATGGAT
CCCGACGGCA AGCGCCAGAT GGAATTCTTC TACGACGCCC ACAAGGACCA GATCAGATCG
TTCACCCGGA TGGACGAGAA GGGGAACATA CTCTTCACCG TTCCGGATCT GAGAATGGCG
GGAAGGAACA TCACGGGGCA AAAGCACATA CAGGAACTGA TCGCGACGCA AAAGCCTGTG
GTGAGCGACG TCTTCCGGAC CATCCAGGGG TATGACGCCA TCGCCCTTCA CGTTCCGGTC
TTCGATGGAC TCCGGTTCAG GGGTAGCATC GCCATCATCA TCAACTTCCA GAGCCTCGCC
CAGCGCTACC TCGAGGTAAT CAAGATCGGC AAGACCGGCT ACGCCTGGGT GGTAAGCCGG
GACGGCACCG AACTCTACTG CCCCGTCCAC GGCCACAGCG GCAAGAGCGT TTTCGCCACC
GCGAACGAGT TTCCGGCGGT TCTCGCCATG GTAAAGGCGA TGCTCAAAGG GGAATCGGGG
ACGGCGAGCT ACGACTTCAA CACGAGGGGC GGCGCCTCCG CCAAATCTGT AAAGAAACAT
GCCGTCTACC TTCCCATCAA CCTCGGCAAC ACCTTTTGGT CCATCGTCGT CGTTTCGTCG
GAGCAGGAGA TTCTCTCCTC GCTTTCCTCC TACCGCAACA GGCTGGCTCT GGTCTTCGGC
GGCATCCTGC TGGGTGGGAT CATCATCTGC ATCCTGGTGC TACGGGCCCT GCTGATAGTG
AGGGAGCAAG CCGTGCACAA GGAAGCGGAG GCGGAGTTGC GAGCCAGCGA GCAGAGGTAC
CGCTACCTCT TCGAGCAAAA TCCCGCCCCG ATGCTCATCT ACGAAAGGGG AACCATGCAG
ATGCTGGCGG TGAACGACGC CTTCGCCGTC GGCTACGGCT ACAGCAACGA GGAAGCGCTG
GCGCTTGGGC TCACCGACCT CTACCCCGAG GAGGAGAAAC AGAAGATCAC CGAGGTCGCC
GCCGGGCTCA GCGGCCACAC CTATGTCGGG GAATGGCACC ACCGCCGCAA GGACGGCACC
GTCTTCCCCA TCGTTGTCAC CTCCCACGAC ATGACCTACG GAGGCAGGAC CGCTCGCATC
GCGGTCATCA CGGACATAAC GGATCGTAAG GCGATGGAGA AGGCAATCGA GGAGGAGTCG
ACCTTCAACC GCCTCCTTTT AGAGCATTCG CCCGACGGCA TAGTCATCAT CGACCCGAAG
ACCGCCCGCT TCATCAACTT CAACGCCGCC GTCTGCCGGC AACTCGGGTA CTCCCGCGAG
GAGTTCGCCC AACTAAGCGT CTTCGATATT GAGGCCGTGG AGACACGGGA AGACACCCGC
CGCCGCATCG AGGGTATCGT GCGGGAGGGA CGGGGCGACT TCGAGACCAT GCAGCGGACC
AGTCAGGGAG AGCTACGAAA CGTCCAGGTG ACAGCGCAGA TCCTGACCAT CCAGGACCAA
CAGGTCTACT ACTGCATCTG GCGCGACATA ACCGAGCACA AGAAGCTGGA GGAGCAGTTA
AGGCAATCGC AGAAGATGGA ATCGGTGGGA CGGCTGGCGG GGGGAGTCGC CCACGACTTC
AACAATATGC TCGGCGTGAT CATCGGGTCC GCCGACCTCT GCCAGCACCA GGTACCGGCG
GATAGCCCGC TGCAAAAGTA TCTCGATCAC ATCCTGAAAG CGGCGAAACG GTCGAGCGAC
ATAACGCGCC AGTTGCTCGC TTTCTCCCGC AAGGAGGTAG TTTCGCCCAA GCCGGTGAAC
CTTAACAGCC TCATCATCGA CTCCGAGAAG ATGCTCTGCC GTTTGATCGG CGAGGACGTC
AAGCTCACCT TCAAACCCTC CACCGGCCTT TGGACCGTGA TGATCGACCC GGCCCAGTTC
GACCAGATAC TCATGAACCT CTCCGCCAAC TCCCGCGACG CCATGCCCGA CGGCGGCACG
CTCGACATAG CGACCGGCAA CGTTCACCTC GACGCAGGCT ACTGCCGCCA CCACTCGGAC
ACCGTTCCCG GCGACTACGT CAAGATCACC GTCTCCGACA CCGGGACGGG GATGAATCGC
GAAACCAGGG ATCACATCTT CGAGCCCTTC TTCACCACCA AGGGGGTCGG GGTAGGGACC
GGTCTCGGTC TCGCCACGGT CTACGGCATC GTCACCCAGA ACAACGGGTT CATCAACGTC
TACAGCGAGC TTGGCCAAGG GTCGGTCTTC AACATCTACC TGCCGCGCCT TTTGGAAGAT
GGCGCGACAG AGGAAGAGGC CGAGGCGGCG CCCCCCCCGA AAGGAACCGG AACCATCCTC
CTGGTCGAGG ACGAGGAGAT GCTGCTCTGG ACCACGACGA AAATCCTGGA GGAGATGGGT
TATACCGTGC AGCAGGCCGA ATCCCCGGCA AAGGCGATAG CGATCTGCGA AAACGGCAAA
CAGATAGACC TGGTGCTGAC CGATGTGGTG ATGCCCGGCA TGAACGGCCG GGAGATGGTG
GACAGGATAA GGAGCGCCAG GCCTGACATA AAGGTGCTGT TCATGTCCGG CTATACAGCG
GATATAGTGG CCCAGCGGGG AATCGTGGAA GAAGGGATGT TCTACATCTC CAAGCCGTTG
GATTCCAAAC AGTTGCACGA GAAGATCGTC CAGACGCTGG CGTCGTAG
 
Protein sequence
MIQKLVKNGG LALLILAFTC LCTSIFYFSY QKAKQSAINR LNDEQFTHAK QAARGIEEHF 
VTWTGILTSL AKLDSVVAMD PDGKRQMEFF YDAHKDQIRS FTRMDEKGNI LFTVPDLRMA
GRNITGQKHI QELIATQKPV VSDVFRTIQG YDAIALHVPV FDGLRFRGSI AIIINFQSLA
QRYLEVIKIG KTGYAWVVSR DGTELYCPVH GHSGKSVFAT ANEFPAVLAM VKAMLKGESG
TASYDFNTRG GASAKSVKKH AVYLPINLGN TFWSIVVVSS EQEILSSLSS YRNRLALVFG
GILLGGIIIC ILVLRALLIV REQAVHKEAE AELRASEQRY RYLFEQNPAP MLIYERGTMQ
MLAVNDAFAV GYGYSNEEAL ALGLTDLYPE EEKQKITEVA AGLSGHTYVG EWHHRRKDGT
VFPIVVTSHD MTYGGRTARI AVITDITDRK AMEKAIEEES TFNRLLLEHS PDGIVIIDPK
TARFINFNAA VCRQLGYSRE EFAQLSVFDI EAVETREDTR RRIEGIVREG RGDFETMQRT
SQGELRNVQV TAQILTIQDQ QVYYCIWRDI TEHKKLEEQL RQSQKMESVG RLAGGVAHDF
NNMLGVIIGS ADLCQHQVPA DSPLQKYLDH ILKAAKRSSD ITRQLLAFSR KEVVSPKPVN
LNSLIIDSEK MLCRLIGEDV KLTFKPSTGL WTVMIDPAQF DQILMNLSAN SRDAMPDGGT
LDIATGNVHL DAGYCRHHSD TVPGDYVKIT VSDTGTGMNR ETRDHIFEPF FTTKGVGVGT
GLGLATVYGI VTQNNGFINV YSELGQGSVF NIYLPRLLED GATEEEAEAA PPPKGTGTIL
LVEDEEMLLW TTTKILEEMG YTVQQAESPA KAIAICENGK QIDLVLTDVV MPGMNGREMV
DRIRSARPDI KVLFMSGYTA DIVAQRGIVE EGMFYISKPL DSKQLHEKIV QTLAS