Gene GM21_0257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0257 
Symbol 
ID8135564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp308336 
End bp311662 
Gene Length3327 bp 
Protein Length1108 aa 
Translation table11 
GC content62% 
IMG OID644867878 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020100 
Protein GI253698911 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00000213837 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTCGTCA CCAGCATGAA AACCAAAATG ACACTAGCCG TCTTCCTGCT GGTCGGCTTC 
CTGATGTCGG TGACGGCCGT GGGCTCGGTT TTGTACTACG AGCAAAGATT CAAACGCAAC
ATCTCCCTGC AGCAGTCCAC GCTCCTCGAT TCCCTGGCCT ATCAGGTCGA CAACCGCATC
TCCGACTTCA TGTCGGACCT GGACCAGCTC GCCGCCGGCA TAACCCCGGA ACTGCTGGCC
AGCCACGAGC AGGCGCAGCA ACTGCTCGTG GCGCATAAAA AGCAGCGCGC CTTCTTCGAC
AATTCCCTCT TCCTCTTCTC TAGCGAAGGG CGGCTCGTCG CCGCCAACCC GGAGGATCTC
AACTTCATCG GCAAGGACTT TTCCTTCCGG GACTACTTCA AGCAGACCAT GCAGACGGGG
AAGATGTACG TCTCCGCGCC TTTCGCCTCG ATCCAGAAGC AAAGCCACCC CATCATCATG
TTCAGCACGC CGGTGTACGA CTCAAAGCGC CGGATCATCG GCATGCTGGG GGGGAGCATC
GACTTGCGCC GGCAGCGGTA CCTGCAGTCG CTGGCCGCGC TGAAACTCGG GCAAAAGGGT
TTCCTCTCCC TCTACGACAG CAACCGTTAC GTGCTGATGC ACCCGAACCC CAAGCGGGTG
CTGCGCCGAG ACCCCCCCGG AACAAACGAC ATGGTGGACC GGGCGATCGC AGGGTACGAG
GGGACCAGGG AGAGCGCCAC CGTGGCCGAA CAGCCGGTGC TGCGCTCGGT GAAACGGTTG
AAGTCCACCA ACTGGATCAT GGCAGTCAGC TACCCGTTGA GCGAGGTGTA CGGGCCGCTT
TACACGGCGA GGAACTACTT TTTGGCCGGC TCTCTCGCCG CGTCGCTTCT CTCCATCCTG
ATCGTCTGGC CCTTCATGGG GTACCTGACC CTGCCCCTTT TGTCCTTCAC CCGCCACCTG
GAGCAGCTCC CCTCGCTTAA GGGCTCGCAG CGGCTCGCGC CGGTCACCTC CGAGGACGAG
ATCGGGCAGA TGGCGCGAAC CTTCAACAAG ATGCTTCTCG AACAGGAGCG CTCCCGTGAC
TTCTACCTGA CGCTGTTCGA GAACTTCCCC GCCCTGGTCT GGCGCGCGGG AACCGACGGC
AAGTGCGACT ACTTCAACCA GACCTGGCTG GAGTTCACCG GGCGGACACT GGAAGAGGAA
CAAGGAGACG GCTGGCTGCA AGGGGTGCAT CCGGAAGAGC TCGATTTCTG CAGCCGCACC
TACCGCGACG CCGTCGAAAA TCGCAGGCCG TTCCAGATGG AATACCGGCT GCGGCACAGA
AGCGGCGGCT ACCGCTGGAT CACCGACATG GGGCGACCTT TCTACGGCCT CGACGGCGAG
TTCGCCGGGT ACATCGGCAC CTGCTATGAC GTGACCGAAC GGCAGGAGGC GGCCGACAAG
ATCCTCAAGC TGTCGCGGGT GATCGAACAA AGCCCGAACG CGGTGCTGAT AGCGACGCTC
GACGGCACCA TCGAATACGT CAACCCCAAG ACGGTGGAGG CCACCGGATA TGAAGTCGCG
GAGCTGGTGG GGAGCGACAT GAAACGGCTC CTGCCGCACG AGGCCCAGCT CAACTTCAAG
CTGGCGTTGC GCGAACTGGT GCGGCGAGGC AAGGAATGGC GCGGTGAAAT CCCGGCCCGC
AGGAAAGACG GCGCCGTCTT TTGGGAGCAG GTGACCCTTT CCCCCATCAA GACCATGGAG
GGCAAGGCCA CGCACCTCGT CTGCATCAAG GAGGACATCA CCCAGCGCCG GCAGTTCGAG
TTATCGCTCA AGGAGTCGGA AGAGCGCTAC CGCCTGCTGT TCGAGAACAA CCCGCACCCG
ATGTGGGCCT ACGATCTGAA GGCCCTCTCC TTCCTGGCGG TGAACAACGC GGCGCTTAGG
CACTACGGGT ACTCCATGCA GGAGTTCCTG GAGATGAAAC TGAACGACCT GTCGGCTAAG
ACAGCGGAGG GGTTGGAACT GCTCCCCCAG GGGGAAACAA GGCCGGGGGG TGCCCCGCTT
AAGCGACACG TGAAAAAGGA CGGGAGCGTC ATCTTGGTGG AGACCATCTC CCAGGTGATG
AAATTCGCCG GGGTCGACGC CGAGATCGTC ATGGTGCACG ACGTAACCGA GAAGCTCCGG
GCGGAGCAGG AGAAACTGGC GCTGGAGCAC CAGCTCACCC AGTCCCAGAA GATGGAGGCC
ATCGGCACCC TGACCGGCGG CATAGCGCAC GACTTCAACA ACATCCTCAC CGCCATCATC
GGCTACACCA CCATGCTGCA TCTGGAACTG GACGAGGCGC ACCCGCTGCA GCGCAAGGTA
GGCGAGATCC TGCGGGCCTC CGAGCGCGCG GCGAGCCTGA CCCGGAGCCT TTTGGCCTAC
AGCAGGAGGC AGGCAGGCAA CCCCGCGCCC ACCGGGCTGA ACGCCATCGT CAACAACGTC
GACATCCTCT TGCAGCGCCT GATCCCTGAG AACATCGAGC TTAAGTCGCA GCTTGCGCCG
GAGGAACTCT CCATCATGGC GGACTCCGGT CAGATCGAGC AGGTGATCAT GAACCTGGTG
GTGAACGCCC GGGACGCCAT GCCGGAAGGG GGGGTGCTGA GCCTGTCGAC TGAGAGCATG
ACGCTGGACC GGGAGTTCGT GGCCAAGCAC GGCTACGGCC GCCCGGGGAG CTACGCCCTT
TTGTCGGTCG CCGACTCCGG GGTCGGCATG GACGAAAGGA CCAGGGACAG GATCTTCGAA
CCCTTTTTCA CCACCAAGGC GCCGGGCAAG GGGACGGGGC TCGGGCTGGC CATGGTGTAC
GGGATAGTGA AACAGCACGG CGGCTTCATC AACTGCTACA GCGAACCGGG GCACGGCACC
GTTTTCAGGA TCTACCTGCC GCGCATCGAT GCGCCCGCCG AGATAGAGGC GGCCCAGGCG
GCACAGGAGC TCAGGGGTGG GGACGAGACC ATCCTGCTGG TCGAGGACGA CGCGGTGATC
AGGGAGATGG TGGGAGAGCT CCTGGAGGAA TTCGGCTACC GGGTGATCAA GGCGGTGGAC
GGGGAAGATG CGGTGCGAAC GTTCCGTGGG GCAGCCGCGG AGGTGCAGTT GGTGATCCTG
GACGTGATCA TGCCCAAGCG AAACGGCAAG GAGGCGTACG AGGAGATATC CCGGATCAGG
CCGGGCGTCA AGGCTCTGTT CATGAGCGGT TACACGGCCG ACATCATCAC CGACAGCCTG
ATCAAAGGTG ACCCGCGCCA CTTCGTGTCC AAGCCGATCA GCATCAACGA GTTGCTGGGC
AAGGTGAGGG ACTTGTTAGA CAGATAA
 
Protein sequence
MLVTSMKTKM TLAVFLLVGF LMSVTAVGSV LYYEQRFKRN ISLQQSTLLD SLAYQVDNRI 
SDFMSDLDQL AAGITPELLA SHEQAQQLLV AHKKQRAFFD NSLFLFSSEG RLVAANPEDL
NFIGKDFSFR DYFKQTMQTG KMYVSAPFAS IQKQSHPIIM FSTPVYDSKR RIIGMLGGSI
DLRRQRYLQS LAALKLGQKG FLSLYDSNRY VLMHPNPKRV LRRDPPGTND MVDRAIAGYE
GTRESATVAE QPVLRSVKRL KSTNWIMAVS YPLSEVYGPL YTARNYFLAG SLAASLLSIL
IVWPFMGYLT LPLLSFTRHL EQLPSLKGSQ RLAPVTSEDE IGQMARTFNK MLLEQERSRD
FYLTLFENFP ALVWRAGTDG KCDYFNQTWL EFTGRTLEEE QGDGWLQGVH PEELDFCSRT
YRDAVENRRP FQMEYRLRHR SGGYRWITDM GRPFYGLDGE FAGYIGTCYD VTERQEAADK
ILKLSRVIEQ SPNAVLIATL DGTIEYVNPK TVEATGYEVA ELVGSDMKRL LPHEAQLNFK
LALRELVRRG KEWRGEIPAR RKDGAVFWEQ VTLSPIKTME GKATHLVCIK EDITQRRQFE
LSLKESEERY RLLFENNPHP MWAYDLKALS FLAVNNAALR HYGYSMQEFL EMKLNDLSAK
TAEGLELLPQ GETRPGGAPL KRHVKKDGSV ILVETISQVM KFAGVDAEIV MVHDVTEKLR
AEQEKLALEH QLTQSQKMEA IGTLTGGIAH DFNNILTAII GYTTMLHLEL DEAHPLQRKV
GEILRASERA ASLTRSLLAY SRRQAGNPAP TGLNAIVNNV DILLQRLIPE NIELKSQLAP
EELSIMADSG QIEQVIMNLV VNARDAMPEG GVLSLSTESM TLDREFVAKH GYGRPGSYAL
LSVADSGVGM DERTRDRIFE PFFTTKAPGK GTGLGLAMVY GIVKQHGGFI NCYSEPGHGT
VFRIYLPRID APAEIEAAQA AQELRGGDET ILLVEDDAVI REMVGELLEE FGYRVIKAVD
GEDAVRTFRG AAAEVQLVIL DVIMPKRNGK EAYEEISRIR PGVKALFMSG YTADIITDSL
IKGDPRHFVS KPISINELLG KVRDLLDR