Gene GM21_1276 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1276 
Symbol 
ID8136602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1487737 
End bp1489665 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content64% 
IMG OID644868889 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003021094 
Protein GI253699905 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones110 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACCATT TTCAGCAGAC AGGTACGGAC GACCCTCTGG AACACGCGCA GCAGGAACTA 
AAGGCTGCCA GGCTTGAGCT GCAGGCGAAA AACGCGGAGC TCGAACGCGC CCGAAACGAC
CTGGAGAATT TCCTGAGCTG CGCCTGGACC GCCACCGTCT TCTTCGACCG CGAGCTCCGG
CCGACGCGCC TGACTCCGAC CCTGGCGCAG CTTTTGGATC TCTCCCCCGC CGACACCGGA
GAGCTTTTGC GCAAGCTTTG CCGCAGGCTC GACTGGCCCA GCGCGGACGC GGAACTCCGG
AGCGTGGTCG GCGGCGCCGT CGTCCCGGAG CGGGAGGTGT CGGACCTGCA GACGGGGCAC
CAGTTCCTGC TGAGGCTCTT TCCCTACCTG AACGCCGAGG GGGAGACCGA GGGGGCCGTG
CTGAAGCTGA TCGACATAAG CGACTACAAG CTGGCCGATC AGCGGCTCCT CGAATACCGC
GCGGTGTTCG AGTGCACAGG CGACATGATC TACGTGTTCG ACCGCGGCTA CCGCTTCATC
CTGGCGAACA AGGCGTACCT CGAGTGCCAC CGGGTGCGTC ACGAGGAGCT TATCGGCCGC
ACGGTACGGG ATCTTTTGGG ACCGGCGATG TTCGCCGAGA TTAAGGGGCG CATCGATGCC
TGCTTCGCCG GCGAAGAGGT GATCTTCGAG AGCCGGTACG ACTACCCCGC CAAGGGGGTA
CGGGACATCC TGATCACCTA CACGCCGCTT AAGAATGGGG ACCGGGTCGA CCGGGTCGCC
TGCCTAATCA AGGACATGAC GGAGAGGACC CAACTGGAGG AGCAGCTGCG GCACGCGCAG
AAAATGGAGG CGATCGGCAC GCTGGCCGGG GGGATAGCAC ACGACTTCAA CAACCTCTTG
ACCGTCATCG CCGGCTATGC GTCGCTGATC CAGTTCAACT CCCAGGGAAG CGAGGTCGCC
TCCATGGCGG CCGAGATCCA GGGATCGGTG GAGCGGGCGG CGGAAATGAC ACGGGGCCTC
CTTGCTTTTT CCAGGAAGCA GGAGGTGAAC TTGATGCCTG TCGACCTGAA CCAGCTTGTC
GAGGGGCTGC ACAAGAGCCT GAGGCGGCTC ATCACTGAAG ATATCGAACT GGCGGTCGAG
TTCTCCGATG CCCCGCTTAT CGTCTCCTCG GACAAGGGGC AGTTGGAGCA GGTGCTCTTC
AACCTGGTGG TGAACGCGCG CGACGCCATG CCGTCCGGGG GGAGGCTCTC GATCCGGACC
GAGCGGTCGC AGCTAAACCA CGCCCTGATC ACCGTCGCCG ACACCGGGGT CGGGATGAGC
CGCGAGGTGC AGGACCGTGC CTTCGAGCCT TTTTTCACCA CCAAGGAGCT GGGAGTAGGT
ACCGGGCTTG GGCTTTCCAC CTGCTACGGC ATCATCAAAA AGCACAACGG CGTCATAGAG
CTGCAGAGCG AGCCGGGTGC CGGGACCGTC TTCAGCATCT ACCTCCCGCT CTCCGCACAG
CAGCCGGAGG CGGCCTACAC CGGCGGCGGC GATCGATGGG AGACGGGCAA CGAGACGGTG
CTGCTGGTCG AGGACGACGA GACGGTGCGG ACGATGACTC GTCTTTTGTT GCAGCATAAC
GGCTATCGCG TGCTCTGCGC CAAGGAGGGG GAGGAGGCGC TGAATATCTT CGCCGAGCAG
GGCGACGGGG TGGACCTGCT GCTCACCGAC CTAGTCATGC CGCGCCTGAA CGGGACGGAG
CTCTGTCTGC GGATCCACGC CCAGAGGCCC GGTTTCCCCA CCATCTCCAT GAGCGGGTAC
CCGGCTGACG TCATGTCGCG GAAGGGGATC GCCGCAACGG GGAACTACCT CCCTAAGCCG
ATCAAGCCGG AGCTTTTGTT GCGCCGCATC CGCGAGACGC TCGATGCTCC CCGTGCCGCA
TCCGCCTAG
 
Protein sequence
MHHFQQTGTD DPLEHAQQEL KAARLELQAK NAELERARND LENFLSCAWT ATVFFDRELR 
PTRLTPTLAQ LLDLSPADTG ELLRKLCRRL DWPSADAELR SVVGGAVVPE REVSDLQTGH
QFLLRLFPYL NAEGETEGAV LKLIDISDYK LADQRLLEYR AVFECTGDMI YVFDRGYRFI
LANKAYLECH RVRHEELIGR TVRDLLGPAM FAEIKGRIDA CFAGEEVIFE SRYDYPAKGV
RDILITYTPL KNGDRVDRVA CLIKDMTERT QLEEQLRHAQ KMEAIGTLAG GIAHDFNNLL
TVIAGYASLI QFNSQGSEVA SMAAEIQGSV ERAAEMTRGL LAFSRKQEVN LMPVDLNQLV
EGLHKSLRRL ITEDIELAVE FSDAPLIVSS DKGQLEQVLF NLVVNARDAM PSGGRLSIRT
ERSQLNHALI TVADTGVGMS REVQDRAFEP FFTTKELGVG TGLGLSTCYG IIKKHNGVIE
LQSEPGAGTV FSIYLPLSAQ QPEAAYTGGG DRWETGNETV LLVEDDETVR TMTRLLLQHN
GYRVLCAKEG EEALNIFAEQ GDGVDLLLTD LVMPRLNGTE LCLRIHAQRP GFPTISMSGY
PADVMSRKGI AATGNYLPKP IKPELLLRRI RETLDAPRAA SA