Gene GM21_4132 details


Gene Information

Locus tag: GM21_4132
Symbol:
ID: 8139506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4722972
End bp: 4724831
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 61%
IMG OID: 644871747
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003023905
Protein GI: 253702716
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
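
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the gene length counts the stop codon; the variable names are illustrative only and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4722972
end_bp = 4724831
gene_length_bp = 1860
protein_length_aa = 619

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print("span matches gene length:", span == gene_length_bp)
print("codons minus stop matches protein length:",
      remainder == 0 and codons - 1 == protein_length_aa)
```

Under those assumptions both checks pass for this record: the span is 1860 bp, which is 620 codons, or 619 residues plus a stop.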


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACC GGATTCCCAC AGACAAAGAG ACCCTGGAGC AGCAGTTGGC TGCCCTGCGG 
AGCGAGAACG AGGAACTGGC CCAACAGGTC AAACGGCTGA TCCGGGCGGA AGGAAAGCTC
TACGAGTACC AGCAGGTGCT CGACCTCCAG TTGATCGAAT ACAAGGGGCT TTATGACCTG
AGCCGGAGGC TGAGCGGGAG CTTCGACATC CAGACCCTGT TCCGCGATAC AGTGCAGTAC
GTGGTCCAGC AGCTCGAATA CGAGCGCGCC ATCCTGCTCC GCCGCGAGGA ATCCTTTACC
TATAGCGTCT TCGCCCTGGA CGGCTACTAT GATCCCAGCG AAAAGGAGCA GGCCGCCCTT
ATCACCATGC GGTACGGCGC CCCCTGCCTC TCCCCCCTTC TTGCCGGCAG AGAGCACGTC
ACCTGCAGCG CAACCTCGAC GGAACCCGGA AACGGATGCC GGCGGCGCCT CCTGATGGAC
GAGTTCCTGG TATACCCCCT GGGCCACGAC GAGATACCCC ATGCCCTGCT GGTGGTAGGG
AACACCTCGG CCAACGCCCC GTTTCACCGG CGGGTGGAGG AGAGCGACCA AGCTCTCTTG
AGCATGGGCA ATCTGGTCGG CCTCGTCTCC TCTTTATTGG ACACCCACAT ATTCTTCGAG
CGGATGATAG AAGCACGCGA GCAGGAACGT GTCGCCGAGG CGAAGTACCG CAGCCTTTTC
GAGAACGCGG CGGAGGGCAT CTTCCGCAGG ACCCCCGAGG GAAAGTACCT GGACGCTAAC
CCCGCCCTGG CGCATATGCT GGGCTATGCC TCGCCCGAGG AACTGGTCGC CTCCGTCACC
GACATCGGCT CCCAGGTCTA CGTGAATCCC GCCTCATATG CCGAGATGCA GAGGGTGCTG
TCGGCGCACG GCAAGGCCGA GGGATTCGAG ACGCAGGTCT ACCGCAAGGA TGGCAGCGTC
ATCTGGGTAT CCCTTAGCCT GCGCGCGGTG CGCGACAGCT ATGGGAAGGT CCTCTTCTAC
GAGGGGATGT CCGAGGAGAT CACCAAGCGC AAGATCGCGG AGGCAGCCCT GCGCGAGAGC
GAACAGAAGT ACCGCCAGTT GAGCGAGGCG CTGGAGCGGC GCGTGAAACA GGCGGTCGAC
GAGCTACGCC AGAAGGACAA GATGCTTATC ATGCAGGGGC GGCAGGCTGT GATGGGGGAA
ATGCTGAGCA ATATCGCCCA CCAGTGGCGC CAACCGTTGA ACATGCTTGC CTTGCTGGTC
CAGGACGTCC AACTGACCCA CAGGCAATCC GGGCTCAGCG ACGACTTCAT CGAGCGGAAC
GTCAAAAGGA GCATGGAGAT CATCCAGCAG ATGTCCCGGA CCATCGACGA TTTCAGGTAT
TTCTACCGCC CCGACCGGGA AAAGCTGGAG TTCGCGGTGA GCGAGCCGCT GGAAAAAGCG
CTGGGATTGT TGGAGGGGAG CTTCAGGACC AACAGCATCG AGATCCAGGT CCTCAAAAGC
GGCGAACCCG CCATCAGGGG GTACCTCGGA GAGTTCGTGC AGGTACTGCT AAACATCCTG
ATCAACGCGC GCGACGCGCT CATCGCCAGC CACGCCGCTT CGCCGCTCAT CACCGTCAGG
CTCCACGAAG AAGGAGGGGA AACGGTGGTG AGCATCGCGG ACAACGCCGG AGGCATCCCC
GACGGGATCA AGGAGAAGAT CTTCGAACCC TACTTCACCA CCAAAGGGCC CGACCAGGGA
ACCGGCATCG GCCTTTTCAT GTGCAAGACC ATCATCGAGA AGAGCATGAA CGGCAGGCTA
ATCGCCAGAA ACAGCGGCGA GGGAGCCGAG TTCGTCATCA CCGTCCCCAA AACTCCCTGA
 
Protein sequence
MKNRIPTDKE TLEQQLAALR SENEELAQQV KRLIRAEGKL YEYQQVLDLQ LIEYKGLYDL 
SRRLSGSFDI QTLFRDTVQY VVQQLEYERA ILLRREESFT YSVFALDGYY DPSEKEQAAL
ITMRYGAPCL SPLLAGREHV TCSATSTEPG NGCRRRLLMD EFLVYPLGHD EIPHALLVVG
NTSANAPFHR RVEESDQALL SMGNLVGLVS SLLDTHIFFE RMIEAREQER VAEAKYRSLF
ENAAEGIFRR TPEGKYLDAN PALAHMLGYA SPEELVASVT DIGSQVYVNP ASYAEMQRVL
SAHGKAEGFE TQVYRKDGSV IWVSLSLRAV RDSYGKVLFY EGMSEEITKR KIAEAALRES
EQKYRQLSEA LERRVKQAVD ELRQKDKMLI MQGRQAVMGE MLSNIAHQWR QPLNMLALLV
QDVQLTHRQS GLSDDFIERN VKRSMEIIQQ MSRTIDDFRY FYRPDREKLE FAVSEPLEKA
LGLLEGSFRT NSIEIQVLKS GEPAIRGYLG EFVQVLLNIL INARDALIAS HAASPLITVR
LHEEGGETVV SIADNAGGIP DGIKEKIFEP YFTTKGPDQG TGIGLFMCKT IIEKSMNGRL
IARNSGEGAE FVITVPKTP
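
The gene and protein listings above are printed in blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: the function names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 60 bases of the gene are embedded to keep the example short. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Codon -> amino acid, built in the conventional TCAG ordering.
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and the first 20 residues of the protein.
gene_prefix = clean(
    "ATGAAAAACC GGATTCCCAC AGACAAAGAG ACCCTGGAGC AGCAGTTGGC TGCCCTGCGG")
protein_prefix = clean("MKNRIPTDKE TLEQQLAALR")

print("prefix translation matches:", translate(gene_prefix) == protein_prefix)
```

Pasting the full gene listing into `clean` and passing the result to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.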