Gene GM21_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0533 
Symbol 
ID8135844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp653380 
End bp656373 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content58% 
IMG OID644868150 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020369 
Protein GI253699180 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones110 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACGTC TTTCGTCTCT CTCAATCCGG TCACTTTTGC TGCTGATAAC ATGCGTTGTG 
GCATTGCCCG CGGCGGTGAT AATCCTGTAC TCGGGAATTG AATTTCGCAA TACCATGCTT
GGAGAAGCCA GAAAAGAGAC AGTCAAATTT GCGGAGACTA TAGTAAACGA GCAGCGGAAT
CTCGTTGTGG CCGCCGAACA GTTGATGACG GCATTGGCGC AACTTCCCGA GGTGAAGTCG
CGCGACAGCG CGAAGGTAGA GTCGATTCTC AAAGAGTTGC TTAAGTTGAA CCCGATGTAC
GCAAACATCA CCATTGCCGA CTGCGAAGGC AAAGTCTGGG GCACCGCAGT CCCGACAACC
GTACCCCTGA ACATCTCCGA CCGCTATTTC TTCAAGAGCG CCCTCGCAAC CGGCAAACTC
TCCTCCGGCG AATACATCGT CAGCCGTATC ACCACCAAAC CCGCCTTCAG CTTGGGCTAT
CCGGTCAGAG ACGACGGCGG CGCAATCATC GGGGTTATCG GCGTCGCCTT CAATCTCGAG
AACTACCGGG ATCTGTTGCA GCAGATGCGG CTGCCGTCCG GTTCCAGTTT CACCATCATC
GATCATCGCG GCACCATCCT TTCCAGGGGA TTGACCCAGG GGAACTTTGC AGGGAAGGCT
TACACGGTGG ATTCTTTTCG CAAAATGGTG GAGGGGCCGG ACGAAGGGGT AAGCATCAGG
AAGGGACTGG CAGGCGACAC GAGGATCATC GCCTACCGAA AACTGTACCT CCCCGGGGAA
AAGACCCCGT ACCTGTACGT CACCGCAGGT ATCCCCGTGG ACGCGGCAAC CCATAAAGCC
AACCGCGCGC TCGTCCTGAG CGCCTTGCTG TTATCGTCAT TTCTGGCCCT TGCCTGCCTT
TGCGCGGTGC TTATCGGCAA ACGCTGCATC GCGGACCGCT TACAGCTTCT GGAAGACGCC
TCTAGGCGGG TCGCGACGGG GGACCTGCGC ATCCGCGTCT CCGAGGCGGT GACAGGAGGC
GAACTCGGCA GTCTCGCCCA GACTCTGGAC GGCATGGCTG ATCAGTTACG TACCAGGACG
GATGCCCTGG CTCACAGCAA GATGTTTATG AACACGATCA TCGAAACGGA ACCGGAATGC
GTCAAACTCC TCGACAAAGA GGGGAGGGTG CAGATGATGA ACAGCGCCGG CCTGAAGATG
ATAGAGGCCG ACTCGCTGTC CCAGGTCCAG GGGCAGTGCG TTTACCCGCT GATAGCTCCC
GAACACCGGG ACGCGTTCAT TCAATTGACC CGGCGTGTCT TTGAGGGCGT TGCCGGCAAC
CTGGTCTTCG AGGTGATCGG CCTCAAGGGG GGGCATGTCT GGCTCGACAG TCATGCCGTG
CCTTTCCGTA ACGAACGGGG GGAGATTGTG TCGCTCCTTT CCATCACCCG GGACGTCACG
GGGCTCAGGA AATCGGAGGA GGAACGCCGG GAGAATCTGC TGCTGTTCGA GTCTCTTATG
CGGCACTCGC CGATGGGAAT CCGCATCTTT GACGGCGTCT CCGGCAAATG CATCCTGCTT
AACCAAGCCA CCGCCGATAT TGCGGGTGGC GACATGAAAA CGATGCAGGA GCAGAGCTTC
CGGGAATTAA AGTCCTGGCG GGAAAGCGAC CTGCTCGCCG CTGCGGAAAA GGTGCTAGCT
GACGGCGTGG TTCGGGTAGT CGAGGCGGAT ATCCGCACGA GCTTCGGAAA ATCAGTTGTG
ATGTCCTACA TCCTGTCCAG GCTCCTCATC AAGGACAAGC AGCATCTTCT CGTCGTCGGC
CGGGACGTCA CTGACGAAAA ACGGCTGACT GAAGAGAAGA AAAAAATGGA AGCGCAGTTG
CTGCATGTCC AGAAGCTTGA GAGTCTGGGG GTGCTTGCCG GCGGCATCGC CCACGACTTC
AACAACATCT TGATGTCGGT CATGGGGAAC GCGGAGTTGG CGCTTTTGAC TCTTCCCCCC
GAATCTCCCG CCCGAACCAA CCTGCGGAAC ATCGAGATCT CCTCGCAGCG CGCGGCTGAC
TTGGCCAGGC AGATGCTGGC CTATTCAGGC AAAGGGAATT TCGTCATAGA GGAGATCGAT
GTCAACAAGC TGATAAACGA GATGAACCAC ATGCTGGAGG TCTCCATCTC CAAAAAGGTG
GATGTTCGAT TCAATCTCGA CAGTGGGCTG CCGCTGGTGT CGGTCGACGC GACCCAGATC
CGGCAGGTCA TCATGAACCT GGTGATCAAC GCTTCCGAAG CGATAGGCGA CCGCAGCGGA
GTGATATCGA TCTCCACCGG CGCCATGGAA TGCGATGCGG CGTTTCTCTC CAAGTTGTGG
TTGAACGACG CGCTTAGGGA AGGAACCTAC CTCTACTTTG AGGTTGCCGA TGACGGGTGC
GGCATGGATC CGGCCACCTT GGCCAAGATC TTTGATCCTT TCTTCACCAC CAAGTTCACG
GGGCGGGGCC TCGGTATGGC CGCGGTCCTC GGCATCATAC GGGGGCACAA AGGGACTATC
GAGGTTCACA GCGAGCCGGG CAAAGGCTCG AGATTCACCG TATTTTTGCC TGCTCTTCCT
CCAGGCTCCG CACGCCCGGC GCAGGAGGCG GAGGCGGCTC AGCTGTCGCC TGGTTCCGGC
ACCGTACTGC TGGTCGACGA CGAGGAGACC ATCCGCAACC TCGGCAACGA GATGCTCCGG
ATCTTGGGAT ACCGCGTGCT CACCGCTGAA GACGGGGTGG TCGCTGTCGA GCTTTTCAAG
GAGCACCGCG GCGACATCAC CTGCGTCATC CTTGACCAGA CCATGCCGAA CCTGGACGGG
GAGCAGACCT TCCGCATCCT GCGCAGCATA GACCCGTCGA TCAAGGTGAT CATGTCCAGT
GGTTTCAGTG AACAGGACAT CGCCGAGAGG TTTACCGGAA GAGGCCTGGC CGGTTTCATA
CAGAAGCCGT ACAAGCTTGC CAGCTTAAGC CGGAAGCTTC AGGAACTGGG GTAA
 
Protein sequence
MLRLSSLSIR SLLLLITCVV ALPAAVIILY SGIEFRNTML GEARKETVKF AETIVNEQRN 
LVVAAEQLMT ALAQLPEVKS RDSAKVESIL KELLKLNPMY ANITIADCEG KVWGTAVPTT
VPLNISDRYF FKSALATGKL SSGEYIVSRI TTKPAFSLGY PVRDDGGAII GVIGVAFNLE
NYRDLLQQMR LPSGSSFTII DHRGTILSRG LTQGNFAGKA YTVDSFRKMV EGPDEGVSIR
KGLAGDTRII AYRKLYLPGE KTPYLYVTAG IPVDAATHKA NRALVLSALL LSSFLALACL
CAVLIGKRCI ADRLQLLEDA SRRVATGDLR IRVSEAVTGG ELGSLAQTLD GMADQLRTRT
DALAHSKMFM NTIIETEPEC VKLLDKEGRV QMMNSAGLKM IEADSLSQVQ GQCVYPLIAP
EHRDAFIQLT RRVFEGVAGN LVFEVIGLKG GHVWLDSHAV PFRNERGEIV SLLSITRDVT
GLRKSEEERR ENLLLFESLM RHSPMGIRIF DGVSGKCILL NQATADIAGG DMKTMQEQSF
RELKSWRESD LLAAAEKVLA DGVVRVVEAD IRTSFGKSVV MSYILSRLLI KDKQHLLVVG
RDVTDEKRLT EEKKKMEAQL LHVQKLESLG VLAGGIAHDF NNILMSVMGN AELALLTLPP
ESPARTNLRN IEISSQRAAD LARQMLAYSG KGNFVIEEID VNKLINEMNH MLEVSISKKV
DVRFNLDSGL PLVSVDATQI RQVIMNLVIN ASEAIGDRSG VISISTGAME CDAAFLSKLW
LNDALREGTY LYFEVADDGC GMDPATLAKI FDPFFTTKFT GRGLGMAAVL GIIRGHKGTI
EVHSEPGKGS RFTVFLPALP PGSARPAQEA EAAQLSPGSG TVLLVDDEET IRNLGNEMLR
ILGYRVLTAE DGVVAVELFK EHRGDITCVI LDQTMPNLDG EQTFRILRSI DPSIKVIMSS
GFSEQDIAER FTGRGLAGFI QKPYKLASLS RKLQELG