Gene GM21_0550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0550 
Symbol 
ID8135861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp673306 
End bp674928 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content63% 
IMG OID644868163 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020382 
Protein GI253699193 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0519596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACCG ACGACACCAT CGCCATCCCC CAGCTCCTTT ACGAGGCGGT CAGCCGCGGC 
AAGCGCGAGT GGGAACAGAC CTTCGACTCC ATCGGCGACC TGATCTTCAT CACCGATACC
AACCACACCA TTTCGCGGGC CAACCGCGCC ATGGCCAGGC ATTGCGGCCT GCGTCCGGAG
GAGCTCCCGG GACGAAAGTG CTACGACCTC TTCCACGACC TGGCGTCTCC CCCCCCTTAC
TGCCCGCTCC GGAGCCTGAA AGAGGGGGGG GCGCCACAGG CGGAGGAAGT CGAGGTCGCC
AAGTTCCGCG GCTTCTTCGA CATCTCGGTA TCACCTTTGT ACAACGATGA GGGAACACTC
GCCGCCTGCG TCCACGTCGC CCGCGACGTC ACCGAGCGCA AGAGGGCCCA GGAGTACCGG
CTGGAGTTGG AGCAGCAACT GCTGCAGTCC CAAAAGCTCG AGAGCCTTGG TGTTCTCACC
GGCGGCATCG CCCACGACTT CAACAACATC CTGATGATAA TCCTCGGGCA CTGCATGCTC
GCCAAGGAGA ACCAGGCCGT CGCCCCGGTC GTCGGCCACC TGGATCAGAT CGAGTCCGCC
GGAAACCGTG CCGCCGACCT TTGCCGCCAG ATGCTGGCCT ACGCCGGCAA GACGCCGCTG
GTCCAAACTC AGATCCACCT CCCCGCGCTG GTGCGCGACA TGGTGCATAT GCTTCAACCC
GCGTTCAACA AGAAAGTGAT CATTGAATGC GACCTCGACG GGGACCTGCC CAACCTGACC
TGCGACGAGG GGAAGATCCA GCAGATCGTG ATGAACCTGG TGGTGAACGC AGCCGAATCG
CTTGGAGAGC GGGGGGGGAA CGTCAAGGTG ACCCTCCGGC ACAAGACGGT GCTGCAGTCG
GAGCAGGAGG TCGACTGCTT CGGCAACTCC ATACCCCCTG GAACCTATCT ATGCCTGGAG
GTCGCCGACA CCGGATGCGG CATGGACCAG GAGACCCGGA AGCGGATCTT CGAGCCGTTT
TTCACCACCA AGTTCACCGG ACGGGGACTT GGGCTTTCCG CGATCAGCGG CATCATCAAG
TCCCATGAAG GCGCGCTGCA GCTCTGCAGC GCCCCCGGCG CGGGGACCAC TTTCAGCGTC
TATTTCCCCC TCCCCCCATG CTGCCCCGCC GGCGACCAAG TCGCGCCGCC CCTCCCCTCG
CCCTCCAAAG CGGCCGCAAG GCTCGAAGGC ACCATCCTGC TGGTAGACGA CGAGGAGGAA
CTTCGTGCCG TCGGCTGTGA ACTTCTCACC AGCATGGGGT TCAAGGTGAT TGCCGCCAGT
AACGGCAGCG AGGCACTCGC GATCTGGCAG GAGCGCAAAA GCGAGATAGA CCTCGTGCTG
ATGGACCTGA CCATGCCGGA ACTGGACGGC GTCGAGACCT ACCGCGCCCT GCGCGAGGAT
ACTTCCACGC TCCCGGTTCT TTTTTGCAGC GGGTACGGAG ACCAGGACAT CCGCCCTTGC
ATAGGCGAGG ACGTCCACGC CGGCTTCATC TCCAAACCGT ACCAGTTGAA CCACCTACAA
CGAGCACTGG CGGCCCTCTG GGAAAACCGC ATGCCTCATG CCGCAGAGGG TTTCCCCGCC
TGA
 
Protein sequence
MVTDDTIAIP QLLYEAVSRG KREWEQTFDS IGDLIFITDT NHTISRANRA MARHCGLRPE 
ELPGRKCYDL FHDLASPPPY CPLRSLKEGG APQAEEVEVA KFRGFFDISV SPLYNDEGTL
AACVHVARDV TERKRAQEYR LELEQQLLQS QKLESLGVLT GGIAHDFNNI LMIILGHCML
AKENQAVAPV VGHLDQIESA GNRAADLCRQ MLAYAGKTPL VQTQIHLPAL VRDMVHMLQP
AFNKKVIIEC DLDGDLPNLT CDEGKIQQIV MNLVVNAAES LGERGGNVKV TLRHKTVLQS
EQEVDCFGNS IPPGTYLCLE VADTGCGMDQ ETRKRIFEPF FTTKFTGRGL GLSAISGIIK
SHEGALQLCS APGAGTTFSV YFPLPPCCPA GDQVAPPLPS PSKAAARLEG TILLVDDEEE
LRAVGCELLT SMGFKVIAAS NGSEALAIWQ ERKSEIDLVL MDLTMPELDG VETYRALRED
TSTLPVLFCS GYGDQDIRPC IGEDVHAGFI SKPYQLNHLQ RALAALWENR MPHAAEGFPA