Gene GM21_2829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2829 
Symbol 
ID8138172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3293291 
End bp3295099 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content63% 
IMG OID644870431 
Producthistidine kinase 
Protein accessionYP_003022620 
Protein GI253701431 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCT TTTCCCGCAC ATTCAAACTC CCGCTCGCCC TTTTGCTTAT TGTAGCCTTG 
ACCGTCTGCG CCTTCATAAC CATCCGCCGC GCCAACAACG AGGTCCGCGC CGAGGCGCGC
GCCCGCTTCT TCGAACAGTA CAACCGCCAG CAGTACCTCG TCGCCGAGCT CACAGCGCAC
TCCCTAGACG AAATGTTCGC CACCTTCAGC CGCAACCTCG AACTGGTCGC CGGCCTTTTC
GACGGCAAAA GGGTCGACCG CGAACAGGTG CGACGGCTAC AGGTGCATCT GGAAAGAATC
TACGGCTCGC TGCTCGGCAC CCCGGTCATC GATCTGGTCG TCTTCGACAA TAAGGGCGTC
ACCGTCGCCA TCGAGCCGCA CGACAATTTC ACCTTGGGGC GAAATTATGC CTGGCGCGAC
TACTACCGCT GGGTGCGGGA CCAGCGGCAA CCGGGGCGGA TGTACATCTC GCCCTTCATG
CAATTGGAGG GGGGAAAGAA ACGCGGGGTG AAGGAACTGA TCGTAGCCAG GGGTATCTAC
GGCTCACGCG GCGAATTCAA CGGCGTGGTC GCCTGCACGC TGGATTTCGA CAAGCTGGCC
CGAAAGCACG TCCTCTCCGT GAAGATCGGC AAGCACGGCC AGGCCTGGCT CATGGACAGC
ACTACCAGGA CCATCCTGGT CGACCCCAAC GGCAGGATCG CGGGGCAGAC CTTCGACGAG
GCGCTGCGCC GCAAATGGCC GCGCCTTTAC GACCTGCTCC TTTCGGCCGG AGACGGCAAG
CCCGGCAGCG GCTGGTACGA GTTCGAGGAT CCGGCCGATC CGCGCCTGCA GGTCAGAAAG
CTGGTCAGTT ACCACCCGGT GCGTCTGGAA AACCGGCTCT GGACGGTGGG GGTGACCACG
CCGGAAAGGG AGGTGGAAGC GCTCCTCTCC TCGTTTTTGC AGCGGCAGGA GGCCATTTCC
ACCACGCTCC TCGTCACCAT ACTGGGAGGG GCGGCGCTGC TGCTCACCCT TTTTTACAAC
TGGAACCGGA CGCTTACCGC ACAGGTAGGC CTGCACACCC GCGCCTTGAG CAAGGCCCAT
TCCCGCCTGG AGGCTACCTT CGACGAACTG CTGGTCGCCA AAAAGGTGGC GGCCGTGGGC
CATCTGGCGC TCGGGCTCGC CCACGAGATC CGCAACCCGC TTTCGGCCAT CCAGATGAAC
ATGCAGATGA TCCGCAAGAA GATCGAGCCC ACCGGGGTTC TCTGCGAGAA CTTCTCCATT
GCCGACGGCG AGATCAAACG ACTGAACCGC CTGTTGAAAG ACGTCCTGGA TTTCGCGCGC
CCCCGCCCCG TTCGGCTGCA GACGGCCGAC CTGACGGGGA TCGTGCGGCG CCTTTTGCAA
CTGGTCGACC AGCGTCTGGC CCAGCACGGA GTCACCACGG TCACCGATCT CGAATCGCCG
CTTGAGCTTG TCTGTGACCC TGAGCAGATC CACCAGGTGC TCCTGAACCT GGTGCTGAAC
GCCATGGAGG CGATGGAGGG GATCGAGGGT GACAGGACCC TCAAGGTCGC CGCCTACCGG
CGGGAGACGC AGGCCTATGT GCTCGTCAGC GATTCCGGGA GAGGGATCGC CCCGGACAAG
TGCGAGCAGC TGTTCGATCC CTTTTACACG ACCAAGATAT CCGGGGGAGG GCTCGGGTTG
TCCATCCTGC AGACCATCGT CCTCTCCCAT GGCGGAAGCG TTTCCGTGAC GGGAGGGACG
GGCGCCGGCG CCACCTTTAC CGTCGTACTC CCCCTCGCGG GGCCTTCAGG AACTGGAGAA
ACACTGTGA
 
Protein sequence
MTAFSRTFKL PLALLLIVAL TVCAFITIRR ANNEVRAEAR ARFFEQYNRQ QYLVAELTAH 
SLDEMFATFS RNLELVAGLF DGKRVDREQV RRLQVHLERI YGSLLGTPVI DLVVFDNKGV
TVAIEPHDNF TLGRNYAWRD YYRWVRDQRQ PGRMYISPFM QLEGGKKRGV KELIVARGIY
GSRGEFNGVV ACTLDFDKLA RKHVLSVKIG KHGQAWLMDS TTRTILVDPN GRIAGQTFDE
ALRRKWPRLY DLLLSAGDGK PGSGWYEFED PADPRLQVRK LVSYHPVRLE NRLWTVGVTT
PEREVEALLS SFLQRQEAIS TTLLVTILGG AALLLTLFYN WNRTLTAQVG LHTRALSKAH
SRLEATFDEL LVAKKVAAVG HLALGLAHEI RNPLSAIQMN MQMIRKKIEP TGVLCENFSI
ADGEIKRLNR LLKDVLDFAR PRPVRLQTAD LTGIVRRLLQ LVDQRLAQHG VTTVTDLESP
LELVCDPEQI HQVLLNLVLN AMEAMEGIEG DRTLKVAAYR RETQAYVLVS DSGRGIAPDK
CEQLFDPFYT TKISGGGLGL SILQTIVLSH GGSVSVTGGT GAGATFTVVL PLAGPSGTGE
TL