Gene GM21_0010 details

Gene Information

Locus tag: GM21_0010
Symbol:
ID: 8135309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 16171
End bp: 17262
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 62%
IMG OID: 644867627
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003019855
Protein GI: 253698666
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
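
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(16171, 17262)
print(length)                  # 1092
print(protein_length(length))  # 363
```

Both values agree with the Gene Length and Protein Length fields of this record.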


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCACTGC GTGAAAGTGA CGCCGGATAT CGCGAATTGT TTGAAGAAAA CCCTCAGCCG 
ATGTGGGTAT ATCAGCGGGA GAGCCGGAGA CTTCTTGCGG TGAACGAGGC GGCCCTGCGG
CTTTATGGAT ATCGTCGCGA GCAGTTTCTC GAACTGGCCC TGGAGCATCT AAGCGGCGGC
GAGCCGATGG GGGACCCGGG TTCGTCGCAG GACCAGCAGC CGCGCTGCAG GCAGATGAGA
AAGGACGGCA GTTCCTTCGA AGCGCAGCTG GTCTGCCACC CTTGCCAGTT CCAGGGGGAG
CGGGTGCAAC TGGTGCTGGT GCGCGAGGAC GGTGGCGCCC AGCAAGAGGC GCAGCTTAGG
TACCGGGTGC TCCAGCAGGG AAGTCTTTTG GAGGCGGCGC AGCGCGAACT GGAGACCTTC
AGCTATTCCG TCTCGCACGA CCTGCGCGCG CCGCTACGCC ACATAGACGG CTTCAGCCGG
GCTCTCATGG ACGATTACGG AACCATTCTG GACGGCCAGG GCAAGGAGTA CCTGACCAGG
ATCTGCCAGG CGGCGGAGAA GATGTCGCAG TTAATCGACG CCATGCAGCA ACTGTCGCGG
GTGGGTAGGA CGGAGTTGAG CCTGGAGAAG GTCGACCTGA GCGTGAAGGC CCAGGTGATT
TCATTGGAAC TGAAGCACCG GGAGCCCGAA CGCCGGGTCG ACTTCGCCAT CGAGGAGGGG
GTGAGAGCCG AGGCCGACGC CAAGCTGGTG CGTCAGCTTC TGGAGATCCT GATGGGGAAT
GCCTGGAAGT TCAGTTCCAA GACACCCTCC GCGGTGATAG GCTTCGGCTC CGTCGAGCTG
CAGGGGGAGA CCGCGTACTT CGTCAGGGAC AACGGGGCAG GGTTCGACAT GGCCTACGCC
GACAAGCTCT TTTCCGTATT TCACAGGCTC CATCGTGCCG ACGAGTTCGA GGGAAGCGGC
GTGGGGCTTG CCATCGCCCA GCGCATCGTA GCGCGCCACG GTGGCCGGAT CTGGGCCGAA
AGCGCGCCCG GCGCCGGTGC CACCTTCTAC TTCACGTTGA AAGGCGAGAA ACAATTGACG
ATTGACAATT GA
 
Protein sequence
MALRESDAGY RELFEENPQP MWVYQRESRR LLAVNEAALR LYGYRREQFL ELALEHLSGG 
EPMGDPGSSQ DQQPRCRQMR KDGSSFEAQL VCHPCQFQGE RVQLVLVRED GGAQQEAQLR
YRVLQQGSLL EAAQRELETF SYSVSHDLRA PLRHIDGFSR ALMDDYGTIL DGQGKEYLTR
ICQAAEKMSQ LIDAMQQLSR VGRTELSLEK VDLSVKAQVI SLELKHREPE RRVDFAIEEG
VRAEADAKLV RQLLEILMGN AWKFSSKTPS AVIGFGSVEL QGETAYFVRD NGAGFDMAYA
DKLFSVFHRL HRADEFEGSG VGLAIAQRIV ARHGGRIWAE SAPGAGATFY FTLKGEKQLT
IDN
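
The relationship between the two sequences above can be sketched in code. The example below is a minimal illustration, not the method used to produce this record. It assumes that translation table 11 shares the standard codon assignments and that an alternative initiator codon such as GTG is read as Met at the first position. The names `translate_cds` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (third base varies fastest), as used by the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace between blocks.

    If `initiator` is true, the first codon is reported as Met, which is
    how a GTG start codon is read. Trailing stop codons are dropped.
    """
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = "".join(CODON_TABLE[c] for c in codons)
    if initiator and protein:
        protein = "M" + protein[1:]
    return protein.rstrip("*")


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("GTGGCACTGC GTGAAAGTGA CGCCGGATAT"))   # MALRESDAGY
print(translate_cds("ATTGACAATT GA", initiator=False))     # IDN
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.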