Gene GM21_1118 details

Gene Information

Locus tag: GM21_1118
Symbol:
ID: 8136440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1309392
End bp: 1311236
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 62%
IMG OID: 644868729
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003020937
Protein GI: 253699748
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
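
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the record states neither explicitly). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1309392, 1311236)
print(length)                     # 1845
print(protein_length_aa(length))  # 614
```

Both values agree with the Gene Length and Protein Length fields of this record.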


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 147
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATA CGAAAGAACT GGAATTCGCG GACATAGGCA CAGTCAACAG CCACGACCAC 
ATCTGCCACA TCTACCATGG AGAGGGGGAG ATCTACCACC CGGTGCTCCC CTTTATCCAG
AAGGGGGTGG CCATAGGGGA GCGCTGCATC TATCTGCATG GGTCGGAGCA ACGGCTGGAG
CGGTTGCTGC AGAATGCGGT CTCCTCACAG CAGGACGACT CCGGCGCACT GATACTGTTC
CCGCTTGAGG ATTTCTGGCA CAAGGAGGGC GCGTTCAAGC AGGAACGGGT GCTGAAACTT
TTGCACAAAC TCTGCAGTGC CGCCATTGAC GACGGCTTTA GCGGCACCAG GGTCATCTGT
GACATGGGAT GGGCCGCCTT AGAGCCTAAG CGACAGGAAC TGCTGCAGCG TTTCGAAAGG
GAGTTGACCG CATTCGCTTC GCAAAACGAC GTGACGCTCC TTTGCCTGTA CAACCGCGAC
CTTTTCCCTC CCGAAATAAT ACTGGAGCTG GCGAAGCTGC ACCCGCAGGT GATCGTCGGC
GGCAGGACCT GCGGCAATCC TTTCTATTTC CCGGCAGCTT CGGACACTCG CATCAGGGCG
GCGCGCTGCG AACTGGACGT ATTTATGGCT ACCGCGCAGA GAATGACGGT GCTTTTGGCC
GAGAGCGACC GTCTCAAGCA GGAGCTGGAA CAGGCCTATG GCGCGCTGGC CCGGAAGATC
TATGAGAATT GGCAGGAGGA GGACTCGCTT AGGGCCAATG AGAAGGAGAT GCACGAGAAG
GACGAGGCTC TTTTGGAACA CAAGAGGAAA CTGCAGACCA TCCTGCAACA TATCCCGGCC
ATGCTTATGG CGTTCGACGG CGGCGACAGG CTTGCCGCCT GCAATCACGA GTTCGAACGG
GCCACGGGCT TCAGGGTGGA AGAGGTTATC GGCAAGCCCA TGCTGGAGCT GCTCCACGTG
GAAGGGGAGC TGCGCGAAGA GGTGGTCTCG GCGCACCCGC GGGAGGGGGG GGACTACCGG
GGGAGGGAGT GGAGCCTGCG CTGCAGAGAC GGCTCTGTGA AGACCGTCGC CTGGTCCAAT
ATCTCCCGCT ATGTCCCGAT CAGGGGGTGG AGCAACTGGG TCGTGGGACT GGACGTGTCG
GCCAAACTCC ATGCGGAGAA CGCTCTCAAG GGGCTGCGCG AAGAGTTGGA GGCGAGAAAC
GCCGAGCTGG AGGCGTTCGG CGAGGCGGTC TCCCACGATC TCAGCGCGCG GTTGGCCCGG
ATCAGCGAGG ACTGCCGCGA GATGCAAAAG CTCTACGGTG GCGACCTTTC GACCCCCTGC
CGCGAGATGC TGCAAAAGGT AAGCGTCGCG GCGCTGGAGC TGGCCGGCCC CATCGCCGCC
CTGCAGCGGC TGACCGCGCT GGCGGCTGCA GGCCTGCAGC CTGAAGAGGT GGACTTAAGC
GCCATGGCCT CGGAGATAGC CGAGAAGCTA TCGGATACGG TCACCCGGCC GGTCACCTTC
AGGATCGAGG ACGGGGTGAC TGTGACCGGC GACCGGGAGA TGCTCCGGTT GGCCATGGAA
CAGCTGCTGG AGAATGCCTT CAACTGCACC GTGGGCGTAA AGCACCCGGT GATCAAGTTC
GGTACCGCGC AGGTGAAGGG GGAGCGGAGC TTCTACGTTT CCGACAACGG CCCCAGACCC
GGCGAGCAAC CGGGCAAGGG GATAGTAGGC AAAGCAGAGG GGCAGGAGCG GATATCCAGC
GGCATCGGCC TTGCCACGGT ACAAAGGATC ATCAACCTGC ACCGCGGCCG GTTTTGGTGC
GCCGATCAAA CGGGCAGGGG AGGTACCCTT TACTTCCAGG TCTAG
 
Protein sequence
MEDTKELEFA DIGTVNSHDH ICHIYHGEGE IYHPVLPFIQ KGVAIGERCI YLHGSEQRLE 
RLLQNAVSSQ QDDSGALILF PLEDFWHKEG AFKQERVLKL LHKLCSAAID DGFSGTRVIC
DMGWAALEPK RQELLQRFER ELTAFASQND VTLLCLYNRD LFPPEIILEL AKLHPQVIVG
GRTCGNPFYF PAASDTRIRA ARCELDVFMA TAQRMTVLLA ESDRLKQELE QAYGALARKI
YENWQEEDSL RANEKEMHEK DEALLEHKRK LQTILQHIPA MLMAFDGGDR LAACNHEFER
ATGFRVEEVI GKPMLELLHV EGELREEVVS AHPREGGDYR GREWSLRCRD GSVKTVAWSN
ISRYVPIRGW SNWVVGLDVS AKLHAENALK GLREELEARN AELEAFGEAV SHDLSARLAR
ISEDCREMQK LYGGDLSTPC REMLQKVSVA ALELAGPIAA LQRLTALAAA GLQPEEVDLS
AMASEIAEKL SDTVTRPVTF RIEDGVTVTG DREMLRLAME QLLENAFNCT VGVKHPVIKF
GTAQVKGERS FYVSDNGPRP GEQPGKGIVG KAEGQERISS GIGLATVQRI INLHRGRFWC
ADQTGRGGTL YFQV
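
The relationship between the gene and protein sequences above can be sketched in code. This is a minimal illustration under stated assumptions, not the pipeline that produced this record: it strips the 10-character block spacing, computes a GC fraction, and translates using the codon assignments of translation table 11, which assigns amino acids the same way as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, alternative start codons are not handled, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
# Table 11 shares these assignments with the standard code; '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGGAAGATA CGAAAGAACT GGAATTCGCG GACATAGGCA CAGTCAACAG CCACGACCAC"
print(translate(first_line))  # MEDTKELEFADIGTVNSHDH
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.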