Gene GM21_0008 details

Gene Information

Locus tag: GM21_0008
Symbol:
ID: 8135307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 12181
End bp: 14448
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 62%
IMG OID: 644867625
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_003019853
Protein GI: 253698664
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
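
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(12181, 14448)
print(length)                  # 2268
print(protein_length(length))  # 755
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.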


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value: 0.865749
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTTC GTTTTGGCAT AAGAACAAAA CTGCTCCTGT CCATACTCGC CATCCTCTTC 
GTATCATATT CAACCCTGTT GTACTCCTCG ATGAAGACCC TCAACGCGTC GCTTCGGACC
GAGCTGGACC GTAACCTCGC CACCAACATC AAGTACGCAC GCAGCCAGTA CCTGGCTCAA
GCCGAGATCG CCAAGTTCTC CGTGATGCAG TCGGTGGTTT CGCAGTCGGT GCAGCAGCAC
CTGCGCGAGC GGGACAGCGC CTGGTTCTCC TCGCGCGTGA AGCACTGGCA TGCCGTGCTC
CCCTTCGTGG ACCTGGTGGT CGTGGTGGAT CCCGAGGAAC AGGTACTGGC CACCCTGCAA
GGCCCCCGAA ACGGCGGCCC GATGGAGCTG CCGGTGGTGG TGGAACAGGC CCTGGCCAGC
AAAAAAGCCA TCTTGTCCAC CGAGCTTCTG AGCGGCGAGT TCATGTGCCG GGCGGGAGTG
GAAGGATACT GCGAACGCCC CGACAGCGAG ACCCTGGTTG CCACGGTGGC GGTCCCGATC
ATAGGCGCGG ACGGCGGCGT GCTCGGTTGC GTGGTGACCG GAGACATCAT GAACCACCAC
CCGAACCTCC CGGCCAAGCT GCAGGAGGTC TCCGGGAACA ACGTGGAGGT GACGCTCACC
CAGCGCGGGC TTCGGGTCGC CAGCAGCTTG CCGGAGCGGG TGCTGGAATC CTATACCCTC
TCCCCCGCGG TGCTCGACGT GCTGGAGCGC GGCGAGGTGT ACCGCGGCCG GACCGCCATG
GGCACCAAGA GCTACGAGAC CATCATCGAC CCCCTTTTGA ACAGCAGGGG GGAGTTCGTC
GGGTCGCTTT CCGTGGCCAT ATCCACCGAA ACGGTGACCA GCGGCAGGCG GGAAAACCTG
CAGTACATCC TCGCCTCGGC TTTCCTCGGC ATCATCTGCT CCTTCGGCAT GGCCTACATC
GCCTCGCGTC ACCTGACGGG GCCGTTGCGG CAGTTGGCGG CGAGCGCGCG CCGCATCGAA
GAGGGGGATC TCGACCAAAG GGTCGTCGGC CACCAGCGGG ACGAGGTGGG TATGCTGGCC
TCCTCCTTCA ACAACATGGC GGAATCGCTC AAGGAGCGGG ACAGCATCAT CAACAGGAAG
ACCGGCGACC TGCAGGAACT CAACGAGCAG CTGGAGAGAA TGGTCGAGCA GCGGACCTCC
GCCCTCAGCA TGGAGATGGG GAGGTTGGAG GCGGTCTTGA CCAGCCTGGC CGAAGGGGTG
GTGGTGACCG ACAGGGACAA CCTCGTGGTG CTCTTCAACC CGGCGGCTCA GCAGATCTTC
GAACTGGTCC CGCATCGCGT GGTCGGGCAG TCCGTCGAGC GCCTGTGCGA GATGACGGGC
TTTTGCAACG TTCTGGAACA GGTCGGCGAG CAGACGCCGC GGGAGCGCAA CCGCGGCGGG
AAAAAAGAGA TCACCGTGAA AGGGAAGCGG CTGAACGTGA ACAAGGCCAC CCTCCAGGAC
GAGGCAGGTG AGTTCGCCGG TATGGTCATG TCGCTGCGCG ACGTCACCAA GGAGGAGCAG
GTGGACCGGA TGAAGACCGA GTTCATATCC ACCGTCTCCC ATGAGTTGAA GACGCCGCTT
ACCTCGATCA AGGGGTCGCT GCAACTGCTT TTGACCCGCA GCAAGTGGCT CACGGACACC
GAGAGGCAGC TTTTGACCGT CTGCTTCCGG AACACGCAGA GGCTGATCCG GCTGATCAGC
GAGATCCTCG ACATTTCCGG CATCGAATCG GGCGGGATGA TCTTCAACTT CAAGTCGCTT
TGCATCGGTG AACTCGCGGT GTACGCGGTC GAGGAGATCA AGTCCTACGC CATGGGACGG
GACATCACCA TCGTCAACAC CGTGGGCGAG CATCTTCCCA TGGTGTTTGG CGACAGCGAC
CGCCTGATCC AGGTGATGAC CAACCTCCTC TCCAACGCGG TGAAGTTTTC CCCCGAGGGG
AAGGTGGTCA TGGTCACCGC CGAGCAGGAA GGAAACTACG TTGTGGTTTC GGTGGCCGAC
CGGGGGCGGG TGATACAGTG GTCCGACCGG GACAAGCTTT TCAAGAAGTT CCAACAGATC
GAATCGACCG AACGCGGCAA GATCGGCGGC ACCGGGCTGG GGCTCGCCAT CTGCAAGGAG
ATAGTAGAGC GGCATCACGG CAGGATCTTC TACACCGCCG CCAAGGAATA CGGCAATACC
TTCAGCTTCA CGGTGCCGAT AATAGGGGAG ACAGATGCAA AAGGATAA
 
Protein sequence
MQLRFGIRTK LLLSILAILF VSYSTLLYSS MKTLNASLRT ELDRNLATNI KYARSQYLAQ 
AEIAKFSVMQ SVVSQSVQQH LRERDSAWFS SRVKHWHAVL PFVDLVVVVD PEEQVLATLQ
GPRNGGPMEL PVVVEQALAS KKAILSTELL SGEFMCRAGV EGYCERPDSE TLVATVAVPI
IGADGGVLGC VVTGDIMNHH PNLPAKLQEV SGNNVEVTLT QRGLRVASSL PERVLESYTL
SPAVLDVLER GEVYRGRTAM GTKSYETIID PLLNSRGEFV GSLSVAISTE TVTSGRRENL
QYILASAFLG IICSFGMAYI ASRHLTGPLR QLAASARRIE EGDLDQRVVG HQRDEVGMLA
SSFNNMAESL KERDSIINRK TGDLQELNEQ LERMVEQRTS ALSMEMGRLE AVLTSLAEGV
VVTDRDNLVV LFNPAAQQIF ELVPHRVVGQ SVERLCEMTG FCNVLEQVGE QTPRERNRGG
KKEITVKGKR LNVNKATLQD EAGEFAGMVM SLRDVTKEEQ VDRMKTEFIS TVSHELKTPL
TSIKGSLQLL LTRSKWLTDT ERQLLTVCFR NTQRLIRLIS EILDISGIES GGMIFNFKSL
CIGELAVYAV EEIKSYAMGR DITIVNTVGE HLPMVFGDSD RLIQVMTNLL SNAVKFSPEG
KVVMVTAEQE GNYVVVSVAD RGRVIQWSDR DKLFKKFQQI ESTERGKIGG TGLGLAICKE
IVERHHGRIF YTAAKEYGNT FSFTVPIIGE TDAKG
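
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the gene sequence relates to the GC content and protein sequence fields, the code below strips the block formatting, computes the G+C fraction, and translates codons until the first stop. It uses the amino acid assignments shared by the standard code and translation table 11. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are the same
# for the standard code and for bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGCAACTTC GTTTTGGCAT AAGAACAAAA")
print(translate(fragment))  # MQLRFGIRTK
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared with the GC content and Protein sequence entries of this record.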