Gene GM21_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1102 
Symbol 
ID8136424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1291797 
End bp1292891 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content61% 
IMG OID644868713 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003020921 
Protein GI253699732 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCC ATCCCCCTTT GAAAGCCGAG AGCACCCCCT GGGAAAACCT CGAACGCGAA 
CAGTACCAGA GCTTCTTCTC CTTAATCCCC GATCTCGCCT GCATCTGCTC CGCCGACGGC
TATTTCAGCT ACTTGAACCC CGCCTGGGAA GAAACCTTGG GATACACCAA GGAGGAGCTT
TTGAGCCGGT GCTTCTCCGA GTTCGTTCAC CCGGACGACA AGGCGCTGAC CTCGGTCAGG
ATGAGGCAGC GAGCAAGATT CGTGAATCAC TACCGGTGCC GGGACGGGAG CTACCGGTGG
TTTGAGTGGA ACACCGCCCC CAACCCCGAC GGCACGGAAA CCTTCGCCAT CGCACGGGAA
ATGAACGACC TGGTCCAGGC GCAGGATGCC CTTTTGGTGC ACCAGGAACA ACTGAGGCTG
CTGGCGATCG AGCTGTCCGT GGTGGAGGAG AGGGAGCGGC GCCGTATCGC CAGTGAGCTT
CACGACGAGA TAGGTCAGAC TCTTGCCCTC GCCAAGATCA GACTGCACGA CAACCTCTGT
AACGAGCAGG TGACCACCGG CTGCGCCAGA CGGGTGCAGC ATGTAAGCGA GCTGGTCGAG
AAGACCATAC AGGCGGTAAG GACCCTTACC TTCCAGATCA GTCCTCCGCT TTTGTACGAG
GTGGGGCTCA AGGCGGCGGT GGAGTGGTTG TCCGAGCAGT TCGAGGCGGA GCACGGGCTG
AAGATAGTGA TAGACAGCCG CGAGTCGCGG ATGCGGCTCG GCGAGGAATT GAGCTCCACC
TTGTACCACG TGGTGCGGGA ACTGCTGGTC AACGTGGTAA AGCATGCCGG AGCCCGGACG
GTGACTATCC GGCTGCGACA GATGGCGCAC CGGGTGGAAC TGACCGTCAA AGATGACGGC
GGCGGCTTCG AGATCCCGGC CGGTGCCGAG CGTTCGGGCG GATTCGGCCT CTTCAACATC
AGGCAGAGGA TTCAGCACCT GGGCGGAGTC GTGAAAATAG TGGCGGAGCC GGGGCACGGA
GTCGAGGTGG ACCTGACCGT GCCGGTAGGG AAGCGGACAG CAAACAGGCA GGCGAAAAAG
CAGGAGAAGG GATGA
 
Protein sequence
MNRHPPLKAE STPWENLERE QYQSFFSLIP DLACICSADG YFSYLNPAWE ETLGYTKEEL 
LSRCFSEFVH PDDKALTSVR MRQRARFVNH YRCRDGSYRW FEWNTAPNPD GTETFAIARE
MNDLVQAQDA LLVHQEQLRL LAIELSVVEE RERRRIASEL HDEIGQTLAL AKIRLHDNLC
NEQVTTGCAR RVQHVSELVE KTIQAVRTLT FQISPPLLYE VGLKAAVEWL SEQFEAEHGL
KIVIDSRESR MRLGEELSST LYHVVRELLV NVVKHAGART VTIRLRQMAH RVELTVKDDG
GGFEIPAGAE RSGGFGLFNI RQRIQHLGGV VKIVAEPGHG VEVDLTVPVG KRTANRQAKK
QEKG