Gene GM21_0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0361 
Symbol 
ID8135668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp435895 
End bp437838 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content60% 
IMG OID644867978 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003020200 
Protein GI253699011 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.00025473 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGAGA TCCGTGACAA CCATAGCTTC GCCTTTTTGG AGCAGAAGGT CGAGGAGCGA 
ACCCGCGAAT TGAAACAGGA GATCCAGGAG CGCCTGCGGG CCGAAGGGCA ACTGGCCGAG
GCGCGGGACC ACTACCTGAA CATCCTGGCC GAGGCCCCCG CGCTGATCTG GCGTGCGGAC
ACGCAGGCCA AGTGCGACTG GTTCAACAAC ACCTGGCTTT CTTTTACCGG CCGCACCATG
GAAGAGGAAT ACGGCGACGG CTGGGCGCAG GGGGTCCACA CCGACGACCT GGAGCGCTGC
GTTGCCATCT GGCTCGGGGC GTTTCACCGG AAGGTCCCGT TCGAAATGGA GTACCGGCTG
CGACGCCACG ATGGCGTATT CCGCTGGATC CTGGACATCG GCCGCCCCTT CTCCGGGCTC
GACGGCAGCT TCGCCGGGTA TATCGGTTAC TGCTTCGACA TCACCGACCG CAAGGGGGCG
GAGATGGAGC TGATAGTGGC AAGGGAGGCC GCCGAGGCTG CCAGTAAAGC GAAGACCGAG
TTCCTGGCCA ACATGAGCCA CGAGATCCGC ACCCCGATGA ACAGCATCAT CGGGATGACG
CAGCTATTGG CCTACACGGA GCTTTCAGCC GAGCAGAAGG AGTACGTCGA CGGAATCCTC
ACCTCTTCGG AGGGGCTCCT TGCCATCATC AACGACATTC TCGATCTTTC GAAAGTGGAG
GCAGGGAAGA TAGAACTGGA GTCGCGGAAT TTCAGCCTCA GACAGAACAT CAACGAAATC
ATCAGGACCC AGACGGCGGC GGCCCACGAA AAGGGGCTCC AACTTAAAGT CTTCATCCCG
GAAGAGATAC CAGACGCACT GGTCGGCGAT CGGCTGAGGC TAAAGCAGGT GCTACTCAAC
ACAATCGGCA ACGCCATCAA GTTCACCGCA AGCGGGAGCA TTGCCGTAAC GGTAGCGCTG
GCGGAAAAAC AAGAGGACGC GGCTCGCCTT ACATTCAGCA TCGCCGATAC CGGCATCGGG
ATAGCGCCGG AGTCCCTGGA CCGCATTTTC GCGCCTTTCG CGCAGGAAGA CACCTCCACC
ACCAGGAGGT ACGGCGGCAC CGGTCTTGGG CTTTCCATCA GCACGAAGCT GGTCCGGCTC
ATGGGGGGAC GGATCTGGGC GGAAAGCCGC AAGAACACCG GCAGCACCTT CCACTTTGAG
ATTCCCTTCC GGCTCTGCGC CAAGAAGGCC CGGCCCAAGG CGTCCTTAAA TACCGCAAGG
AAAGTCGTTT GGGACGGGGC CAAGCTCCGC ATCCTGCTGG CCGACGACCA GGAGATGAAC
CGCAACATCA TGGAGAAACT CCTGGGGCGG CTGGGGCACG AACTGGAGAG TGCTGCGGAT
GGCGGCGAGG CGCTGGCAAA ATGGGAAAAG GGGGACTTCG ACCTCATCCT CATGGATGTC
GAGATGCCTG GTATCGACGG CACGGAAGCG ACCCGCGTGA TCCGGGAGAC CGAGACCCCG
CTGGGAAAGC GGACCCCCAT CATCGCCCTC ACCGCCCATG CGCTTAAGAA CCACCAGGAG
ATGCTCCTCC GAGTAGGCTA CGACGGGTAC GTACCCAAAC CCGTGGAGAT GTCCACGCTG
TTACGGGAGA TGAGGCGCTG CTTGCGCCTG CCCGAGCTTG ACACAGCAGG CACAAGCGAA
GCTGTCGACG GAGTCCCCAC GACTCCCGCG GCAAACCAGC CTGGCATAGT GGACAGGCAG
CAACTTGCAG ATATACTCAG TTCAATCAGT TCGCTGTTGC GGCAAAGGAA CATGAAGGTC
CTGGACCAAG TAAACGACCT CTCCGCACTG ATTCCCGGGT CACCTCTTCT AGAACAGTTA
AACCAGCAGA TACGGCATTT TGATTGCCAA AGCGCACTAA ATACAGTCAG AGAAATCCTT
CTAGATTACG ACATCCACTC CTGA
 
Protein sequence
MDEIRDNHSF AFLEQKVEER TRELKQEIQE RLRAEGQLAE ARDHYLNILA EAPALIWRAD 
TQAKCDWFNN TWLSFTGRTM EEEYGDGWAQ GVHTDDLERC VAIWLGAFHR KVPFEMEYRL
RRHDGVFRWI LDIGRPFSGL DGSFAGYIGY CFDITDRKGA EMELIVAREA AEAASKAKTE
FLANMSHEIR TPMNSIIGMT QLLAYTELSA EQKEYVDGIL TSSEGLLAII NDILDLSKVE
AGKIELESRN FSLRQNINEI IRTQTAAAHE KGLQLKVFIP EEIPDALVGD RLRLKQVLLN
TIGNAIKFTA SGSIAVTVAL AEKQEDAARL TFSIADTGIG IAPESLDRIF APFAQEDTST
TRRYGGTGLG LSISTKLVRL MGGRIWAESR KNTGSTFHFE IPFRLCAKKA RPKASLNTAR
KVVWDGAKLR ILLADDQEMN RNIMEKLLGR LGHELESAAD GGEALAKWEK GDFDLILMDV
EMPGIDGTEA TRVIRETETP LGKRTPIIAL TAHALKNHQE MLLRVGYDGY VPKPVEMSTL
LREMRRCLRL PELDTAGTSE AVDGVPTTPA ANQPGIVDRQ QLADILSSIS SLLRQRNMKV
LDQVNDLSAL IPGSPLLEQL NQQIRHFDCQ SALNTVREIL LDYDIHS