Gene GM21_1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1970 
Symbol 
ID8137304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2284942 
End bp2286207 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content46% 
IMG OID644869584 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003021781 
Protein GI253700592 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones140 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTG GGGAAAATAC CAACAGCAAA AAAAAGACCC GCTCCAAAAC AGGAGCGTTG 
GCGGCTAGGC AGTTACAGAG CCTGTTACGA CACGTGCCTG TGGTCTTATT TCAGTATCAA
AAACATAATG ACGGAAGGCA TAGCTTTCCT TATGTAAGTG AGACCTTGAC ACAAATTTTC
CATCTTGGCC CATGTGAAGC CACGACTGAG GCTTCGGATT TCTTTTCACT TTTACACCCC
GATGACATAG GTATCGTGAC TGCGTCAATT GCAGACTCAG CAGCGAACTT ATCTCTTTGG
CAGCAGGAGT TTCGCATTGT TATAGAGCGG GAGCATTGGA TTGAAGCAGC AGCCACTCCG
GTCATGATTA ACGATGGCAG TTGCTTGTGG AGTGGGTATG CGAGAGAGAT AAATGAGAGA
AAGGTTTTAG AGCAGGATTT GCGAGAAGTA CAGGAAGACT TAATGCGGAT GACCGAGGAA
CGGACCAACA AACTGGTTGA GGCGAATACA AAATTATGTG CTTTAAACCA GGAGATAAAA
GAAGAGATTA ACCAGCGCAT CAGATTAGAG AATAGCCTAA AAGAATCATA TGAACTTCTG
TCTTTGCTCG CTGCAAATCT AGTATTTTCA GAAGAACGAG AGCGGAGAAG AATCGCAACC
GAGTTGCACG ACGATGTGGT GCAACATCTG GCGCTTTGCA AATTGAGACT GGATATGGAA
CTCAAAGACG GTGCTCCGTC GCGTGTATTG CAGGAAGAAC TCGTAGGTGA GCTGGTAAGG
ACAATGCAAC AAATCAGACG TATCTGCTAC GATCTGAGCC CGCCGGTGCT GTACGATTTT
GGGCTTCCTA ATGCGCTGCA AAATCTAGGA GAAACAATGA CGCAGGCAAC AGGTCTGCAA
TTTAGGTTTC AAAACGGCTT GAAGAAGCTT GAATTACCGA ATCATATACG CACTGTACTG
TATCAAACAG CTAAAGAACT GCTAGCCAAT GTGATGAAAC ACGCGATGGC GAGCAATGTC
TCGGTAGCCC TCACTAAAAG TGAGGAATTA ATCAGGCTAT CGGTAACTGA CGATGGAGTT
GGGTTCCCAT CACTTGGCAA GAAGGGTTTC GGATTGTCAC ATATTCAGCA AAGAGTGGCT
TTCCTCAAAG GAAATCTGAG CATTTCCTCA GGACCCGGCA GAAAAACTGT TGTAGCAGTT
GAGATACCGG CAACACCTGC GAGCAGTGCC AACACGCCCC CCTCACCTCG TAATCAGCCT
ACTTGA
 
Protein sequence
MDIGENTNSK KKTRSKTGAL AARQLQSLLR HVPVVLFQYQ KHNDGRHSFP YVSETLTQIF 
HLGPCEATTE ASDFFSLLHP DDIGIVTASI ADSAANLSLW QQEFRIVIER EHWIEAAATP
VMINDGSCLW SGYAREINER KVLEQDLREV QEDLMRMTEE RTNKLVEANT KLCALNQEIK
EEINQRIRLE NSLKESYELL SLLAANLVFS EERERRRIAT ELHDDVVQHL ALCKLRLDME
LKDGAPSRVL QEELVGELVR TMQQIRRICY DLSPPVLYDF GLPNALQNLG ETMTQATGLQ
FRFQNGLKKL ELPNHIRTVL YQTAKELLAN VMKHAMASNV SVALTKSEEL IRLSVTDDGV
GFPSLGKKGF GLSHIQQRVA FLKGNLSISS GPGRKTVVAV EIPATPASSA NTPPSPRNQP
T