Gene GM21_4026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4026 
Symbol 
ID8139400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4609799 
End bp4611613 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content62% 
IMG OID644871642 
ProductCheA signal transduction histidine kinase 
Protein accessionYP_003023800 
Protein GI253702611 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00000000000112235 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTATAG ATTGCGAAGA TCAGGAGCTA CTCGAAGGTT TTCTTGCGGA AACCACGGAG 
CTCCTCGAAA AACTTGACGA CGACCTGATC ACCCTGGAGA AGAGCCCGGA TGACGCCGAG
CTGATGAACC GGATTTTCCG CTCCATCCAC ACGGTCAAGG GGGCCTCCAG CTTCCTCGGG
TTCGACATGC TGGTCAAAGT GACTCACAAG ACCGAGGACG TGCTGAACCG CCTGAGAAAG
GGGGAGCTTT TCCTCAACCC AGAGATCATG GACGTGATCC TGGAGGCGGT CGACCTGGTG
AAAACACTGG TGGCCGACAT CAAGGGGGGG GATATCGTCG AGAGGGACCT CGAGGGGACC
ATCTCCAAGC TGATTCCGTT TCTCTCCGAG AACGCGACCG AGGCGACGGT ACTGGCACCT
ACCTCAGCTT CCAAGGAGGA GAAGGGGGCG ACGCCTCCTC CCCCCCCCGA GTCGGTAGCG
GACACGGCGC AGGAAGCAAC CGCTCGGGAG GAGAGCGCAC CCGCTGATAC CGGCGCGTCC
AAGGCGGCAG CCGCGGCGCC GGCGGCGCAG CCCAAGCCCC AGCCTGTCAA GGAGCCGCAA
AAGCCCGCCC CCAAAGGTGA GGAACTGGCC GACAACTCGA CGGTGCGCGT CGACGTGAAG
CGCCTGGACG ACCTGATGAA CCAGGTCGGC GAGCTGGTGC TCGAGCGCAA CCGGATGATT
CAGCTGCACA GCGACTACCA GACGGGTCTC GACCCCACCG GCTTCGGCGA CGATTTCGGG
AAACTCTCCA AGAGGCTCAA CTTCGTCACC TCCGAGCTGC AGATGCAGGT CCTCAAGATG
CGGATGCTGC CGGTGGAGAA GGTCTTCAAG AAGTTCCCGC GCATCGTCAG GAACCTGGCG
CGCGACCTCG GTAAGGAAGT TGATCTGGTT ATCATCGGCG AGGAGACCGA ACTGGACCGC
TCTGTCGTCG ACGAGATCGG CGACCCCTTG ATCCACCTGA TCCGCAACGC CCTGGACCAC
GGGCTGGAAA CCCCGGACCA GCGCCTTGCC GCCGGCAAGG ACCGCACCGG TACCGTGGTC
CTCTCCGCGG CCCACGAGGG GAACCAGATC GTAATCAGCA TCAAGGACGA CGGCCGGGGC
ATAGACCCCG AGCGTATCTC CAAGAAGGCA CTGGAGAAGG GGCTCGTCAC CGATGAGCAG
CTCGCTTCCA TGGGGAACCG CGAGATCCTC GACCTCCTCT TCCTCCCGGG CTTCTCTACC
AAGGAGCAGA CCACCGACCT CTCCGGCCGC GGCGTCGGGA TGGACGTTGT GCGTACCAAC
ATCCGGAAGC TAAACGGCAT CATCGAGATC AAGAACGACG TCGGGCACGG CACCGAGTTC
ATACTGAAGC TCCCGCTCAC CCTGGCCATC ATCCAGTCCC TGCTGGTCGA GGTGGAAAAG
GAGGTCTATT CCATACCGCT TGCGTCGGTC ATCGAGACCA TGCGGGTGAG CAAGAGCGAG
TTCCACATGA TCGGCGGCCA GGAAGTGCTC AAACTTAGGG ACTCGGTGCT TCCGCTGCTG
CGGCTGCAAC AGACCTTCAG CTGCCAGGAG TACTACACCG ATCGCGACAC CTGCTATGTG
GTCATCGTCG GCGTGGCCGA AAAGCGCATC GGCCTCATCG TGACCAGGCT ACTTGGGCAG
CAGGAGGTCG CCATCAAGTC GCTGGGCAAG TTCCTCGCCA ATCTCCCGGG GATCGGCGGA
TCGACCATCA TGGGCGACGG GCGGGTAGCA CTCATCGTGG ACCCGATGGG TCTCATCGGG
GGCGGGGCAG CCTGA
 
Protein sequence
MAIDCEDQEL LEGFLAETTE LLEKLDDDLI TLEKSPDDAE LMNRIFRSIH TVKGASSFLG 
FDMLVKVTHK TEDVLNRLRK GELFLNPEIM DVILEAVDLV KTLVADIKGG DIVERDLEGT
ISKLIPFLSE NATEATVLAP TSASKEEKGA TPPPPPESVA DTAQEATARE ESAPADTGAS
KAAAAAPAAQ PKPQPVKEPQ KPAPKGEELA DNSTVRVDVK RLDDLMNQVG ELVLERNRMI
QLHSDYQTGL DPTGFGDDFG KLSKRLNFVT SELQMQVLKM RMLPVEKVFK KFPRIVRNLA
RDLGKEVDLV IIGEETELDR SVVDEIGDPL IHLIRNALDH GLETPDQRLA AGKDRTGTVV
LSAAHEGNQI VISIKDDGRG IDPERISKKA LEKGLVTDEQ LASMGNREIL DLLFLPGFST
KEQTTDLSGR GVGMDVVRTN IRKLNGIIEI KNDVGHGTEF ILKLPLTLAI IQSLLVEVEK
EVYSIPLASV IETMRVSKSE FHMIGGQEVL KLRDSVLPLL RLQQTFSCQE YYTDRDTCYV
VIVGVAEKRI GLIVTRLLGQ QEVAIKSLGK FLANLPGIGG STIMGDGRVA LIVDPMGLIG
GGAA