Gene GM21_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1302 
Symbol 
ID8136629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1528066 
End bp1529856 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content64% 
IMG OID644868916 
Producthistidine kinase 
Protein accessionYP_003021120 
Protein GI253699931 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000000000000333005 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGAAAAC TCCGTCAATC CGTATCCCTT TTCTTCGCCG GCTCCGCCTT GCCGTGCCGG 
CGGCAGCCTC TGCTCGTCAT AGCCCAACTG CTCCTTTGGG CCCTGCTCGT CGTCTCCATC
TGCGGCGCAG CCGAATCGGC CGCAGGCGTC AGGCCGGTCA TCGTCGGCGG CGACCGCGAC
TACCCCCCTT ATGAATTCAT CGACAAATCG GGCCATCCTG CCGGCTACAA CGTCGACCTC
ACCCGGGCCA TAGCCGACGT CATGGGGATG AAGGTCGAGT TTCGCCTGGG CGGCTGGGCC
GGGATGCGCA GCGCCCTGCA AAGCGGGAAG GTGGACGTGC TGCAGGGGAT GTCGTACTCG
CTGGAGCGTT CCTCGGAAGT CGACTTCTCC GTCCCCCATA CCGTCGTCAA CCATGCCGTC
TTCGCCCGCA AGGAATCCCC TTATCTCGCC TCCCTGACCG GTCTGAAGGG GAAGACCGTG
GCGGTGCATC GCGGCGGGAT CATGCACGAC TACCTGGTCC GGCAAGGTGT GGGGGCTAAG
CTGACCCTGA CCGAGACCCC GGCCGACGCC CTCAGGATGG TCGCTTCCGG CCGGACCGAG
TTCGCCGTGG TCGCCATCGT TCCCGGGATG TACATGATCC GGGAGTTGAA GCTCTCCAAC
CTGGTCCCGG TGCTGCGCAA CGTCGCCACC CATCGCTACT GCTATGCGGT CAAGAATGGG
AACGTCGAGC TTTTGTCGCG CTTCAACGAG GGGCTGGCGA TACTGAAGAA GACCGGCCAA
TACGACGTCA TTCACAACCG GTGGCTGGGG GTCGTGGAAC CGCAGTTGAT AGACTGGTGG
ACCTTCGTGA AATATGCGGC GGTCGTGGTG GTGCCGCTGG TGCTCCTTCT GGGGGGCTTC
GCCCTTTGGT CCCGCACCTT GCACCGGCAG GTCGCCCTGC GCACGGCGGA CCTCACCCGG
GAGATCGCCG AAAGGCGCCA GGTCGAGGAG GAACTGCGCC TGAACCAGCA GCAACTGGTG
CAGGCGGACA AGATGGCGGC CCTGGGGGTG CTGGTTTCCG GCGTCGCCCA TGAGATAAAC
AACCCGACCG GACTCATCCT TCTGGAGGTC CCGATCCTGA AGCGGTTCCA TGCGGACTCG
GTGAAGATCC TGGAGCGCTA TTACGAGGAG AACGGCGACT TCACCTGCGG CGGGCTCCCC
TATTCACGGA TGCGCCAGGA GATTCCCCGG TCCCTGGAGA AGATTCAGGA CGCCGGCAAG
CGCATCAAGC GGATCGTGGC GGATCTGAAA GACTTCGCCC GCCGCGACGA AACCGATTGC
AACGAAATCC TTGATCTGAA CGCGGCGGCT AAGGCCGCGG TACGCCTGGC CGAGCCGACC
ATAAACAAGG CGACCACCCG CTTCAGCGCC GAGTATCGCA AGTCGCTGCC GCGCATCCGG
GGGAACCGCC AGCGCATCGA GCAGGTGCTG GTCAACCTGA TCCTCAACGC CTGCCAGGCG
CTTCCCGACC CGGAGCGGGC CATCGAGCTG ATGACCTGGC ACGACGCTTT CCGGGATCAG
GTGGTCCTAC GGCTGCGGGA CGAGGGGACC GGCATCGCTC CAGAGCACCT GTCGCGCCTG
ACCGATCCGT TTTTCACCAC GAAGCAGGAC CAGGGGGGGA CCGGTCTCGG GCTTTCCGTC
TCGGCTGGGA TAGTCAAGGA GCATGGCGGG ACCCTGCAAT TCGAGTCGAA CGGGGAGGGG
GCCACGGTCA CCCTGACCTT GCCGGTGTAC CACGAGGAGA ACAACGGATG A
 
Protein sequence
MGKLRQSVSL FFAGSALPCR RQPLLVIAQL LLWALLVVSI CGAAESAAGV RPVIVGGDRD 
YPPYEFIDKS GHPAGYNVDL TRAIADVMGM KVEFRLGGWA GMRSALQSGK VDVLQGMSYS
LERSSEVDFS VPHTVVNHAV FARKESPYLA SLTGLKGKTV AVHRGGIMHD YLVRQGVGAK
LTLTETPADA LRMVASGRTE FAVVAIVPGM YMIRELKLSN LVPVLRNVAT HRYCYAVKNG
NVELLSRFNE GLAILKKTGQ YDVIHNRWLG VVEPQLIDWW TFVKYAAVVV VPLVLLLGGF
ALWSRTLHRQ VALRTADLTR EIAERRQVEE ELRLNQQQLV QADKMAALGV LVSGVAHEIN
NPTGLILLEV PILKRFHADS VKILERYYEE NGDFTCGGLP YSRMRQEIPR SLEKIQDAGK
RIKRIVADLK DFARRDETDC NEILDLNAAA KAAVRLAEPT INKATTRFSA EYRKSLPRIR
GNRQRIEQVL VNLILNACQA LPDPERAIEL MTWHDAFRDQ VVLRLRDEGT GIAPEHLSRL
TDPFFTTKQD QGGTGLGLSV SAGIVKEHGG TLQFESNGEG ATVTLTLPVY HEENNG