Gene GM21_1870 details

Gene Information

Locus tag: GM21_1870
Symbol:
ID: 8137201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2175098
End bp: 2177437
Gene Length: 2340 bp
Protein Length: 779 aa
Translation table: 11
GC content: 57%
IMG OID: 644869481
Product: histidine kinase
Protein accession: YP_003021681
Protein GI: 253700492
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
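
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of residues, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2175098, 2177437)
print(length)                  # 2340
print(protein_length(length))  # 779
```

Both values match the Gene Length (2340 bp) and Protein Length (779 aa) fields of the record.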


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 103
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCT GCGAAGAAGC ACAGACTCAC GCCAAAATAT GGCCCTACCT CGTATTAATG 
CTCGGCATCC TTTTTTCCAG CCTCATGCCG GCCGCCCCAG TGGCTGCCCG GGAACACCCA
AAGACTCCAC AGCACGTACT CATCCTCCAC TCATACCACC CCGGCCTGCC TTGGACGGAC
AGGGTCATGG CTGGGATCCA GTCGGTCCTG GGCCAACCGG GACGCCAGAT CAACATCGAT
GTGGAATATC TGGATACGAA GCGGCACAGC GATCCGGCAT ATTTCTCACA CGTTCTCGAC
ACGATCCTTC ATTACAAGTT CAAAGACCGA TCCTACGATC TGGTGCTGGT CTCGGACAAC
GAAGCGCTCA ACTTCGCGCT GGGACATCGT GACGGCCTCT TCCGGCGCAC CCCCATCGTG
TTTTGCGGGG TGAACAGCGC CGTTCCGGTT CTGGCCGATA AAATTGAGGG TGTCACCGGG
GTGTATGCCG ATCCTGATTT TACTGGTGTC GTCAGCCAGG CCCTTGCCCT CCACCCAGCG
ACCAGGCAGA TCGTGGTCAT CGGCAGCACC CGCGAGCTCT CTGACCGTAT CAATCTTAAG
CAATTGCAGG CGCTGGGCCC TGCCGCGGCT CCCAAGGTTG CCTTTGATTT CTGGAACAAC
CTTCCCGCAG ACATGCTCAG AGAACGCTTG GCGGCTCTCG GCGCCGGGAG CATAGTAATC
ATCAATGGGT CGATTACCGA CCGAACCGGC AGCCTGCTCT CCTTCAACGA CCAGACTAAA
TTGGTGAGAT CCGCGACGAA TGTTCCCATC TACAGCTTCT GGGATGTCTA CCTCGGCGCG
GGGATCGTCG GCGGGCCGCT CGTCGCTGCC CACAAGCAGG GGACGCTCGC CGCGATGATA
GCTTGGAGGG TGCTAGACGG GGAAAAGGCG AGTTCCGTCC CCATCGTGCA TCCGGATCAG
GTGCCATACT CCTTCGACTA CAACGAATTG ATCCGGCTTG GAATACCTCT GCAGAAGCTC
CCTGAGCAGC ATTTGCTGTT TAACGCTCCG GCTTCCTCCT ATCAGATTTC CAAGTCGCAT
TTTTGGATAT CGGTGGCCGC TGTGTCGTGC CTTTTGGGCG TCGCCCTGCT GTTGACCTGG
AGTATCGTGC AACGCCGGCG GGCAGAAACT GGATTGCGGG AGAGCGAGCA GAAGTTCAGG
GAACTGGCGC AGCAGTTCCA GATAGTGCTG GATGGCATTC CAGACAGCAT CACGCTCATT
TCCAAAGAAA TGAAAGTGAT CTGGTCCAAC AAGGGAGCAG TGAATTCCGT TACCAGCCAC
CCCCCTTCCA TACCGCTGGA AGATTGCTGC AACCTGCTGT ACAACCGGAC CTCCATCTGC
GACAACTGCC CGGCGGTCAA GGCCTTTATA TCGGCCACGG CAGAGGAATC CATAATCACC
ACCCCTGACC AAAGGATCCT TGAGGTGCAG GCATTTCCGG TTTCAGGTGG AGCTCAAGAA
ATTTCCTACG TCATCTTGCT CGCCAGCGAT ATCACCGAAA AAACCCGGCT GCTGGACGAG
ACGACACGTA CCAAACGTCT GGCCGCACTA GGTGAGCTGG CGGCCGGAGT GGCCCACGAG
ATCAACAACC CTAACGCCCT CATCCTTTTC AACGCGGATC TGGTGAAAAA GGCATGCGGC
GATGCCGCAC CGATCCTTAT GCGCCACTAC GACGAGCATG GGGACTTCAC CTTAGCCGGC
ATCCCATACG TGGAGATGCG CGACGAAATG CCTCACCTTT TTTCCGAGAT GTTCGAAAGC
GCGGACCGGA TCAAGAGGAT CGTCAACGAT CTCAGGGACT TCGCCCGTGC CGACGCTCCG
GACCTGCGCG AGATGGTAGA CCTGAACCAG AGGGTGGAGT CATCGATCCG CCTTGTCGGT
AATGCGATAA AAACGGCCAC CGACAAGCTG TCCGTTGAAC TGGCCGAAAA TCTCCCTCCG
TTTGTCGGCA ACCCGCAACA CATAGAGCAG GTGACCGTGA ACCTGATCAT GAATGCCTGC
CAGGCGCTCC CGGATAAAGA GAGAGGAATT CACATATCCA CCTGGTTCAG CAAGGAGCGG
AACTCCTCCG TCATCACCGT GAGGGACGAA GGGATAGGGA TCGGCAAAAA AGACATCTCC
CAGATTACCG ATCCGTTTTT CACCACCAAA CGCGCAATTG GCGGCTCAGG CTTAGGTCTG
TCCGTGTCGA TGCGCATCGT CAGGAATTAC GGCGGCTCTC TCGAATTCAG CTCCGAGCTC
TATAAGGGAA CCACCGTCCA CCTTTACCTT CCGGTCAAAC AGGAGGCCAT CGAGGTATGA
 
Protein sequence
MTGCEEAQTH AKIWPYLVLM LGILFSSLMP AAPVAAREHP KTPQHVLILH SYHPGLPWTD 
RVMAGIQSVL GQPGRQINID VEYLDTKRHS DPAYFSHVLD TILHYKFKDR SYDLVLVSDN
EALNFALGHR DGLFRRTPIV FCGVNSAVPV LADKIEGVTG VYADPDFTGV VSQALALHPA
TRQIVVIGST RELSDRINLK QLQALGPAAA PKVAFDFWNN LPADMLRERL AALGAGSIVI
INGSITDRTG SLLSFNDQTK LVRSATNVPI YSFWDVYLGA GIVGGPLVAA HKQGTLAAMI
AWRVLDGEKA SSVPIVHPDQ VPYSFDYNEL IRLGIPLQKL PEQHLLFNAP ASSYQISKSH
FWISVAAVSC LLGVALLLTW SIVQRRRAET GLRESEQKFR ELAQQFQIVL DGIPDSITLI
SKEMKVIWSN KGAVNSVTSH PPSIPLEDCC NLLYNRTSIC DNCPAVKAFI SATAEESIIT
TPDQRILEVQ AFPVSGGAQE ISYVILLASD ITEKTRLLDE TTRTKRLAAL GELAAGVAHE
INNPNALILF NADLVKKACG DAAPILMRHY DEHGDFTLAG IPYVEMRDEM PHLFSEMFES
ADRIKRIVND LRDFARADAP DLREMVDLNQ RVESSIRLVG NAIKTATDKL SVELAENLPP
FVGNPQHIEQ VTVNLIMNAC QALPDKERGI HISTWFSKER NSSVITVRDE GIGIGKKDIS
QITDPFFTTK RAIGGSGLGL SVSMRIVRNY GGSLEFSSEL YKGTTVHLYL PVKQEAIEV
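
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any downstream use. The sketch below is a minimal illustration, not a full translation tool: it cleans the last displayed row of the gene sequence, splits it into codons, and checks that the final codon is a stop codon under translation table 11. It assumes the row starts in frame, which holds if every preceding row carries 60 bases as displayed. The helper names are hypothetical.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def codons(seq):
    """Split a sequence into complete triplets, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# Last row of the gene sequence as shown in the record.
last_row = "TATAAGGGAA CCACCGTCCA CCTTTACCTT CCGGTCAAAC AGGAGGCCAT CGAGGTATGA"

row = clean_sequence(last_row)
triplets = codons(row)

print(len(row))                     # 60
print(triplets[-1])                 # TGA
print(triplets[-1] in STOP_CODONS)  # True
```

The 19 codons before the stop correspond to the final residues of the protein sequence, YKGTTVHLYLPVKQEAIEV.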