Gene GM21_3222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3222 
Symbol 
ID8138574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3737078 
End bp3739474 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content66% 
IMG OID644870827 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_003023007 
Protein GI253701818 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.150716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCC TTTCCATCGA ACAACTCAAG GCGCGCATCG GCATCACCGT CGGAATCATC 
GTCGCCGCGG TGCTGTGCGT CTCCGGCCTG GGTTTCCCCT CCAGCGGCCC GCAACGGCTG
CTCGGCCACC CGCTGCAGCA TTCGCTGCTG ACGCTGGTCT TGTGCCTGCT GATCCTGGCC
CTCACCGCCC TTTTGTTCCG GCACCTGGAC AGCTTGAAAA AGGCGCAGCT CATGGTGGAG
GGTCAGAAGG CGGAACTGGC GACAAAGGCC GCCCAGATCG ACGCCGCCAA CGACAGCATC
CTGCAGATCG ACGAGGAGGG GCGGCTGCTC CATTTCAACC AGACGCTTTG CCGGCTCACG
GAGTACGGTT GCGACGAGTT GGCGGGGATG CGGCTGCAGG AGATCGAACC GCCGGAATAC
GGCGGCCGGT TCGAGGCGAT CGTCGCCGGG CTCAGGGAGT CGAAAGAGGC GACCTTCGAA
AGCGCCTACC TCACCAAAAG CGGGGCTGCC GTTCCGGTCG AGGTCCATTC CCGCCTGCTG
GAGAGCGGGA GTGGGACCCG GATCCTCGCC ATCGCCCGGG ACATCACCGA GAGAAAGCGC
AACGAGCAAA GAGAGCGCTG CCGGTTGAAC ACCCTGGAGC GGATAGCGAC GGACGCCTCG
CTCGACGAGC TTTTGGGGTG CGTGGTGAGA TACGTGGAGC ACGAGATCTC CGGTTCGCTC
TGCTCGGTCC TTTTGGTGGA CGAATCCGGC ACAATGCTGC GCCACGGCGC CGCCCCCTCG
CTCCCCGACG CCTACAACGA GGCGGTACAC GGCCTAAGAA TCGGCAAGGG GAAGGGGTCA
TGCGGGACCG CCGCCTTTCT CAGGCAGAGG GTAGTGGTGG AGGATCTGGG GTCGCACCCG
TACTGGAAGG GGTTCCAGCC CGCCCGGGAT GCGGGGCTTT ACGCCTGCTG GTCCGAGCCG
GTCTTCGCCT CCAACGGCAT CCTGCTCGGG ACCCTCGCGG TCTATCACCG CGAGCCCCGC
GCCCCCGAAA GCGACGATCT GCGCCTCATG GAGTCGGCGG CCCATCTGGC CGGGATCGCC
ATCGGAAGGG TGCGGGCCGA CGACGGCAGG CGGGTTTTGG AGGATCAACT GCGCCACAGC
CAGAAGATCG AGGCAGTGGG GCAGCTGGCG GCTGGGGTGG CTCACGACTT CAACAACCTG
CTGACGCCGA TCATAGTCTA CGCCGACATG CTGCAGCAGT ACTGCCCGGC GGGAAGCACC
CAGGCGCGCA TGGTGGACGC CATGGGCCAG GCGGCCCAGA AGGCGGGGGA TCTGACCCAG
AAGCTCCTCT CCTTCGGACG CAAGCAGGCG CTCCACGTGG AGCTGTTGGA CCTGAACGAG
GTGATCGCCT CCTTCAGGGA CATCCTGCGT GCCACCGCGC GGGACAGCGT CACCATCGAC
CTGCGGCTTT CCCCGGGGGC CGCCAAGGTG CAGGCGGACC GGGGGCAACT GGAGCAGGTT
CTTTTGAACC TACTTTTGAA CGCCCTGGAC GCCATCGACG GGACCGGCTC GATCTGCGTC
GAGACCGGGC ACCTGATACT GGACGAGGAG TTCCACAGGC AGCACCCGGT CGCCAAGCCC
GGGCACTACA TACTGCTCGC CTTCTCGGAC GACGGCTGCG GCATGGCGGA GGAGACGCTC
AGGCACATGT ACGAGCCTTT CTTCACCACC AAGGAGACCG GACGCGGCAC AGGCCTAGGT
CTCGCCACCG TGTACGGCAT CGTCAAGCGC CACGGCGGCT GCATCGACGT CAAGAGCCGC
CCCGGAGAGG GGACCAGGTT CGGGATCTAT CTGCCGGCCA GTGCCAGCAG CGCCGAACCT
GTCACGGCCT CCCCCTCGCG CGCCGCGATC CCCCCCGCCG CCGGAAAGGG AAAGACCATA
CTCCTGGTCG AGGACAACGC CATGATCCGC GAGGTGGCGG AGGAGCTCCT CTCCTCCTTC
GGCTACGCGG TCTTGGCGGC CCAAAGCCCG GCCCGGGCGC TGGAGCTTGC CAAGGAGCAG
CAAACGATAG ACCTTTTGGC GACCGACGTG GTCATGCCCG AGATGACCGG ACCCGAGCTC
TACGAGAAAC TGCTTGAAAA TTACCCGGGG CTACCGGTAC TGTACATCTC CGGCTACAGC
GCGGGGCTCT TGCCGCTTGA CGAGAGCCAG CGGCAGGACG CCATCTTCCT GGCCAAGCCC
TTCACCCTGG AACAGTTCAT GGGCAAGATC GGGGAGATGC TCGGCTCGGA CCTCCTCCAG
GCGGTGCAGG GGCCGCTGGA CGAGGCGATG GCCAGGCTGG CGACAAGAAG CTCCCGGCGC
GTCAAGGACC CGGCGGATAT ATCCGCCACA AGCATAAAGG AAGGTGCACA TGAATAA
 
Protein sequence
MSRLSIEQLK ARIGITVGII VAAVLCVSGL GFPSSGPQRL LGHPLQHSLL TLVLCLLILA 
LTALLFRHLD SLKKAQLMVE GQKAELATKA AQIDAANDSI LQIDEEGRLL HFNQTLCRLT
EYGCDELAGM RLQEIEPPEY GGRFEAIVAG LRESKEATFE SAYLTKSGAA VPVEVHSRLL
ESGSGTRILA IARDITERKR NEQRERCRLN TLERIATDAS LDELLGCVVR YVEHEISGSL
CSVLLVDESG TMLRHGAAPS LPDAYNEAVH GLRIGKGKGS CGTAAFLRQR VVVEDLGSHP
YWKGFQPARD AGLYACWSEP VFASNGILLG TLAVYHREPR APESDDLRLM ESAAHLAGIA
IGRVRADDGR RVLEDQLRHS QKIEAVGQLA AGVAHDFNNL LTPIIVYADM LQQYCPAGST
QARMVDAMGQ AAQKAGDLTQ KLLSFGRKQA LHVELLDLNE VIASFRDILR ATARDSVTID
LRLSPGAAKV QADRGQLEQV LLNLLLNALD AIDGTGSICV ETGHLILDEE FHRQHPVAKP
GHYILLAFSD DGCGMAEETL RHMYEPFFTT KETGRGTGLG LATVYGIVKR HGGCIDVKSR
PGEGTRFGIY LPASASSAEP VTASPSRAAI PPAAGKGKTI LLVEDNAMIR EVAEELLSSF
GYAVLAAQSP ARALELAKEQ QTIDLLATDV VMPEMTGPEL YEKLLENYPG LPVLYISGYS
AGLLPLDESQ RQDAIFLAKP FTLEQFMGKI GEMLGSDLLQ AVQGPLDEAM ARLATRSSRR
VKDPADISAT SIKEGAHE