Gene GM21_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2521 
Symbol 
ID8137863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2948183 
End bp2949898 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content61% 
IMG OID644870130 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003022320 
Protein GI253701131 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0595753 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGG ACACCAACGA GGGCGATAAC CTGTACCGCT ACATCGTCGA CATGATTCCG 
CAGATCGTCT GGACGGCTAC GCCCGACGGA CAACAGGATT TTGCCAACCT CCGGTGGTAC
GAGTTCAACG GACTTACGCC GGGGGAGCCG GATCCCGAAC CATGGCGCAG CATCATCCAT
CCGGACGACG TGGCCATGAC CGCCGAGAAA TGGCAGCATT CGCTGGCGAC CGGGGAACCT
TACTACTGCC TGCACCGAAA CAAAAGGCAC GACGGCGAGT ACCGCTGGAT GCTGTCGCGG
GCACTGGCCC AAAAGGACGA TGAGGGGCGG GTGGTACGCT GGATCGGCAG CGGAACCGAC
ATCACGGAGC AGAAGATCGC CGAGGCGGAG CTGATACGGT ACCGCGACCA TCTGGAGGAA
CTGGTCCGGG AGCGGACGGC TGAACTTGTG CGGGCCAAGG AGACGGCCGA GATTGCGGCC
CGGGCCGTTC AGGAAGCCAA CGAACTGCTG GAAAAGCGGG TGGAGGAGCG GACCGAGGAA
CTGAGGAAGA CCGAAAAGGA GCTGCGCCAG GCACAGAAGA TGGAGGCTGT CGGGACGCTT
GCGGCAGGTA TCGCGCATGA CTTCAACAAC ATCCTCACCT CCATCCTCGG GTTCACCGAC
ATGGTCCTGC ACAAGATTCC GGAGGGAGAA ATGGGGCGGC GGGAGATGGA ACAGGTGTTC
GTCTCGGCGC AGCGAGCCGC GGATCTCGTG CGCCAGATCC TCAGCTTCAG CAGGAGAAAC
GATCAGGAAA GGCAGCCGGT GCATGTCTCC GGCATCATCG AAGACACCTG CAAACTGCTG
CGTTCCTCGC TTCCTGCCAC AGTGGAGTTC GTCACTGAAT TTTTCGTTTC CGAGGATGAT
GACAAGGTCC TGGCCGACCC GATACAACTG CACCAGGTGC TGATGAATCT CTGCACCAAC
GCAGCCCACG CCATGCAGCC CGACGGCGGG ACGCTGACCA TCACCCTGAC CGCGGCGGAG
GCAGGGTCGC CGGGGCTTAC CTCTCTTCCT GTCCTTACTT CGCGGGACTA CATCAGGGTC
GCCGTGAGCG ACACCGGTCG CGGCATTGAG CCGTTGGTGC TGGAGAGGAT CTTCGATCCC
TATTTCACCA CCAAGCCTGC GGGGGAGGGG ACGGGGCTCG GTCTTGCCGT GGTGCAGGGG
ATCGTGAAGA ATCACGGCGG CGCCATCACG GTTCACAGCG AGCCGGGAAA GGGAACCTGT
TTCGAGGTCT TCCTCCCCAC CGTGATAAGC GACGTGCTCG AGGAGGTACA GGTTCGCGAG
CAGCTTCTGC ATGGTTCCGA ACGCGTCCTG TTCGTCGACG ACGAGGAATC GCTCACCGTC
CTCGGCAAGG GGATTCTGGA GGACCTCGGC TACAACGTGG TCACCAGTAA CAGCAGCCGC
AGGGCCATGG AGATGTTCCG TGCCGACCCG GCCCTCTTCG ACCTGGTGAT TACCGACCTG
ACCATGCCGG GATTGACGGG TAAGGCCATC GCCAAAGAGA TCCACGCGCT GAGACCTGAC
ATCCCAATCA TTCTTTGCAC CGGGTACACG GAGAGCTTTG ACGAGAAGGA CCGGGAATAC
GGCATTCGCG CCTGCCTTAT GAAGCCTTAC ACCTCGAAAA TGCTGGGGCG GACCATACGG
ATGGTGCTGG AAGGGAAGAC GACATCAACC TGCTGA
 
Protein sequence
MNLDTNEGDN LYRYIVDMIP QIVWTATPDG QQDFANLRWY EFNGLTPGEP DPEPWRSIIH 
PDDVAMTAEK WQHSLATGEP YYCLHRNKRH DGEYRWMLSR ALAQKDDEGR VVRWIGSGTD
ITEQKIAEAE LIRYRDHLEE LVRERTAELV RAKETAEIAA RAVQEANELL EKRVEERTEE
LRKTEKELRQ AQKMEAVGTL AAGIAHDFNN ILTSILGFTD MVLHKIPEGE MGRREMEQVF
VSAQRAADLV RQILSFSRRN DQERQPVHVS GIIEDTCKLL RSSLPATVEF VTEFFVSEDD
DKVLADPIQL HQVLMNLCTN AAHAMQPDGG TLTITLTAAE AGSPGLTSLP VLTSRDYIRV
AVSDTGRGIE PLVLERIFDP YFTTKPAGEG TGLGLAVVQG IVKNHGGAIT VHSEPGKGTC
FEVFLPTVIS DVLEEVQVRE QLLHGSERVL FVDDEESLTV LGKGILEDLG YNVVTSNSSR
RAMEMFRADP ALFDLVITDL TMPGLTGKAI AKEIHALRPD IPIILCTGYT ESFDEKDREY
GIRACLMKPY TSKMLGRTIR MVLEGKTTST C