Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2502 |
Symbol | |
ID | 8137844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2928796 |
End bp | 2930067 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644870111 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003022301 |
Protein GI | 253701112 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAAC AACTGCCGGC GGCTAATCGG GGCCACGCCC TGCAACCTCC GCCCGGATTC GAACCTGAAG ACGAGACAAT GCCTATTCGA CAGTTGCGCA ACTTCGCCAA ATTATCCGAA AACCTCAACG GTGCCGAGGC GGCCAACCGG GTTTTGTCAG CCACCATCGA ACATACCCGC GACGGCATCA TGGCAGTGGA CGGAAACGGG GAGATCGTCG CGTGCAACCG GCGTTTCCTG GAGATGTGGG GCATAAGTAA CGATGCCCTG CCCTGCGGAG ATTCCGACAA ATTGCTCCTG TCGCTCTTGG GTCAGGTACG CGACCCGGTG CTGTTCTTCG AGAACTTCAG CGAGATGCAG TGGCAGCCGA ACCGGGAGAG CTACGACGTA GTGGAACTGA ACGACGGCAG GTGTTTCGAG CGCTTTTCCA GGCCGCACTA TCTGGATTGG AAGACGGCCG TGCGGATCTG GACCTTTCAC GACATCAGCG AACTGAAGAA GATGGAGAGC CAGCTGCTGC ACGCTCAGAA GATGGAGGCA ATAGGGACCC TCGCCGACGG CATCGCCCAC GACTTCAACA ACATCATGAC CGCCGTGATC GGTTACACGG ACCTGCTCAT GACCGAATTA TCCCCTCCCG CGCCTTACCG AGGATTTCTC GAGAACATAA ACACCGCCAC TCATCGCGCC ATCACGGTGG TGAAGAACCT GCTCGCCTAT TCGAGGCAGG AGCCGATGCA GACAACGAGG ATCCTTGGCA ACGACCTGAT CGAAGGGATT TTTGTGCTTT TGAAGAGGGT GGCGGGGCAA GGTATCGAGC TGGCATGGGA GCCGGCCCCC GACACCCTCC CGATAATGGT GGACCAGGCG CAGATGGAGC AGGTATTGGT AAACCTGACC GCCAATGCCA GGGACGCCAT GCCTCAAGGG GGAACGCTCA CCATAACGGT CGACGCTGTA GATCTCGCCC CCCACGAGGT CCAGGGGTAC GACCACACAC GGCCAGGGCC CCATGTGCGC ATCGCCGTTT CCGATACCGG CTCCGGGATC GACCAGGAGA CACAGAGCAG GATCTTCGAT CCTTTCTTCA CCACCAAGGA GGCGGGAAAC GGGACCGGGC TCGGGCTTTC CATCAGTTAC GGCATAGTCA AGCGCCATGG CGGCTTCATC CGCGTCGCCA GCGAAAACGG CGAGGGGACC ACGTTCTCCA TCTTGCTCCC CAAGGCTGAA GCTTCCCCCA ACCGACAGAA ACAGCCTGCC CCGTCGAACT AG
|
Protein sequence | MSQQLPAANR GHALQPPPGF EPEDETMPIR QLRNFAKLSE NLNGAEAANR VLSATIEHTR DGIMAVDGNG EIVACNRRFL EMWGISNDAL PCGDSDKLLL SLLGQVRDPV LFFENFSEMQ WQPNRESYDV VELNDGRCFE RFSRPHYLDW KTAVRIWTFH DISELKKMES QLLHAQKMEA IGTLADGIAH DFNNIMTAVI GYTDLLMTEL SPPAPYRGFL ENINTATHRA ITVVKNLLAY SRQEPMQTTR ILGNDLIEGI FVLLKRVAGQ GIELAWEPAP DTLPIMVDQA QMEQVLVNLT ANARDAMPQG GTLTITVDAV DLAPHEVQGY DHTRPGPHVR IAVSDTGSGI DQETQSRIFD PFFTTKEAGN GTGLGLSISY GIVKRHGGFI RVASENGEGT TFSILLPKAE ASPNRQKQPA PSN
|
| |