Gene Information |
Locus tag | GM21_3096 |
Symbol | |
ID | 8138446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3585769 |
End bp | 3587748 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644870700 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003022882 |
Protein GI | 253701693 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| |
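The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the record lists the gene as unspliced, and the gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for GM21_3096.
START_BP = 3585769
END_BP = 3587748

length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1980 659
```

Strand does not affect this check: the gene lies on the minus strand, but the start and end values are still given in ascending order on the replicon.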
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 113 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAATG TCATCGAGAC CGGCGCTGTA CCGGTACCGC ACGATCTTTT CGCACCCGCG CTGGAGGGAA ACCTCGCCGA GAACCAGCAG TACCTCGCCA CTATCCTGGC CACCGCACAG GTAGGGATAC TGGTGATAGA CAGCGAGTCC CACGTCATCG TGGAGGCGAA CCCCAAGGCC GTGGAGGTGA TCGGGGTGCC ACGGGAGGAG ATCATCGGCT CGGTCTGCCA CCGCTTCATC TGCGTGGCGG AGCTGGGGCA GTGCCCGGTG ACCGACCTCG GGCAGTGCAT CCATTGCGGG GAGCGGGAGC TGATCACGGC CAAAGGGGAG CAGGTGACGG TCGTGAAGAC CGTGGCGAGT ATCACGCTTG GGGGCAGAGC GTATCTTGTC GAGACCTTCC TTGACATCTC CGACCGCAAG AAAGCGGAGC AGGCGCTGCA ACGGAGCGAG GAGCGCTGCC GCGACATTCT GGACAACGCC AACGACCTGA TCCAGAGCGT CGACGCAAAC GGCGCCTTCA TCTACGTGAA CCGGGCCTGG AAGCAGACCA TGGGGTACAG CGACGAGGAG GTCTCCCGCC TCACCATCTT CGACGTCATC GCCCCCTCCA GCAAGGAGCA CTGCTCCCTT TTGTTCCGCA GGATCATGAA TGGAGAAAAA GTCCCGGTGG TCGAGACCGA GTTCATCACC AAGGACGGGT CGGTGGTGGT GCTCGAGGGG AGCATCAACT GCAAACACCT GGGGGGGCAG CTTCTGGGAA CGCGCGGCAT CTTCCGTGAC ATCACCGAGC GCAAGAGGAT GCAGGCGGAA CTGATGCAAA GCGAGGACCG CTACCGCAAG CTCTTCGAGA ACGCGCCGGT GGCTATCGTG GTGCAGTGCG AGGGGGTCTA CGTCTGCGCC AACAACGAGG CGTGCCGCAT GCTGGGGCGC GACCTGGTCG GCGTCGATGT CCTCTCCACC GTCCATCCCG ACTACCGCGA CACCGTAATG GAGCGGATCC TGCGGGTGAG CGAGACAGGC GAGCCGTCGC CCCTTCTGGA GCAGAAGATG CTCCGCTTGG ACGGCAGCAG CATCGATGTG GAGGTGACCG GCAGCAGCAT CGTCTTCAAG GGGAAAAAAG CGACCCAGGC GGTGATCCGG GACATCACCG AAAGAAGGCT TGCCGAAGAG CAGCGCCGCG AGTGGAACCT CAGGCTGGAA AAGGAGGTGG AGGCGAAGAC CAGGCACCTC AAGGAGGCGC AGGCGAAGCT GATCCAGTCG GAGAAGATGG CGACCCTGGG CGAGGTGATC TCCGGAGCCT CGCATGAACT AAATAACCCG CTCGCCGGGA TCCTCGGGGC GATCCAGATG CTCAGAAAGA GCGCGCTGGC CCAGCCGATC GAGCCGGAAC TCCTGGAGGG GATCGACGTC CTGGAAAGCA TCGAGAGCGC CGCTATACGC TGCCAGAACA TAGTCGCCGA CCTGATCCGC TTCTCGACCC AGGCCCACTG CAACTTCAGC GAGATCGACA TCAACCAGGT GCTCAGGGAC ACCCTGGAGA TCATGGCCGC CCCCTTCGCC GATCTGGGGA TCCAGGTGGA GCTTGACTCC GATCCGGCGG TGCCGCTGAT AGAGGGGGAT TTCGTCAAGC TCCTCGAGGT GTACGTGAGC CTTTTGCGCA ACGCCCAGAA CGCGCTTCCC GACGGGGGGA CGATATACCT CGGCACCAAG GTGGTGAAGA ATTACGGCGA GCCGCCGCAG GTGGCGGTCA CCATCCGCGA CACCGGGTGC GGCATCCCTC CCCAAAACCT CTCCAAGATC 
TTCGATCCCT TCTTCACCAC GAAGCCGGTC GGGCGCGGGC CCGGGCTCGG GCTCACGGTG AGCTACGGCA TAGTGAAACG CCACGGCGGG GATATCGACG TGCGCAGCAC GGTGGGGAAG GGGACCGAAG TGACCGTGAC CGTGCCGCTG CGGCAGCCGA AACCGGGAAG CCTCTCCTGA
|
Protein sequence | MSNVIETGAV PVPHDLFAPA LEGNLAENQQ YLATILATAQ VGILVIDSES HVIVEANPKA VEVIGVPREE IIGSVCHRFI CVAELGQCPV TDLGQCIHCG ERELITAKGE QVTVVKTVAS ITLGGRAYLV ETFLDISDRK KAEQALQRSE ERCRDILDNA NDLIQSVDAN GAFIYVNRAW KQTMGYSDEE VSRLTIFDVI APSSKEHCSL LFRRIMNGEK VPVVETEFIT KDGSVVVLEG SINCKHLGGQ LLGTRGIFRD ITERKRMQAE LMQSEDRYRK LFENAPVAIV VQCEGVYVCA NNEACRMLGR DLVGVDVLST VHPDYRDTVM ERILRVSETG EPSPLLEQKM LRLDGSSIDV EVTGSSIVFK GKKATQAVIR DITERRLAEE QRREWNLRLE KEVEAKTRHL KEAQAKLIQS EKMATLGEVI SGASHELNNP LAGILGAIQM LRKSALAQPI EPELLEGIDV LESIESAAIR CQNIVADLIR FSTQAHCNFS EIDINQVLRD TLEIMAAPFA DLGIQVELDS DPAVPLIEGD FVKLLEVYVS LLRNAQNALP DGGTIYLGTK VVKNYGEPPQ VAVTIRDTGC GIPPQNLSKI FDPFFTTKPV GRGPGLGLTV SYGIVKRHGG DIDVRSTVGK GTEVTVTVPL RQPKPGSLS
|
| |
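The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how the two sequence fields relate, the snippet below removes that grouping from the first two blocks of the gene sequence and translates the complete codons. The codon map is deliberately partial: it holds only the six codons in this fragment, using the standard assignments that translation table 11 shares. The helper names `ungroup` and `translate_prefix` are hypothetical.

```python
# Partial codon map: only the codons present in the fragment below.
# Any other codon raises KeyError, which is intentional for this sketch.
CODONS = {"ATG": "M", "TCC": "S", "AAT": "N", "GTC": "V", "ATC": "I", "GAG": "E"}


def ungroup(seq: str) -> str:
    """Remove the display grouping (spaces, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_prefix(dna: str) -> str:
    """Translate complete codons from the first base; trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First two 10-base blocks of the gene sequence, as displayed in the record.
fragment = ungroup("ATGTCCAATG TCATCGAGAC")
peptide = translate_prefix(fragment)
print(peptide)  # MSNVIE
```

The result matches the first six residues of the listed protein sequence (MSNVIETGAV ...). Translating the full gene would need a complete table 11 codon map, for example from a sequence-analysis library.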