Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3361 |
Symbol | |
ID | 8138728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3889846 |
End bp | 3892116 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870979 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_003023144 |
Protein GI | 253701955 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATTT CAGCCAAAAA AGGGGGAACG CCCCCGCAAT ACCAGGGGCC GGAACTCTCC GCCGGTGAAG TGAGGAAAAG AAAGCGCGAG GCGATAATCG TCTGTATCTC GCTTTTAACC ATCTGCCTCC TCACCTATCT GGAGATCCAC CTCTCCCGCC TAAGCGAGCA GGTGCCGATG GGCAGCAACA TCGCCATCTT CGGCATGGTC AACCTGATCA TCCTCCTCAT CATCCTGCTG GTCTACCTGG TCTTCAGAAA CATCGCCAAG CTGGTCCTGG AACGACGCAA GAACACGCCG GGGGCGAAGC TGCGCACCAA GCTGGTGCTT GCCTTCGTCA CCCTCTCGCT GCTCCCGACC ATGCTGCTGT TCTTCGTCTC CGCCGGCTTC ATCAAGAACA GCATCTCGAA CTGGTTCAAC AAGCAGGTGG AGACCTCGCT CAACGAGTCG ATGGAGGTGG CCCAGGTCTA TTACAAGACC TCTGCGGCCA ACGCCCTCTA CTACGGCGAG CAGATCAGCA CCGCCATCAA GGAACGGAAG CTTCTGAACG AGGAGAACCT CCCCGAGCTG AAGGCTCTGG TGCGCCAGAA ACAGACCGAA TACAACCTGG GGGTGGTCGA GGTCTTCTCG GCGCAGCGCG AGGAGCTGTT CCGGGCCGCG AACGCCAAGC TGCCGCTGGG GGAATTCACC AACCCCTCGT CGGAGGATAT CCAACGCGTC CTCTCCGGGG CCATGCTGAC CCGCGTCAAC GCCATCGGCA AGGCGGACCT GATCCGCGGC ATCGTCCCCA TCCGCAGCAA CTTCAACGAA AAAGACGTGG TCGGGGTGGT GGTTGTCAAC TACTACGTCC CCTACTCGCT GGTGTCCAAG ATGCGGGAGA TATCCGCCTC CTATCAGGAG TTCCGCCAGC TGAAGATCCT GAAGAACCCG ATCAGGACCG GTTACATACT CACCCTGTTC CTGATTACCA TGGTGATCCT CTTCCTGGCC GTATGGTTCG GGGTGTACCT TGCCCGAAGC CTCACCATCC CGATCCAGGA ACTGGCCGAG GCGACCCGGC AGGTGGCCGA GGGGAACCTG GACGTGCATC TGGGGGAGAG CGGGGGGGAC GAGATCGGCA TGCTGATTAC CTCCTTCAAC CGGATGACCG AGGACCTCCG GGCGAACCAG CTCGCGCTGC AGCACACCAA CGAGGAACTG CAAAAGAGCA ACCTCGAGCT GGAACAGCGC CGCCGCTACA TGGAGGCGGT GCTCGCCAAC GTCACCGCCG GCATCATCTC GGTGGACAAA AACGGCCTGC TCACCACGGT CAACAAATCT GCGGAAAAGC TCCTCCTCAT CAACACGGAC AAGGTCACCG GGCAGAACTT CCGCGAGGTG CTGCACCCCG AGCACCTGGA CATCGTCAAG GGGCTCTTGC GGGACATGGT GCTTGCCAAG CACGACTCCA TCGTGCGGCA GGTGGTGATC CCGATGCGCG ACGCGGAGCT CACCCTGCTC ACCAACCTCA CCGTCCTGAA GGACGAAAAC GACTCCTTCA TGGGGATGGT GGTGGTGCTG GACGACATGA CCTCGCTGAT CAAGGCGCAG CGCATGGCCG CCTGGCGCGA GGTGGCCCGC AGGATCGCCC ACGAGATCAA GAACCCGCTC ACCCCGATCC AGCTCTCCGC CCAGCGGCTG AGGAAGCGCT ACCTCACCCG CTTCGAGGGG GAGGAGGAGG TGTTCGACCA GTGCACCGCC ATGATCATCA AGTCCGTGGA CGAGCTGAAG GGGCTGGTGA ACGAATTCTC CAACTTCGCC CGGATGCCGG CCGCGGTCCT GAAACCAAAC GACCTGAACG GGATACTCAA GGAGGCGCTC ACCCTCTACG ACGAGGCGCA CCGGCACATA CACTTCGTGT TGAACGCCGA CGAGGAACTC CCCCCGATCC TTTTGGACCG CGACCAGATC AAGCGGGTAG TGATCAACCT CTTGGACAAC GCCGTCGCCG CCATAGAGGG GGATGGAGAG GGTGTGGTCG AACTCAGCAC CAGCTACGAC AGCCAGCTGA AGATGGTCAC TTTCACCGTT TCCGACACCG GCCACGGCAT ATCCGCCGAG GACCGCCCGC GGCTCTTCGA GCCGTACTTC TCCCGGAAAA AGAGCGGCAC GGGGCTTGGG CTCGCCATCG TCAACACCAT CATCACCGAC CACCACGGCT TCATCAGGGC CAAGGAAAAC TACCCCAAGG GGAGCAGGTT CGTCATCGAG CTCCCCGCTG ACGCGGCGTA G
|
Protein sequence | MPISAKKGGT PPQYQGPELS AGEVRKRKRE AIIVCISLLT ICLLTYLEIH LSRLSEQVPM GSNIAIFGMV NLIILLIILL VYLVFRNIAK LVLERRKNTP GAKLRTKLVL AFVTLSLLPT MLLFFVSAGF IKNSISNWFN KQVETSLNES MEVAQVYYKT SAANALYYGE QISTAIKERK LLNEENLPEL KALVRQKQTE YNLGVVEVFS AQREELFRAA NAKLPLGEFT NPSSEDIQRV LSGAMLTRVN AIGKADLIRG IVPIRSNFNE KDVVGVVVVN YYVPYSLVSK MREISASYQE FRQLKILKNP IRTGYILTLF LITMVILFLA VWFGVYLARS LTIPIQELAE ATRQVAEGNL DVHLGESGGD EIGMLITSFN RMTEDLRANQ LALQHTNEEL QKSNLELEQR RRYMEAVLAN VTAGIISVDK NGLLTTVNKS AEKLLLINTD KVTGQNFREV LHPEHLDIVK GLLRDMVLAK HDSIVRQVVI PMRDAELTLL TNLTVLKDEN DSFMGMVVVL DDMTSLIKAQ RMAAWREVAR RIAHEIKNPL TPIQLSAQRL RKRYLTRFEG EEEVFDQCTA MIIKSVDELK GLVNEFSNFA RMPAAVLKPN DLNGILKEAL TLYDEAHRHI HFVLNADEEL PPILLDRDQI KRVVINLLDN AVAAIEGDGE GVVELSTSYD SQLKMVTFTV SDTGHGISAE DRPRLFEPYF SRKKSGTGLG LAIVNTIITD HHGFIRAKEN YPKGSRFVIE LPADAA
|
| |