Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4044 |
Symbol | |
ID | 8139418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4626920 |
End bp | 4628119 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871660 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003023818 |
Protein GI | 253702629 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.571618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCC CCGCAGATCT CCCCATCCGC GTCAAATTCT TCGGACTGAT GTCGCTACTT TTAATCGCCC TCTTGCTGGC AAGCGGCCTC TTCATCTACA ACCGGCAGAA AGAGTTCGTC GTCAGGTTCG CCGTCGACAA CGCCCGCAGC TTCGCCACCA CCGTAATAGA GACCCGCGAG TACATGTCTT CCGTGGTCAG GGACGAGCCC GAACAAAACT ACAACCTGGT CCCCCAGGTA GTGGCCACCC AGGTCGCAAA GAGGGTCACC CAAAACAGCA AGTTCTACCT GCGCCAGGTC TCGCTGCGCT ACCGCAATCC CAGCAACAAA CCGGACGCCT ACGAGACGAA GCAACTGCAG TACTTCATCA ACAATCCCAA CGCCGAGGTC TACAGCATCG TGCAAAGCGG CGATATCAGC CTCTTCCGTT ACCTGCAGCC GATGCGCGCC ACCGCCTCCT GCCTCGAATG CCACGGCAGC TATGAAACCG CACCCGATTT CGTGAAGAAG CGCTTCCCCC CAGGCCACTA TTCCTACAAC TACAAGGTGG GCGAGGTGAT CGGGGCGGTC TCGGTCAGCA TCCCGGTCAA GGACCTCTAC GCCCAACTGG GCGCTAACCT CAAACTCGAC CTCCTTTTCC GGGCTATGGT CTACGTGATC GTCATCCTGG TGATGGGATT CATCATGAGC CGCCAGATCC TCAATCCCAT CAAGCTCCTC TCCGAACGCA TGATCGCCGT GACCCGCACC GGCAACTTCA AAGACAAGCT GCCGCAGAAG ACCAACGACG AGATCGGCAT GCTGATCGGC TCCTTCAACG AGATGATGGA CGAACTCTCC AGCCGCACCG TCCAGTCGAA AGAGGCGGAC GAGCGCTACC GCCGCTTCAT CGAGGTGGCC GCCTCGGCGG TGATCACCTT CCTCAAGGAC GGCAAGATCG TCATCGCCAA CCAGAAAGCC GAGTCCCTCT TCGGGCGCTC GCGGCAGGAA CTGCTGGGGG AATCGATCTT CAGCTTTCTG GAGGATGGGG CAGCGCTCAA GGATAGGCTT TCCACGCAGA CGGAGTTCCG GGACGAAGCG TCCCGCCAGA TAGTGAACGG CAGCGGCGGA AAACGGACGG AGGTGGAGAT GGTGCTCTCC GTTTCCAGGA CGGACCGGGA GCCGATGTTC ACCGCCATCC TCAGGGAGCG CAGGGGATAA
|
Protein sequence | MTRPADLPIR VKFFGLMSLL LIALLLASGL FIYNRQKEFV VRFAVDNARS FATTVIETRE YMSSVVRDEP EQNYNLVPQV VATQVAKRVT QNSKFYLRQV SLRYRNPSNK PDAYETKQLQ YFINNPNAEV YSIVQSGDIS LFRYLQPMRA TASCLECHGS YETAPDFVKK RFPPGHYSYN YKVGEVIGAV SVSIPVKDLY AQLGANLKLD LLFRAMVYVI VILVMGFIMS RQILNPIKLL SERMIAVTRT GNFKDKLPQK TNDEIGMLIG SFNEMMDELS SRTVQSKEAD ERYRRFIEVA ASAVITFLKD GKIVIANQKA ESLFGRSRQE LLGESIFSFL EDGAALKDRL STQTEFRDEA SRQIVNGSGG KRTEVEMVLS VSRTDREPMF TAILRERRG
|
| |