Gene Information |
Locus tag | GM21_1079 |
Symbol | |
ID | 8136401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1265448 |
End bp | 1266602 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868690 |
Product | Hpt protein |
Protein accession | YP_003020898 |
Protein GI | 253699709 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2198] FOG: HPt domain |
TIGRFAM ID | |
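
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1265448
END_BP = 1266602


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, "bp")                      # matches the Gene Length field
print(protein_length_aa(length), "aa")   # matches the Protein Length field
```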
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 3.57713e-26 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCGAC GGTTCCGTGA CCTCTCCATC AGGCGTAAGC TGCTCGCCAT ACTGCTTCTT ACCAGCGCAG TAGTGCTCTC CCTCGTTTCC ACCGCTTTCG TCATCACCGA GGCGACCGGG TTCCGTAGCG GCATGCAGAC CGAGCTGAGT GCGCTGGCGG AGATCGTGGG AAGCAACAGC TCAGCCGCGG TAGCCTTCAA CGACCGCAAA TCGGCGGCCG ATACCCTGGC CGCGCTGCGC GCCAAACCGT ACATACTGAC CGCGCTGGTC GTGCTGAAGG ACCACTCACT CTTCGCAAGC TATGTAGCGC CGGGCGCCAC GCTGCGGGAT CTAGGCTTTA TCGACGGTTC CGGCGAGAGC GCGCGTGTGG ACGACCGGAA GTTGAGGGTC GAGTCGGCCC GCGCCAGCTT CCCGCTTGCC TTAGGCGACC ACATCTTCGG CATCTCCCCC ATCATCCTGG ACGGCCAACA GTTGGGAACC GTGGTGGTTC TGTCCGATTC CACGGCATTA AAGCACCGGT TGAAACCGTT CTTCCTCATG CTGGCGGGGG TGCTGCTGGG CGCGCTGTCG CTGGTGTATT TTCTAGCCGC GAAGCTGCAA CGCATCATCT CCGAACCCGA CTCGCACCTG GCGCAGGTCA TGAAAGCGGT CTCCACCGAC AAGAGCTACA ACCTCAGGGC GCGAAATCAG CAGGGGAACG ATGAACTGGG GACGCTTATC GACGGCTGCA ACGAGATGCT GAGCGGTGCC GCACCCACGG CCGAAGCGGC CGCTTCCCCC GTGGAAACCG CACCTGCCGG GGAGGGGGGT GATACTTCCC CGCCGCCGGT GTTCGATCGG GCGGGCCTGC TTTACCGGGT GGGAGACCCC GAGTTCATTG GCGTGTTCGT GGAGAAGTAC CTGGCTAGCA CGGAGCAGTT GCTGGGGCTT TTGAGACAGG CCATAGCGGA CGGGGATCAA GACGGCATGC ACCTGCATTC CCACAGCATC AAGGGGGCCG CGGCCAGCAT AGGTGCCGAG GTGATGCGGA GCATTGCGTT CGAGATGGAG AAAAAGGGAG CGCAGCAAGA AGACGTTGAG GGGATGACGA GGCTTTACCA GGATCTCGAG GAGGCGTTCG ACGAGTTCAG GAGGGAAGCG GCGCAGCCTG AGTGA
Protein sequence | MLRRFRDLSI RRKLLAILLL TSAVVLSLVS TAFVITEATG FRSGMQTELS ALAEIVGSNS SAAVAFNDRK SAADTLAALR AKPYILTALV VLKDHSLFAS YVAPGATLRD LGFIDGSGES ARVDDRKLRV ESARASFPLA LGDHIFGISP IILDGQQLGT VVVLSDSTAL KHRLKPFFLM LAGVLLGALS LVYFLAAKLQ RIISEPDSHL AQVMKAVSTD KSYNLRARNQ QGNDELGTLI DGCNEMLSGA APTAEAAASP VETAPAGEGG DTSPPPVFDR AGLLYRVGDP EFIGVFVEKY LASTEQLLGL LRQAIADGDQ DGMHLHSHSI KGAAASIGAE VMRSIAFEME KKGAQQEDVE GMTRLYQDLE EAFDEFRREA AQPE
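
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), so a standard codon table is used here. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequences into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGCTGCGAC GGTTCCGTGA CCTCTCCATC"
print(translate(fragment))  # first ten residues of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared against the Protein sequence and GC content fields.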