Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3301 |
Symbol | |
ID | 8138663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3837501 |
End bp | 3838520 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644870914 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_003023084 |
Protein GI | 253701895 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.0109844 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAGAA ACTGGCGCGA TCTGATCAGA CCTAAGAAGC TCCAGGTTGA GACGGAATCG CTCACAAATA CATACGGGAA GTTCTTTGCC GAACCCTTCG AAAGGGGCTT CGGCACCACC CTGGGAACAG GTCTGCGTCG GGTCCTCATC TCGAGCCTGC AGGGGGCAGC CATCGTCTCC GTGAAGGCGA AGGGCGTACT GCACGAGTTC TCCGCAGTCC CGGGCGTGAC CGAGGACATG ACGGACATCA TTCTGAACCT CAAGGGTGTG CGCCTCAAAG TGCACGGCAA CGAGTCCAGG ATGATCAGGA TCGTCCAGAA GGGCGAAGGT GTGGTCAAGG CCAAGGACAT CATCACGGAC AACAACGTGG AGATCCTGAA CCCCGAGCAC CACATCGCGA CCTGCTCCAA GGACGCGAAC CTCGAGATGG ACCTCATGGT CAAAGTCGGC AAAGGGTACG TCCCCGCTGA CCGCAACCGT GACGAGAAGG CTCCGGTCGG GACCATCCCC ATCGACGCGA TCTTCTCCCC GGTCCACAAG GTGAACTTCA CCGTAACCAA CGCTCGCGTA GGTCAGATCA CCGACTACGA CAAGCTCACC ATCGAGCTCT GGACCGACGG CAGCGTCAAG CCGCAGGACG CCGTGGCCTA CGCTTCCAAG ATCCTCAAGG ACCAGCTTTC CATCTTCATC AACTTCGATG AGGACGTGGA GCCCCAAGAG GAGGCGGAAC CGGAGGAGGA GCGCGAGCGC TTCAACGAGA ACCTGTACCG CTCAGTGGAC GAGCTGGAAC TCTCGGTTCG CTCCGCGAAC TGCCTGAAGA ACGCAGGGAT TAAGCTGATC GGCGAACTCG TTTCCAGAAG CGAAGCCGAG ATGCTTAAGA CCCAAAACTT CGGCAGGAAA TCTCTGAACG AAATCAAGGA CATCCTCGTC GACATGGGCC TCACCCTCGG CATGAAACTG GAGAATTTTC CGGATCCCGA GATCATGAGG CGCCTGCGCG GCGAGCAGAA AGAAGAATAG
|
Protein sequence | MYRNWRDLIR PKKLQVETES LTNTYGKFFA EPFERGFGTT LGTGLRRVLI SSLQGAAIVS VKAKGVLHEF SAVPGVTEDM TDIILNLKGV RLKVHGNESR MIRIVQKGEG VVKAKDIITD NNVEILNPEH HIATCSKDAN LEMDLMVKVG KGYVPADRNR DEKAPVGTIP IDAIFSPVHK VNFTVTNARV GQITDYDKLT IELWTDGSVK PQDAVAYASK ILKDQLSIFI NFDEDVEPQE EAEPEEERER FNENLYRSVD ELELSVRSAN CLKNAGIKLI GELVSRSEAE MLKTQNFGRK SLNEIKDILV DMGLTLGMKL ENFPDPEIMR RLRGEQKEE
|
| |