Gene Information |
Locus tag | GM21_4032 |
Symbol | |
ID | 8139406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4615370 |
End bp | 4616389 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871648 |
Product | hypothetical protein |
Protein accession | YP_003023806 |
Protein GI | 253702617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
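The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for GM21_4032.
length = gene_length_bp(4615370, 4616389)
residues = protein_length_aa(length)
print(length, residues)  # 1020 339
```

Because the gene lies on the minus strand, Start bp and End bp still give the lower and upper replicon coordinates. Strand affects the reading direction, not the length.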
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.000000100028 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCCAC TCTGTGCCAG GATCAAGGAA AGGGTTGCGC TGGAGTTGCG GGAGGCGGTG GATCCGCCGG CGCGGGAGGC GCTCTTGATA GCGGCCCTTT ACGAGGTGAG CCGCAGCGTG CTCGCGGACC TCGGCGCGCC GAGGAAATCC GAAGAATACC TGAAGGCGCG CGACGAGGTC GTGGATCTGG TGAGCTCCAT CGCCATTCCG CTGGTAAGGG GGGGAGGGGA GAGGGGGATC GCCGTTTTTT CAGAGCTTGC GGAGTGTTGC GCCGACCCCC TCTTGAAGAA GAAGCTTTCG GTCTACGCCC AGGAGCTCCT GGCCAAGTCG AGGACTGCGG CCCGGGAGAA GGCGGGGGGC GCGAGGCCGA TTGCCGCTTG GGGCGCAGCC GTCTGCACGG TCGGCACCAT TGCCTGTTAC CTCTTCAACC ATGGCGCCGT CGCCGGCGGG GGAGGGGAGA GGCCGCGCCC CGCCGCGTCG GTACGGACTC CGGCGCTGGT CGAGGCGCCC GTCCTTCCGC CGACCTCGCC GATGCAGCAG CGAAATGCCG AGGCGTCTTC CCCCCCCGGG CTAAGCGAAG CGCTGACCCG GAAGGAGCAA GAAACTCCCG CGCCTGCGGG CTCAAACCGC GGCGAACAGG CAACCCCGGT GCGGCTGGTG AATGGCCAGC TGCTGGTCCC CGTGACGCTC AGGCACGGCG GGGTATCGGT ACAGGTCGAA CTGGTGGTGG ATACCGGCGC GACCAGGAGT GTCGTGCACG AGGGGGTTTT CGCCAGGCTC CCCATCGACC CCGGCTCCGC CCGAAGCTCC GTCTCGGAGG TGGCCGACGG GACTCTGGTC CGCTCCCGGG TCTTCAGGGT TGAGCTTTTG AAGGCGGGGC CCTTCGCCCA CCACTCGATG GAGCTGGAGG TGATACCGTT CAGCGGCGGC GGAGTACACG ACGGTCTGCT CGGCATGGAT TTCCTGGGGA AGCACCCGCA CCAGATCGAC ATGGAACGCC GGCTGATCCG CTGGTTTTGA
|
Protein sequence | MDPLCARIKE RVALELREAV DPPAREALLI AALYEVSRSV LADLGAPRKS EEYLKARDEV VDLVSSIAIP LVRGGGERGI AVFSELAECC ADPLLKKKLS VYAQELLAKS RTAAREKAGG ARPIAAWGAA VCTVGTIACY LFNHGAVAGG GGERPRPAAS VRTPALVEAP VLPPTSPMQQ RNAEASSPPG LSEALTRKEQ ETPAPAGSNR GEQATPVRLV NGQLLVPVTL RHGGVSVQVE LVVDTGATRS VVHEGVFARL PIDPGSARSS VSEVADGTLV RSRVFRVELL KAGPFAHHSM ELEVIPFSGG GVHDGLLGMD FLGKHPHQID MERRLIRWF
|
| |
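The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene is on the minus strand. Under that assumption, the sketch below shows how the protein sequence and a GC fraction could be recomputed from the nucleotide text. It builds the codon table from the amino-acid assignments of the standard genetic code, which translation table 11 shares for elongation codons (the two tables differ in permitted start codons). The helper names are hypothetical, and no handling of ambiguous bases is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order, first base varying slowest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    # Ignore any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGATCCAC TCTGTGCCAG GATCAAGGAA"))  # MDPLCARIKE
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing spaces) is a quick consistency check on the record. `gc_fraction` applied to the same field can be compared against the reported GC content.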