Gene Information |
Locus tag | GM21_3063 |
Symbol | |
ID | 8138409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3552238 |
End bp | 3553704 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870663 |
Product | NHL repeat containing protein |
Protein accession | YP_003022849 |
Protein GI | 253701660 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
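The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3552238
end_bp = 3553704

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-nt stop codon,
# which does not contribute an amino acid.
protein_length = (gene_length - 3) // 3

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1467 bp) and Protein Length (488 aa) fields.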
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.440395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCAAGC TAATTAGAAT TGGGTGTCTG CTGTTGCTGG GGTCGGCCTT GTGCGGCTGT TACGAGATGC GCCCCACCTC CGGCGGGGGA AAGCTCACCG TGCTGCCTGA GGACGGGGTA CGGGCGCTGA GCCCAGAAGA CATCGCCCTG CCTTCTGGGT ACCGCATAGA GGCAGTAGCC ACCGGTCTCA CCTTTCCCAG CGGCGTCGCC TTCGACGAGA GAGGCACGCC GTTTGTGGTC GAGTCGGGGT ACTCCTATGG CGAGGTCTGG ACGGTGCCCC GGCTGCTCAA GGTCGAAAAC GGCAAGAAGG TTACCGTGGC GCAGGGTGGG AACAACGGCC CTTGGACCGG AGTTGCGAGG CTTGGCGGCT CCTTTTACGT GGCTGAGGGG GGAGAGCTTG AGGGGGGGAG GATCTTGCGC ATCGACGCCG ACGGCAGCAT CAATCCCATC GTGGATGGTC TCCCAAGCCG CGGCGATCAC CACACCAACG GCCCCGCGGC GGGCCCTGAC GGCTCGCTCT ATTTCGGCAT CGGCACGGCC ACGAACTCCG CTGTGGTGGG GGAGGACAAC CTCAAGTTCG GTTGGCTCAA GCGAGACCCC GAGTTTCACG ATACCCCCTG CGCCGACGTT ACGCTTAGGG GGGAGAACTA CGCCAGCGAC GATTTTCTGG AGGGAAAGGG CCTGGTGGCA ACAGGGGCCT TCATGCCCTT CGGGATCTAT ACCCTCCCGG GAGAGGTGGT CTACGGAGAA GTCCCCTGTA ACGGCGCGGT GTTGAAGGTA CCGCCGTCGG GAGGGAGGCC GCAACTGGTA GCCTGGGGCT TTCGCAACCC CTTCGGGCTC GCCTTCTCGC CGCAGGGAAA GCTTTACGTC ACCGACAACG GTTACGACGA GCGGGGCTCC CGCCCCGTGT TCGGCGCAAG CGACGTGCTC TGGGAAGTAA CGCCGGGAAC CTGGTACGGG TGGCCAGATT TCAGTGCCGG GCTCCCGCTC GATTACGGAA ACCAGTTCAA GCCTCCGCTG CGGCATAGGC CGAAGCCGTT GCTTGCCGAA CATCCCAACG TCCCGCCGGC GCCCGCCGCG ATTCTCCCGG TTCATGCCTC GGCGAACGGA CTCGACTTTT CCCGCAGCGA GCGGTTCGGC CACCGCGGCG AGGTGTTCGT CGCGCTGTTC GGGGACCAGT CGCCCGGCAC CGGCAAGGTG ATGGGTCCGG TCGGTTTCAA GGTCGTTCGG GTGAACGTGG CTGACGGGGT GATCAGGGAT TTCGCCGTCA ACAAGGGGGA GAAGAACGCG CCCGCCTCGG CATTTGGGAG CGGAGGGTTG GAGCGGCCGC TTGCCGCGCG CTTCGATCCA TCTGGGGAGG CGCTCTACGT GGTGGATTTC GGGATGCTCA AGGAAACGGT GAAGGGTAGC ATCCCCATGA AGAATACCGG CGTCTTGTGG CGCATAACGA GGGAAGATCA AGAGTAG
|
Protein sequence | MCKLIRIGCL LLLGSALCGC YEMRPTSGGG KLTVLPEDGV RALSPEDIAL PSGYRIEAVA TGLTFPSGVA FDERGTPFVV ESGYSYGEVW TVPRLLKVEN GKKVTVAQGG NNGPWTGVAR LGGSFYVAEG GELEGGRILR IDADGSINPI VDGLPSRGDH HTNGPAAGPD GSLYFGIGTA TNSAVVGEDN LKFGWLKRDP EFHDTPCADV TLRGENYASD DFLEGKGLVA TGAFMPFGIY TLPGEVVYGE VPCNGAVLKV PPSGGRPQLV AWGFRNPFGL AFSPQGKLYV TDNGYDERGS RPVFGASDVL WEVTPGTWYG WPDFSAGLPL DYGNQFKPPL RHRPKPLLAE HPNVPPAPAA ILPVHASANG LDFSRSERFG HRGEVFVALF GDQSPGTGKV MGPVGFKVVR VNVADGVIRD FAVNKGEKNA PASAFGSGGL ERPLAARFDP SGEALYVVDF GMLKETVKGS IPMKNTGVLW RITREDQE
|
| |
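The relationship between the Gene sequence and Protein sequence fields can be illustrated by translating the first ten codons. This is a minimal sketch, not the method used to produce the record: it assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Table 11 (bacterial) shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the Gene sequence field (30 nt = 10 codons).
first_30_nt = "ATGTGCAAGC TAATTAGAAT TGGGTGTCTG"
print(translate(first_30_nt))  # MCKLIRIGCL
```

The output matches the first ten residues of the Protein sequence field; passing the full Gene sequence string to the same function would extend the check to the whole protein.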