Gene Information |
Locus tag | GM21_3515 |
Symbol | |
ID | 8138887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4056260 |
End bp | 4057441 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871134 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003023294 |
Protein GI | 253702105 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
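
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in Protein Length. The function name `check_lengths` is hypothetical and is not part of any database tool.

```python
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check gene coordinates against the reported lengths.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span in base pairs
    codons = span // 3                    # whole codons, including the stop
    return {
        "span_bp": span,
        "span_matches": span == gene_len_bp,
        "in_frame": span % 3 == 0,
        "protein_matches": codons - 1 == protein_len_aa,
    }


# Values taken from the record above (GM21_3515).
result = check_lengths(4056260, 4057441, 1182, 393)
print(result)
```

For this record, 4057441 - 4056260 + 1 = 1182 bp, which is 394 codons: 393 residues plus one stop codon.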
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.000000186871 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGCTGAGG GGGCTCTGCG CGTACTGGTG CTGGCGCCGA CCCCGTTCTT CGCCGACCGC GGCTGCCACG TCCGCATCCT TGAGGAGGCG AGGGCGGCCA TGGCGTGCGG CGTCGAGCTG CGCCTGGTGA CCTACCACAT CGGAAGCGAC GTCCCCGGCA TCCCCACCGA GAGGATCTCC GGGTTCTCCT GGTACAAGAA GCTCGAAGCC GGCCCCTCCT GGGTCAAGCC CCTCCTCGAT CTGCAGCTCC TCTTCAAGGC CGTCAAGGTG GCGCGCCAGT TCAAGCCGCA CCTGATACAC GCCCACCTCC ACGAAGGCGC CTTCTTCGGC GCCTTCCTCA AGATGCTGAT CCGTGTCCCG ATGCTCTTCG ACTGCCAGGG AAGCCTCACC GCCGAAATCA CCGACCACGG ATTCGTGAAA CCCGGCTCCC TGCTGCAGCG CTTCTTCGCC ACCCTGGAGC GTTGGATCAA CCGCAGCTCC GACTACATCG TCACCAGCGC CACCCCGACC GTGGAACTGC TCCTGTTTGA CGGCGTTCCG AGGGACCGGG TCCGGGCCCT CATCGACGGC GTCGACACCG GCGTCTTCGC GCCGCAGCCC AAAGAGGAGA TCCGCGCGAA GCTGGGGCTG CCCCAGAAGC GTCCCGTCGT CGTCTACCTG GGGCTGATGA ACAGCTACCA GGGGGTCGAC CTGCTTCTGG AGGCCGCCGC GAACCTGAAG GGACAGGGGG CGAAGCTGCA TTACCTCATC ATGGGATTCC CCGAGGCACG CTACCGGGAG AAGGCCGAAG AGATGGGGAT CGACGACATC ATCACCTTCA CCGGCAGGAT CCCCTACAGC GAAGCGCCGC TTTATCTAAT TGCGGGAGAT CTGGCCGTCT CGCCCAAGGT CTCTCTTACC GAGGCCAACG GGAAGCTGTT CAACTACATC GCCTGCGGGC TTCCCACCGT CGTCTTCGAT ACCCCCGTCA ACCGAGAGAT TTTGGGAGAC GCCGCGTTGT ACGCCAAGTT CGGCGATGCG GCAGACCTGG CTGGAGCCAT AGGTCGGCTG GCCGGCGATC GGGAATTGAG GGAAGTGCTC GGAGAGGAAG GGCGTCAGCG CGCAATAGCC CTCCATTCAT GGCAGGCGAG GGGGAAGGAG CTTCTCGAGA TCTACAAAAC CCTTAACAAG GAGAACATCT GA
|
Protein sequence | MAEGALRVLV LAPTPFFADR GCHVRILEEA RAAMACGVEL RLVTYHIGSD VPGIPTERIS GFSWYKKLEA GPSWVKPLLD LQLLFKAVKV ARQFKPHLIH AHLHEGAFFG AFLKMLIRVP MLFDCQGSLT AEITDHGFVK PGSLLQRFFA TLERWINRSS DYIVTSATPT VELLLFDGVP RDRVRALIDG VDTGVFAPQP KEEIRAKLGL PQKRPVVVYL GLMNSYQGVD LLLEAAANLK GQGAKLHYLI MGFPEARYRE KAEEMGIDDI ITFTGRIPYS EAPLYLIAGD LAVSPKVSLT EANGKLFNYI ACGLPTVVFD TPVNREILGD AALYAKFGDA ADLAGAIGRL AGDRELREVL GEEGRQRAIA LHSWQARGKE LLEIYKTLNK ENI
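
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, assuming the displayed gene sequence is already in coding orientation (the record lists the strand as "-") and using only the Python standard library. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative, not from any existing tool.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G), 16 codons per row.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a CDS, reading an alternative start codon as M.

    Whitespace is ignored, so the space-separated 10-base blocks
    used in the record can be pasted in directly.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"                      # initiator codon is read as Met
        if aa == "*":
            break                         # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGCTGAGG GGGCTCTGCG CGTACTGGTG"))  # MAEGALRVLV
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence should stop at the terminal TGA codon.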