Gene Information |
Locus tag | GM21_1650 |
Symbol | |
ID | 8136981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1919849 |
End bp | 1921888 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869263 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003021463 |
Protein GI | 253700274 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
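The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1919849, 1921888)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the result agrees with the listed gene length of 2040 bp and protein length of 679 aa.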
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 189 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGC CATCCATCAG TGTCTGCATC CTTTTCTACG AGAAACCGGA ACAGACCATC CACTGCGTCG AAAGCCTGCT CCCGGCCGGA GCCCCGATCG TTGTCCTCAA CAACGGCTCC TCGGCCGCGG CGACGGAAAG GTTCAGCGCA TGGGCCGCCG GCCACGCCCA AATCAGGGTG CTGCACTCCG ACCGCAACCT GGGTGTGAGC GCAGGGCGCA ACCTTCTTGC CGCGCAGACT AGTGAAGAGT GGCTGCTCTT TGTCGACAAC GACATCGTGA TGGAGACCCC GCAGTGGTTG CACCGGCTGC AGCGCTACGC TGACCTGGAT GTCGAGGTCA TCGTGCCGCG GCTTTACAAC CACCACGACG GTTGCTACGC GAAATTCATG GACATCGAGG TGCGGGGGGG GCGGGCTGGC TTCGTGCCTC TCCGGGACAA CCGCCTGAAC AATTTCCCCG GTGGCGCTTC TTTTGTCAGG AGGACTCTCT TCGATCGCCT GGGTGGGTAC GACGAGGAGC TTTTCGTCGG CGTAGAGGAT TTCGAGCTGA GCCTGCGGGG GGTGCTCGCC GGGGCTCCGG TACGCGCGCT GCTCGTGCCG GACATCGAAC TCACCCATAA CCACCTTGCC TCTCCGGGGA GCGACGCCGA CAGGATGGCG GTGCGCAACC GTTACGACAA GGAGGTGCTC GGGAACTCCT ATCGGCGCAT CGCGGTGAAG CACGGCCTGC TCTTCGAGGA TTTCTGGGAG GAATGGGTGG ACCAGCAGAT GGTTTCTTTG GGAGTGTCGA CGTGGCCGGA CGAACAGGCA GAGGCGTTGC GGCGGACTGG CAGACCGAGG ATCGCCCTAG TCGCCGATGT GGAGAACTGG GCCTTTGCCA ACATCTCGCG CAACGTGGTC AGCCATCTCT CGGAAAGGTA CGACCTGCAG ATCGTCTACA GCTCCCATTA CGGGCATGAC TACGACCGTC TCCTCTGCGA CCTGTACGAG CAGGGTTACG ACCTGGTGCA CTTCTTCTGG CGCGGCACCA TCCTCTGCCT ATACCTGCAC CTGCTGCGCA CAATGCCGAC CCGCACGCCA TCGGTCCCCG AGTCGTTCAT CAGGACAAAG CTCACCTTTT CCATCTATGA TCATTGTCTG CTTGGGGAGA AGGACCTTCT GGAGTACCGG ATACTGTTCA AGTACCTGGC CGACGGCTAC GTGGTCAGCT CGCAGCGATT GTTCCACACC TACACCATGC TGCCGCATTA CCCGTCCCCC TGCCGTATCA TCGAGGACGG CGTCGACCTC GCGCGGTTCA CCCCGTGCAA CACGCAGCGA CTGCTGGACC GGGACCGGCC TCTGGTCGTC GGGTGGGCGG GTAACAGCGA CTGGGGGATG GACCTGGACG CCAACGACTT CAAGGGGTTC AATACCATAA TCAAACCCGC TCTAGCCCAG CTCAGGCAGG AAGGGTACGA GGTAGTGGGG CGGTTCGCCG ACCGCAAGGT CAAGCAGGTC CCCTATGACA AGATGGTGGA GTATTACAAC TCGATCGATG TCTATGTCTG CGCCTCCTCG ATCGAGGGGA CGCCCAACCC CGTGCTGGAG GCGATGGCCT GCGGCGTCCC GGTGATCTCG ACCGATGTGG GAATCGTGCC GCAACTTCTC GGCCGGGAGC AGAAGAGGTA CATCCTCATG CACCGCTCCG TGGAGGCGTT GAAAGAAAAG CTGGCCGCTT TGGCGAAGGA CCCGGAGGAA AGGATGAAAC TCTCCGTGGA AAACCTGGAG CGCATCAAGG GGTTCACGCG GGAGATGGAA 
GTGCCGAAAT GGGACGATTT CTTCCAGGCC ATGTTGAGCC GGGACACCGC CTCCAGGGAG CCGCTTAAGC GGGCGCTCCT CGAAGTTCCT TACAACTTCG GCATCGAGCA TACGGTCGAT CTCTTCCTTC AAAGATCCCT GAGCTGGCGC ATTACCATGC CGCTGCGCCT TTTACAGTGG CACTCAACTC GGCTGCAAAA CAAGCTGAGG GTTAAGCTGC GCAACCGCCG GGGAAGGTGA
|
Protein sequence | MSLPSISVCI LFYEKPEQTI HCVESLLPAG APIVVLNNGS SAAATERFSA WAAGHAQIRV LHSDRNLGVS AGRNLLAAQT SEEWLLFVDN DIVMETPQWL HRLQRYADLD VEVIVPRLYN HHDGCYAKFM DIEVRGGRAG FVPLRDNRLN NFPGGASFVR RTLFDRLGGY DEELFVGVED FELSLRGVLA GAPVRALLVP DIELTHNHLA SPGSDADRMA VRNRYDKEVL GNSYRRIAVK HGLLFEDFWE EWVDQQMVSL GVSTWPDEQA EALRRTGRPR IALVADVENW AFANISRNVV SHLSERYDLQ IVYSSHYGHD YDRLLCDLYE QGYDLVHFFW RGTILCLYLH LLRTMPTRTP SVPESFIRTK LTFSIYDHCL LGEKDLLEYR ILFKYLADGY VVSSQRLFHT YTMLPHYPSP CRIIEDGVDL ARFTPCNTQR LLDRDRPLVV GWAGNSDWGM DLDANDFKGF NTIIKPALAQ LRQEGYEVVG RFADRKVKQV PYDKMVEYYN SIDVYVCASS IEGTPNPVLE AMACGVPVIS TDVGIVPQLL GREQKRYILM HRSVEALKEK LAALAKDPEE RMKLSVENLE RIKGFTREME VPKWDDFFQA MLSRDTASRE PLKRALLEVP YNFGIEHTVD LFLQRSLSWR ITMPLRLLQW HSTRLQNKLR VKLRNRRGR
|
| |
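The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The sketch below is one possible way to reproduce both, not the database's own method. It assumes the sequence is pasted as a string, possibly with the space-separated blocks of ten shown above, and that the gene begins with ATG, so the alternative start codons that table 11 allows need no special handling. Table 11 assigns codons to amino acids in the same way as the standard code, so the compact 64-letter amino acid string in TCAG order is used here. The names `clean`, `translate`, and `gc_content` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a plus-strand CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above, plus one extra base to complete a codon.
print(translate("ATGTCGCTGC CATCCATCAG T"))
```

Applied to the first seven codons of the gene sequence, `translate` yields `MSLPSIS`, which matches the start of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.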