Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1651 |
Symbol | |
ID | 8136982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1921891 |
End bp | 1922832 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644869264 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003021464 |
Protein GI | 253700275 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 191 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAACC CCGTTCTTCC CGGAAAACTC TTGATTTCGG TCGTGGTTTG CACCCGCAAC CGCTCAGCCC TGCTGCGCAA CTGCCTGAGG TCGTTGGCGA AGCAGTCCGC GCCGCATGGG AGCTTCGAGG TGCTCATCGT CAACAACTGC TCCACGGACG ACACCGCGCA GGTGATCGAG GAGATGGTCG CGGCGAACGG AGGGTTTAGG AGCTTTACCG AGAACGAGCT TGGGTTGAGC TTCGCCAGAA ACCGCGGCGC ATGCGAGGCG CTTGGAGAGT TCGTGGCCTA CATTGACGAC GACGCAGAGG CCTTGCCGGA CTGGGTGGCT CAGATGCAAT CGTTTCTGGC CAGGAGCAGC TCAGTGGCAG CCTTCGGCGG CCCCTACCAG GCGATACTTG CCGTGACCCC TCCCGCCTGG TTCCCTCCGG AGTACGGCAC CCTCGACCTC GGCGAGCGGG AGCTGTCGCT CGACGCCAAG GAGCAGTTCC TAACCGGGAC CAACATGGTA TTTCACCGCG AAACCATCTT GCAGTTGGGG GGATTCTCCA CGAACCTGGG CATGAAAGGT GGGAGGATAT CCTACGGGGA GGAAACGCGC CTTCAGATAG ACCTGAAGCG GCAGGGACAC GAGATCTTTT ACCTGCCGAC CCTGCGGGTC AAGCACCTGG TGGCACAGGA GAAGATGGAA TTTTTCTGGC TGCTCAAGTC CATTTACTCC GTCGGCCGCT GCTCGTCCCA GACCTTCGAC ACGCCGCGCA CCCTCTTTTC CTGCTGTGCC GGCATCTGTT TCGGCATCGT TTACGCCCTG CGCACGCTTG CCGGCGCGCG CAAAACTCCG TTGCGGCGCA ATCTGTACTA CTCGCTGAAG CCCCTGGTTT CCGAGTTGGG CGCCCTGCAC CAGCACTTAC TCAGGAACCG CAATGAAGCA AGCCATCTCT GA
|
Protein sequence | MSNPVLPGKL LISVVVCTRN RSALLRNCLR SLAKQSAPHG SFEVLIVNNC STDDTAQVIE EMVAANGGFR SFTENELGLS FARNRGACEA LGEFVAYIDD DAEALPDWVA QMQSFLARSS SVAAFGGPYQ AILAVTPPAW FPPEYGTLDL GERELSLDAK EQFLTGTNMV FHRETILQLG GFSTNLGMKG GRISYGEETR LQIDLKRQGH EIFYLPTLRV KHLVAQEKME FFWLLKSIYS VGRCSSQTFD TPRTLFSCCA GICFGIVYAL RTLAGARKTP LRRNLYYSLK PLVSELGALH QHLLRNRNEA SHL
|
| |