Gene Information |
Locus tag | GM21_3516 |
Symbol | |
ID | 8138888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4057434 |
End bp | 4058378 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871135 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003023295 |
Protein GI | 253702106 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
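The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither assumption is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4057434
end_bp = 4058378
gene_length_bp = 945
protein_length_aa = 314

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, so it encodes (codons - 1) residues.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # 945 0 314
```

Under those assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.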
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.00000921398 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATCTCA GCATCGTAGT CCCCATTTAC AACGAGGAAG ACAACATTCC CATCCTGCAC GACCGGGTCA GCGAGGCGTT GGGCGACACC CTGCTCGAGT ACGAGCTGAT CCTCGTCGAC GACGGCTCTT CGGACAACTC CTATTCCGGG CTGAAGCGCC TGGCGGCGAA AGACGACCGG GTCAAGGTGA TACGTCTGCG CCGCAATTTC GGCCAGACCG CCGCCATGTC CGCCGGCTTC GACTTAGCCT CAGGCCGGGT GGTGATTCCC ATGGACGGGG ACCTGCAGAA CGATCCGCTC GACATCCCGC TGCTTTTGGC GCGGATCGAC GAGGGGTACG ACGTGGTATC CGGGTGGCGC AAGGACCGCA AAGACACATT CGTGAACCGC AAGCTCCCTT CCATGCTTGC CAACGGCATC ATCTCAAGGA TGACCGGCGT ACATCTGCAC GACTACGGCT GCACCCTGAA GGCCTACCGT CGCGACGTGC TGGACGACGT GAACCTTTAC GGGGAGATGC ACCGCTTCGT TCCCGCGCTG GCGCACCAGG TCGGCGCCCG GGTAACCGAA ATGCCGGTGC GTCACCACGA AAGGCTGCAC GGCAATAGCA AGTACGGCAT CTCCCGCACC ATGAAGGTCA TCCTCGACCT GATGACGGTT AAATTCCTAT TGAGCTACTC GACCAAGCCG ATCCAGCTCT TCGGCCGCTG GGGGATCTAC ACCCTCGCCG CCGGGTTCCT AAGCGGCGCG GTCACCGTCT ACATGAAGTT CTTCGAAGGC ATGAGCATGA ACCGCAACCC GCTCCTCATC CTGACCGCTT TCCTCCTTTT CATGGGGGTT CAGTTCATCG TCCTCGGGCT TTTGGCCGAG CTCTCCGCCA GGACCTATTA CGAGGCGCAG GGAAAGCCGA TTTACAACAT AAAGGAAAAG CTCAACTTTG GCTGA
|
Protein sequence | MDLSIVVPIY NEEDNIPILH DRVSEALGDT LLEYELILVD DGSSDNSYSG LKRLAAKDDR VKVIRLRRNF GQTAAMSAGF DLASGRVVIP MDGDLQNDPL DIPLLLARID EGYDVVSGWR KDRKDTFVNR KLPSMLANGI ISRMTGVHLH DYGCTLKAYR RDVLDDVNLY GEMHRFVPAL AHQVGARVTE MPVRHHERLH GNSKYGISRT MKVILDLMTV KFLLSYSTKP IQLFGRWGIY TLAAGFLSGA VTVYMKFFEG MSMNRNPLLI LTAFLLFMGV QFIVLGLLAE LSARTYYEAQ GKPIYNIKEK LNFG
|
| |
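The sequence fields can be checked against the translation table and GC content listed in the record. The following is a minimal sketch, not the database's own method: it builds the codon table for NCBI translation table 11 by hand, treats an alternative start codon such as GTG as methionine when it is the first codon, and ignores the spaces that separate the 10-base blocks. The function names (`clean`, `gc_content`, `translate`) and the `has_start` parameter are hypothetical helpers introduced for illustration.

```python
# Codon table for NCBI translation table 11, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq, has_start=True):
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
first_30 = "GTGGATCTCA GCATCGTAGT CCCCATTTAC"
last_15 = "CTCAACTTTG GCTGA"

print(translate(first_30))                 # MDLSIVVPIY
print(translate(last_15, has_start=False))  # LNFG
print(gc_content(first_30))                # 0.5
```

The two translated fragments agree with the first and last residues of the protein sequence above. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison for the whole record.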