Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3834 |
Symbol | |
ID | 8139208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4420273 |
End bp | 4421331 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871451 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003023609 |
Protein GI | 253702420 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.00734924 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGTAT CGATGCCTGC CGGCGCCATG CACGGGTGGG GGATCGCCGG CAGCTACCTG GAGCGCGAGA TCTCCAAACT CCCCGGCATC GAGGGGGTGA CGCTGCACTG CATGACCAAC ACCCTGGCGC CGCTGCGCCC GGAGAGCTGG GACTCCATCA ACATCGGCTA CTGCTTCTTC GAGGACAGCA TCGAGATCCT CAACTTCACC CGTGACGCCG CGCGCCAATG GGATTTCATC GTCGCGGGTT CCAAGTGGTG CGAGTACCAA CTGAGGATCG GCGGGGTGAA AAACACCTGC ACCATCCTGC AGGGTATCGA CCCGACCAAC TTCCACCCGG TCCCCTACCC GGCGGACGAC CGCTTCGTGG TCTTCTCGGG GGGCAAATTC GAACTCCGCA AGGGTCAGGA CCTGGTGATC GCCGCCATGA AGGTGATGAT GCAGCGGCAC CGCGACGTCT TCCTCTCCTG CAGCTGGACC AACCAGTGGC CTTTTTCGCT CGCCACCATG CAGTCGTCCC CCTACATAAC CTACCGACAC GACGAGGAGA ACTTCCTCGA CCTCCCGGGG AGATGCGTGC TCGACAACGG GCTGGACCCC GCCCGGGTGG CGGTGCATCC CCTGGTGAAC AACGCCCTCA TGCGCGAAAT ATTCGCCGGG AGCCACTTAG GCCTCTTCCC CAACCGCTGC GAGGGGGGGA ACAACATGGT GATGTGCGAG TACATGGCCT GCGGCAGGAG CGTCATCGCC TCGGATACCA GCGGCCACGC CGACGTGATC AACTCCGCCA TCGCCTACCC CCTTACCCGC TACCGCCCCA TGGTGGTGGC GACCCAGGGG GTGCAGACCG GGGTCTGGGA GGAGCCGCAG GTGGAAGAGA TCATAGAGCT CCTGGAACTC GCCTACCTAA ACCGCGACCA GCTTCCCGCC AAGGGGGCGC TGGCGGCCCG GGAGATGGAG AAGCTAAGCT GGGGCGCCGC GGCGCGGCAG TTCCACTACA TCGCCACCAG GCTCGCCAAT CAGGCGGAGC TCGCCAGGAT GCAGCAGGAT GCCTGCTAG
|
Protein sequence | MKVSMPAGAM HGWGIAGSYL EREISKLPGI EGVTLHCMTN TLAPLRPESW DSINIGYCFF EDSIEILNFT RDAARQWDFI VAGSKWCEYQ LRIGGVKNTC TILQGIDPTN FHPVPYPADD RFVVFSGGKF ELRKGQDLVI AAMKVMMQRH RDVFLSCSWT NQWPFSLATM QSSPYITYRH DEENFLDLPG RCVLDNGLDP ARVAVHPLVN NALMREIFAG SHLGLFPNRC EGGNNMVMCE YMACGRSVIA SDTSGHADVI NSAIAYPLTR YRPMVVATQG VQTGVWEEPQ VEEIIELLEL AYLNRDQLPA KGALAAREME KLSWGAAARQ FHYIATRLAN QAELARMQQD AC
|
| |