Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3507 |
Symbol | |
ID | 8138879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4048003 |
End bp | 4049214 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644871126 |
Product | glycosyl transferase group 1 |
Protein accession | YP_003023286 |
Protein GI | 253702097 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 5.89342e-22 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCTAC TATTGATAGC GCTTCCGGAC AGCGTTCATA CCGCCCGGTG GATTTCCCAG ATCTCGGACC TGGGCTGGGA CATTCACTTG TTCCCCAGCA AGGACCTCGG TTTGATCCAC CCGGATATGG CTGGTGTAAA GGCCTATGTT CCTTTGTACG GGAAAAGGGG ATGCAGTCGA ACCGTCAGGA TTTCCGGCAT CTCGATTTTC AACGATTTCC TTTCCAAGGG TATCAGCTCA GCTGTTTCCA AACTTGAAGG ATATCGGCAG TCTCAGGTTA ACCGGCTCCT TAGGGTAATA CGGAAGGTGA GGCCGGATGT CATCCATTGC CTTGAACTGC AGCAAGCAGG GTACCTGGCC CTGGAAGCGA AGAAGCTTCA TGCCGGGAAG TTCCCCCCCT TAATCGTGAC GAACTGGGGG AGCGACATCT ACCTGTTTGG GCGTCTGGCG GAGCACGAGC CGAAAATCAG GGCATTGCTG GCTGCGAGTG ACTATTACTC GTGTGAGTGC CGGCGTGACG TCTGCCTTGC CAAGGCTTAC GGATTCAACG GCACTGTCCT TCCCGTCTTT CCCAATGCCG GAGGATTCGA TCTCGAACAG GTGAAGCGGT TGCGGCAGCC AGGTCCGGTA TCTGCTCGCC GCCTTATCAT GCTCAAGGGG TACCAGCACT GGGCAGGGCG CGCTTTGGTG GGACTTCGCG CGCTGGAGCG GTGCGCGGAA GCACTTACCG GGTACGAGGT CGTTATCTAT GGAGCATCCT CGGAAGTGGC CCTTGCAGCA GAACTCTTTT CTACATCCAC CGGCATTGCC ACCAAGATCA TCCCGCCTAA CTCACCTCAC GAAGAGATCA TGCGGCATCA CGGAGCGGCG CGTTTTTCTA TAGGGCTTAG CATAAGCGAC GGTATCAGCA CTTCCCTGTT GGAGGCCCTG GTCATGGGAT CCTTGCCGAT CCAGTCGTGG ACGGCATGTG CTGATGAGTG GATCGAAGAC GGCGTGACGG GTTTGCTGGT TCCACCCGAA GACCCGGATG TAATTGAGCA GGCAATTCGG AGGGCTCTTG CAGATGACGC GCTGGTCGAA GGCGCTGCCA GTTGCAACTT TAAACTTGCA GAAGATAAGC TCGCACAGTC GAGCTTAAAG CAAAAGACGG TGGAACTATA CCAGACGGTG TTGAACGATC TGGAGCGATC AGATAGTCCT TTGTCTTCTT GA
|
Protein sequence | MKLLLIALPD SVHTARWISQ ISDLGWDIHL FPSKDLGLIH PDMAGVKAYV PLYGKRGCSR TVRISGISIF NDFLSKGISS AVSKLEGYRQ SQVNRLLRVI RKVRPDVIHC LELQQAGYLA LEAKKLHAGK FPPLIVTNWG SDIYLFGRLA EHEPKIRALL AASDYYSCEC RRDVCLAKAY GFNGTVLPVF PNAGGFDLEQ VKRLRQPGPV SARRLIMLKG YQHWAGRALV GLRALERCAE ALTGYEVVIY GASSEVALAA ELFSTSTGIA TKIIPPNSPH EEIMRHHGAA RFSIGLSISD GISTSLLEAL VMGSLPIQSW TACADEWIED GVTGLLVPPE DPDVIEQAIR RALADDALVE GAASCNFKLA EDKLAQSSLK QKTVELYQTV LNDLERSDSP LSS
|
| |