Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2470 |
Symbol | |
ID | 8137811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2888020 |
End bp | 2890425 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870080 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003022271 |
Protein GI | 253701082 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 152 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATGGA CCGAAATCAC CATGCTGGCG CTACTTTTGC TCCTGGCCTA CATCTACCTG GGGTACCCGC TACTTCTTTG CCTGCTGGCG CGCCTCTTCC CCACGCCCCA CCATGTCGAT GACGGTTTTC TTCCCACGGT CACTCTGGTC ATCTCCGCCT ACAACGAAAA AGCGGTTATC AGGGCGAAAC TGGAAAATTC GCTGGCCCTT GACTACCCGC CGGAAAGGCT CTCCATCTTG GTCGTGTCCG ACTGCTCCAG CGACGGGACT GACGCGGAGG TGCTCTCCTT CTCGGACAAA GGGGTGAAGC TCATAAGGGC AGAGGAGCGC CGGGGGAAGA CGTCGGCCCT CAACACAGCG CTTGCCGGTA TTGATAGTGA GATGGTGGTC TTCTCCGACG CCAACGCCAT CTACGACGCA ATGGCCATCC GCCGCTTGGT GCGGCATTTC GCCGATCCCA ACATTGGCTA CGTCGTCGGT TGCGCCCGTT ATATTGAGGA GACCGTCAGC GCCGCCGGCG GGAGCGAAGG GACTTACTGG GACCTCGAGG TGATGCTGAA ACGGTGGGAA TCGCGCCTTT CGTCAGTGGT TGGAGGGGAC GGCGCCATCT ATGCGATTCG CCGCTCGCTA TACGAGCCCC TGAGGGAAGA GGATATCAAC GACTTCGTCA ACCCCCTTCA GATCGTCGCC AAGGGGTTCC GCGGCGTCTT CGACAATGAG GCCTGGTGCG TGGAGCGTCC TGCCGGCGAC TTCCATAAGG AATTTTCCCG CAAGGTGCGC ATCGTCAACC GCAGCTTCAG CGGATTTTTG CGGGTGCCCA AGGCTGCGAA CCCCTTTGTC GTAGGTGCCT TTGCGTGGCA GCTCCTTTCG CACAAGGTCC TGCGCTGGTT TTCCCCTTAT TTTCTGGGCC TGTTTCTGGG CTTGCTGGTG CTCGACCGGC TGTTGCATCC AGCCTCCCTC ACCGGAGAGG TTTTGTTGGC CCTGACAGCC ATGGGTGCTT TCCTCTCCAT GCTAGGTGCG CTCCTGAACC GTTTCTGGCG GCTCCCCCTG CCGCTGCTTC TTCCCTACTA CTTCGTACTG GTGAATACGG CCTCCGCCCT CGGGGTGTTT TACCGCCTGT TGGGCCGGAC CATAGTGACC TGGAGCACGG TAAGGGAGGG GGAGGCTGCG CCCCCAGCGC CGCTTTTGCC GCGCTTGGCC CTTTTCTGCG CTGCGGTGCT GCTCGTGGTC ACGGCAAGCG GGTTGGCGCT GGATGAGGAT GCCCTTGGCG CTGCCGCCAT CCTGCTGATG CTCCTGCTGG TTCACACCTT CGTCGGCTAC CAGTTCCTTC TGCTGCCGCT GTCGCGTCTC TCACGCAGGA AACCGGCGCC GGACGAGAGC TACACCCCCA CGGTCACTCT GCTGGTTGTC GCCTATAACG AGGCGAAGGT GATCGAGAAA AAGCTCCTGA ACTCGCTGGC GCTTGAGTAC CCCAGCGACC GGTTGCGCAT CCTGGTGGCC TCCGACGGCT CGAAAGACGA CACCGACCAG ATCGTCTCCC GATACCTGGA CAGGGGGGTC GAGTTTATCT CCTTCCCAGT AAACCGCGGC AAGATTTCCG CCCTGAACGA CGCCATGCGG CAGATCGACT CGGAGATCGT GGTTCTCTCC GATGCCAACG TCTACTACAT GCCCACCGCC GTCAGGAACC TGGTGCGCAA CTTCGCCGAC CCGAGTGTCG GAGCGGTTTC GGGCAAGGTG GTGCTTTTGA ACGACACCCT GAGCTACAGC GCCGCCGAGA AATCGTACTA CTCGATCGAG CACCTGATAC AGGAACTGGA GGGAAGTCTC GGTGCCCTCA TCGGCGCCGA CGGCGCCATG TACGCCATCC GCAGGCAGCT TTTCACCCCC CCAAGCCCCG ACACCATCCT CGACGACCTG GTGATCGCGA TGGGGATCGC CAGGCAGGGG CATCTGGTGC TGCACGAAAA GGAGGCGCTT GGCTTCGAGG AGAACCTTTT GGAGATCGTG GGAGAGTTCC GCCGCAAGGT GAGGATCATT GCCGGCGGCT ACCAATGCCT CTTGCGCGGG GGGGTGATTC CCAGGCTGTC CCAGCCGCTG CTCATGTTCT GTTTCATCTC CCACAAGCTG CTGCGCTGGG TGAGCGGTTA CCTTCTCATG GCTCTTGTGG GGGTGCTGGT GCAGATCCAA CTGCGCAATG CATCACCTAC CTATGCCCTG GTGCTGGCAG TGCTGCTCGC AGCTCTTGTC CTGGCTCTGC TGGTGCAACT GTTCCCGAAG ATGAAGTCGA TCAAGATCGC GTCCCTTTGC CATTACTTCT ACATGCTGAT GGCCGCCTCG CTAATCGGGG GGTACCGCGG CGTTACCGGG AGACAGCAGG TCACCTGGCG CAGGGAGGCA GCCTAG
|
Protein sequence | MVWTEITMLA LLLLLAYIYL GYPLLLCLLA RLFPTPHHVD DGFLPTVTLV ISAYNEKAVI RAKLENSLAL DYPPERLSIL VVSDCSSDGT DAEVLSFSDK GVKLIRAEER RGKTSALNTA LAGIDSEMVV FSDANAIYDA MAIRRLVRHF ADPNIGYVVG CARYIEETVS AAGGSEGTYW DLEVMLKRWE SRLSSVVGGD GAIYAIRRSL YEPLREEDIN DFVNPLQIVA KGFRGVFDNE AWCVERPAGD FHKEFSRKVR IVNRSFSGFL RVPKAANPFV VGAFAWQLLS HKVLRWFSPY FLGLFLGLLV LDRLLHPASL TGEVLLALTA MGAFLSMLGA LLNRFWRLPL PLLLPYYFVL VNTASALGVF YRLLGRTIVT WSTVREGEAA PPAPLLPRLA LFCAAVLLVV TASGLALDED ALGAAAILLM LLLVHTFVGY QFLLLPLSRL SRRKPAPDES YTPTVTLLVV AYNEAKVIEK KLLNSLALEY PSDRLRILVA SDGSKDDTDQ IVSRYLDRGV EFISFPVNRG KISALNDAMR QIDSEIVVLS DANVYYMPTA VRNLVRNFAD PSVGAVSGKV VLLNDTLSYS AAEKSYYSIE HLIQELEGSL GALIGADGAM YAIRRQLFTP PSPDTILDDL VIAMGIARQG HLVLHEKEAL GFEENLLEIV GEFRRKVRII AGGYQCLLRG GVIPRLSQPL LMFCFISHKL LRWVSGYLLM ALVGVLVQIQ LRNASPTYAL VLAVLLAALV LALLVQLFPK MKSIKIASLC HYFYMLMAAS LIGGYRGVTG RQQVTWRREA A
|
| |