Gene Information |
Locus tag | GM21_3821 |
Symbol | |
ID | 8139195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4393554 |
End bp | 4398773 |
Gene Length | 5220 bp |
Protein Length | 1739 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644871438 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003023596 |
Protein GI | 253702407 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACCACA GCGACGAGCA TTCGCAGCAG CAGGCACAGA GGGGTCCCGG CTCCACCATC ACAGAGCGCA GGATCATCCC GGAGGGGGGT GGAACCTTCC GCCTTTCCCT TGGCTGCCAA TTGCAGGCAA TGGTCAGCAA CGAGGAGTAC CACTACGGGC AGGGGACAAA CCTCTTCACC CGCTGCGGCT ATCTTCCGGC GCTCGCGGAG CGGATGCGGG AATGGCTCGC GCGGGACGCC CCCGCGGAGG TACGCGACGG CGTCGGGGAG TGCTGCCGGC AAGCAAGCTG TGCGTTTAGG GAATTGCTTG AGAGGGTCTG CGCGCCCAAA AAGGAAGCGT ACCGTCCCCA CGACGCTCTT GAGTATCTGG CAAGACAGCT GGAGTTGGCG AAGCGGGAGG GCGACCGCGG CGAGGAAGAG GCCATCCACC GCTACCTCGC CATGATCGGG GAACAGAAAG ACCGAAAAGG AGCGACCATG ATCCCAGGCG AGGCAGCGGC GGCAGCGCCG TTTTTCTCCG TACTCATCCC GACCTACAAC CAGGAGGCGT TTCTGCCAGC CGCCCTGGAA AGCCTTCTGG CGCAGAGCCT CGACGACTGG GAGGCGGTCA TCGTGAACGA CGGCTCGACC GACGGCACTG CCCAGGTAAT GGAGCGCTAC GCGGCGGCCG ACTCACGGTT CCGGGTCTTC CACCAGGAAA ACGGCGGAGT CGGCGCCGCC CTGAACACCG CATTGGAAAA AACACACGGG AAATGGCTCT GCTGGCTGTC ATCGGACGAC CTGTTTGAAT CGGACGCCCT GGAAACATTC GCAGCGGCGA TTGATGCGGA TCCCGTTGCC CGCTTCTTCC ACAGCGATTT CTTCGAGATG GACGATGCGA CCGGAATCCG CACCCCTTCT CCCGACAACC GGCACAAGGC CCTCCCCCCC CCGGGGCTCC AGACCCTTCA GTTCTTCAAC GGCAACTACG TACACGGAAT CAGCATCGCG GTGCACCGCT CGGTCTTCGC CGAGATTGGA GAGTTCAACA CCGACTTCAG CTACGGTCAA GACGTTGACA TGTGGCTGCG CGCCAGTGCC AGATTCCGAC TGCACCACAT AAGGCGCCGC ACCTGCGTCA GCCGCCAGCA CTCCGTTATG GGGACCGCGA CTTTCCCGCA GGCCGGGCCG ATGGACGTGG CGCGCTCGGC ACTAGAGTTT TTGAACCGGA GTCCGTTCCC CGCTTGCTTC CCGTTCCTCG ACCTCGGGAA GGACGAGGAA CTGACCATGG CGCTGCGCGC GACGCTTGAG GTTGCCCTGA ACGAACATGC CGGGATGTAC CAAGGCATCG GCCCCGTCCC GGCGCTTCTG GAACGTTTGC GCGAGTGGAT CTGGCGCGGC GCTTCCGAAG AGGTCCGGCA CCGGGTGCTC GCCGCGCTGC AACCGGTGGT AGTCTCCACC ACCAACCTCT TGCCGCCCAG GCTGGGCCGG GCGCTTGAGG CGCTTCTCAA AGACGGCGAG GTCGTCTACC ACCCGCACGA TTACCACCAC GAACTCGCCT CACGCTACAG CGAGCTTACC TGCTCCGGCA GAAGTGGCGA GGCCGCCGTC ATCGCCCGTT ACGCCTTGGT GCAAAGGGGA GAGAACGGTC TTCTCCTCAA GCAGGCGCTC GAGGAGGCCG ACAGCGCAGT AGAGACGCCG GCCCCGTCGG GCAGCCGCTG CGTGAGCAGG TGGCTCGACG GCCTGCGTGG CCTTGAGATA GGCCCCTCCA GCCACAACGA CTTCGGGCTT AACGCCAAAA ACGTCGGGAT GAGGGACCCT 
ATTTACGAGG AGGAGCAACT CCGCAGCACC GGCAGGGTGG CGCGGATCGA CATTGAGGCG CTGGCGGACC GGATTCCGGT CCCGAACGAG TCGGAGGATT TCATCTTCTC CTCGCACGTG ATCGAGCACT GCCCCGATCT GATCAATACC CTGCTGGAAT GGTTCCGTAT CGTGCGGCCC GGGGGGATTA TTTACATGAT CGTCCCGGGC CGCAACGCCT CCCCCGGCGA CGTCGGCAAG CAACTCACCA CCTGGGAGCA CGCAATGAAC GACTTCCAGA AGCGCGCTGA CCGATTCAGC GAACCGGAAG CCGGACTCTT CGGGCATTGC CACTACCACG TCTTCGACCC GGATAGCATG CATTGCATCG TGGAGCGGAT CTTCGACGCC AGGTTGGAAC TCGTGGAACG CCAGGATGTG GACGACAAGG TGGGCAACGG CTTTGCGCTG GTGTACCGGA AGAGAAGCGG CATCGACTCC TCGCTACCCT GGCCGCTCTG GGAGCAGTTC CTGTCGCGGC GAGTGGCAAA ATTTGTCAAC ATCTGCATGG TCACCTACAA CCGGCTGGAA TTCACCCGCC AGTCTATCGA GTCCATCGCC GGCCGCACCG ACTACCCCTA CGTCCTAACC GTCGTGGACA ACAACAGCCA GGACGGGACC CGAGAATACC TGAAGCAGAT GAAGCAGCAG GGGGTGATCA AGAACCTGGT CCTGCTCGAC GAGAACATCG GAGTAGCCAA GGGGAGTAAC CTCGCTTGGA GCCAGGAACC GGAGGCGGAG TACTACCTGA AGCTCGACAA CGACATCGTA ATCAGGAAGA ACTGCTGGCT CTCTCCGATG GTGGCTGCAC TGGAAGCGAT CCCGGAATTG GGTGCTGTCG GGTACAACTT TGAGCCGGTG AGTTACCCCC TGCAGGTGGT CAACGGGCAG AGGATCAGGC CTAAGCAGGC GAACATCGGT GGTGCTTGTT ACATGATATC TAAGCGGACC GAGCGGATCC TTGGCTTCTG GTGCGAGGAC TACGGTCTTT ACGGCGAGGA GGACGGCGAA TACAGCTTAA GGCTTAGAGT AGCCGGGCTA GCCAACGCGT ACCTGGAGGA TGAGGACATG GGTGTCCATC TACCAGGTGG CAAGGCAGGG GCAATCGACG AAGTGACGAA GAAGGCGCTG GACGATGTAG AGCTCGGCAC CCATTTCGAG TACCGCACCT GGAAGGACGA ACAGCGCCGG CACCTGCAAC AGCCAGGTGG GCTTTTGCAT CGCAACTGCC ACCTCTACCA GTCCGGTGAG CGTTCAATCT ACGTAACCCG CGGCGAGTTC CTGGGAAAAA TCTCGCCGGA GATCCAGCTT TTTGACCGCA AGAACTCCCT GGCCTTTCTC CCCTTGAACG GGACCCTCTC CCCCTCGCAA CGTGAGGATA TTCGCGCCTG GTGCCTGCAG AATTCCATCA CGTTCGCCAC CGTCACGCTC GCCGAGGAGA ACGGACGCGA GATCCTGGAG GTGGCACACA AGGCGGAGCG GGTGCAGGTG ACGGCGTCGA TCGTGATCCC GGTCTTCAAC AAGGTGGAGT TCACCAGGCT CTGCCTGGAG ACGCTCTACG TCAACACGCC GCGGGATCTC TTCGAGCTGA TCATCATCGA TAACGCATCT AGCGACGGAA CCGCTGAATA CCTGGCGGCC CAGGAAGGGC CCGGGGTGCA GATCATCTCC AACGCGAAGA ACGTCGGTTA CACCATCGCA TGCAACCAGG GTGCGGCCAA AGCATCCGGT AAGTACATTG TTTTCCTAAA CAACGACACG GAGCCCCAGA 
AGGGGTGGCT GGAAAACCTA GTGCTTATGG CCGAGCATGA CCTGCGCGTC GGCGCCGTCG GGGCCAAGCT GATCTACCCC GACGGCCGAC TGCAGGAGGC AGGCGCGATC ATCTTCAGCG ACGGCAGCAA CTACTCAATC GGCAGCTGCG AAGACCCCAA GGATCCCAGG TTCAACACCC CGAGGGTGGT GGACTACTGC ACCGGCGCGT GTCTGATGGT TCGCCACGAC CTGTTCAGGA AGATAGGCGG GTTCGACGAG CGCTATGCGC CGGCATACTA CGAAGATCCA GATATCTGCT TCGCCATCGC CGATCTCGGC TACTACGTGG TCTACTGCCC GCAGTCCGAG GTGATCCACC ACGAATCGGT TACCGCAGGT TTCGATTTGG TGAATGGGAT AAAGAAGCAC TACTTCATCA ACAGGGAGAA ATTCGTTGAG AAATGGGGGC GGGTGCTGGC AGGCAAGCTG GCCCGGCCAG CGCTCCCCGG CCAGGCGACC GTCAAAAGTT GCCAGGAACC GGAGGCCCCC TCCCAGAAGC TTCCGTGCGT GATGGTGGAC GGAGTGTTCT ACCAGATGAA CGCCACCGGG ATAGCGAGGG TCTGGACTTC ACTCCTGTCC CAGTGGGCCG GAACCGACTT CGGCCGTGGC ATCGTGGTGC TGGACCGGGC CGGCACGGCA CCCAGGGTGC CGGGGATCCG CTACCGGAAC ATACCAGGGT ACGTGCCTGG CGAGCCGGAC CGAGCCATGC TGCAGAAGGT CTGTGACGAT GAGGGGGCGG ACCTTTTCAT CTCCACCTAC TACACGACGC CGCTCTCCAC GCCGTCGCTG TTTCTGGCCC ACGACATGAT CCCCGAGTTC ACCGACTTCT ACGACCTCGC CGATGCCCAG TGGCAGGAGA AGCACCATGC CATCAGCCGC GCCTCTGCCT TCGTGGCAGT CTCCCGCAGC ACTGCCAAGG ACCTGGCCAA GCTCTACCCC GACCTGGAGG GGCGCATCGT AGTGGCGCAC AACGGCATCG ACCACGACGT CTTCTCGTCC GCATCAAAGG AGGAGGTGCA GGACTTTCGC AAGCTCCACG GGCTGGACAA GCCGTACTTC CTCTTCGTGG GGATGCGCCA AGCCTACAAG AACGGGCTGC TCCCTTTGAA CGCGTTGAAC CTGCTCCCGA ACAAGGGGTC GTTCACCCTT CTCTATGTGG GGGGTGGTCC GGCGCTGGAG CCCGAAGTGA AGCGGTTGGC CGGCGATCTT GACGTGCGGC TGGCAAAGCT GTCGGACCGA GAGCTTATCT GCGCCTACAG CGGCGCCGCC GCTCTCATCT ACCCCTCGAC CTACGAGGGG TTCGGGCTCC CCATCCTCGA GGCCATGGCA TGCCGCTGCC CGGTCATTAC CTGCCAAAAC TCATCGATCC CCGAGGTGGC CGGCGCCGCG GCGCTGTACG TCTCGGAGAG CGATCCCCGG GAGCTGTCGG AGGCGATGCT GCAGATCACC GCCATCGAGG TACGCCAGCA CCTGATCCAC ATGGGAACCC GCCAAGTCGC GAAGTTCTCC TGGAAGAAGA TGGCCGACAT CATTCAGGAG GTCATCTACA GAGAGTTCCC GGTGCGCTAA
Protein sequence | MNHSDEHSQQ QAQRGPGSTI TERRIIPEGG GTFRLSLGCQ LQAMVSNEEY HYGQGTNLFT RCGYLPALAE RMREWLARDA PAEVRDGVGE CCRQASCAFR ELLERVCAPK KEAYRPHDAL EYLARQLELA KREGDRGEEE AIHRYLAMIG EQKDRKGATM IPGEAAAAAP FFSVLIPTYN QEAFLPAALE SLLAQSLDDW EAVIVNDGST DGTAQVMERY AAADSRFRVF HQENGGVGAA LNTALEKTHG KWLCWLSSDD LFESDALETF AAAIDADPVA RFFHSDFFEM DDATGIRTPS PDNRHKALPP PGLQTLQFFN GNYVHGISIA VHRSVFAEIG EFNTDFSYGQ DVDMWLRASA RFRLHHIRRR TCVSRQHSVM GTATFPQAGP MDVARSALEF LNRSPFPACF PFLDLGKDEE LTMALRATLE VALNEHAGMY QGIGPVPALL ERLREWIWRG ASEEVRHRVL AALQPVVVST TNLLPPRLGR ALEALLKDGE VVYHPHDYHH ELASRYSELT CSGRSGEAAV IARYALVQRG ENGLLLKQAL EEADSAVETP APSGSRCVSR WLDGLRGLEI GPSSHNDFGL NAKNVGMRDP IYEEEQLRST GRVARIDIEA LADRIPVPNE SEDFIFSSHV IEHCPDLINT LLEWFRIVRP GGIIYMIVPG RNASPGDVGK QLTTWEHAMN DFQKRADRFS EPEAGLFGHC HYHVFDPDSM HCIVERIFDA RLELVERQDV DDKVGNGFAL VYRKRSGIDS SLPWPLWEQF LSRRVAKFVN ICMVTYNRLE FTRQSIESIA GRTDYPYVLT VVDNNSQDGT REYLKQMKQQ GVIKNLVLLD ENIGVAKGSN LAWSQEPEAE YYLKLDNDIV IRKNCWLSPM VAALEAIPEL GAVGYNFEPV SYPLQVVNGQ RIRPKQANIG GACYMISKRT ERILGFWCED YGLYGEEDGE YSLRLRVAGL ANAYLEDEDM GVHLPGGKAG AIDEVTKKAL DDVELGTHFE YRTWKDEQRR HLQQPGGLLH RNCHLYQSGE RSIYVTRGEF LGKISPEIQL FDRKNSLAFL PLNGTLSPSQ REDIRAWCLQ NSITFATVTL AEENGREILE VAHKAERVQV TASIVIPVFN KVEFTRLCLE TLYVNTPRDL FELIIIDNAS SDGTAEYLAA QEGPGVQIIS NAKNVGYTIA CNQGAAKASG KYIVFLNNDT EPQKGWLENL VLMAEHDLRV GAVGAKLIYP DGRLQEAGAI IFSDGSNYSI GSCEDPKDPR FNTPRVVDYC TGACLMVRHD LFRKIGGFDE RYAPAYYEDP DICFAIADLG YYVVYCPQSE VIHHESVTAG FDLVNGIKKH YFINREKFVE KWGRVLAGKL ARPALPGQAT VKSCQEPEAP SQKLPCVMVD GVFYQMNATG IARVWTSLLS QWAGTDFGRG IVVLDRAGTA PRVPGIRYRN IPGYVPGEPD RAMLQKVCDD EGADLFISTY YTTPLSTPSL FLAHDMIPEF TDFYDLADAQ WQEKHHAISR ASAFVAVSRS TAKDLAKLYP DLEGRIVVAH NGIDHDVFSS ASKEEVQDFR KLHGLDKPYF LFVGMRQAYK NGLLPLNALN LLPNKGSFTL LYVGGGPALE PEVKRLAGDL DVRLAKLSDR ELICAYSGAA ALIYPSTYEG FGLPILEAMA CRCPVITCQN SSIPEVAGAA ALYVSESDPR ELSEAMLQIT AIEVRQHLIH MGTRQVAKFS WKKMADIIQE VIYREFPVR