Gene GM21_3821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3821 
Symbol 
ID8139195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4393554 
End bp4398773 
Gene Length5220 bp 
Protein Length1739 aa 
Translation table11 
GC content62% 
IMG OID644871438 
Productglycosyl transferase family 2 
Protein accessionYP_003023596 
Protein GI253702407 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCACA GCGACGAGCA TTCGCAGCAG CAGGCACAGA GGGGTCCCGG CTCCACCATC 
ACAGAGCGCA GGATCATCCC GGAGGGGGGT GGAACCTTCC GCCTTTCCCT TGGCTGCCAA
TTGCAGGCAA TGGTCAGCAA CGAGGAGTAC CACTACGGGC AGGGGACAAA CCTCTTCACC
CGCTGCGGCT ATCTTCCGGC GCTCGCGGAG CGGATGCGGG AATGGCTCGC GCGGGACGCC
CCCGCGGAGG TACGCGACGG CGTCGGGGAG TGCTGCCGGC AAGCAAGCTG TGCGTTTAGG
GAATTGCTTG AGAGGGTCTG CGCGCCCAAA AAGGAAGCGT ACCGTCCCCA CGACGCTCTT
GAGTATCTGG CAAGACAGCT GGAGTTGGCG AAGCGGGAGG GCGACCGCGG CGAGGAAGAG
GCCATCCACC GCTACCTCGC CATGATCGGG GAACAGAAAG ACCGAAAAGG AGCGACCATG
ATCCCAGGCG AGGCAGCGGC GGCAGCGCCG TTTTTCTCCG TACTCATCCC GACCTACAAC
CAGGAGGCGT TTCTGCCAGC CGCCCTGGAA AGCCTTCTGG CGCAGAGCCT CGACGACTGG
GAGGCGGTCA TCGTGAACGA CGGCTCGACC GACGGCACTG CCCAGGTAAT GGAGCGCTAC
GCGGCGGCCG ACTCACGGTT CCGGGTCTTC CACCAGGAAA ACGGCGGAGT CGGCGCCGCC
CTGAACACCG CATTGGAAAA AACACACGGG AAATGGCTCT GCTGGCTGTC ATCGGACGAC
CTGTTTGAAT CGGACGCCCT GGAAACATTC GCAGCGGCGA TTGATGCGGA TCCCGTTGCC
CGCTTCTTCC ACAGCGATTT CTTCGAGATG GACGATGCGA CCGGAATCCG CACCCCTTCT
CCCGACAACC GGCACAAGGC CCTCCCCCCC CCGGGGCTCC AGACCCTTCA GTTCTTCAAC
GGCAACTACG TACACGGAAT CAGCATCGCG GTGCACCGCT CGGTCTTCGC CGAGATTGGA
GAGTTCAACA CCGACTTCAG CTACGGTCAA GACGTTGACA TGTGGCTGCG CGCCAGTGCC
AGATTCCGAC TGCACCACAT AAGGCGCCGC ACCTGCGTCA GCCGCCAGCA CTCCGTTATG
GGGACCGCGA CTTTCCCGCA GGCCGGGCCG ATGGACGTGG CGCGCTCGGC ACTAGAGTTT
TTGAACCGGA GTCCGTTCCC CGCTTGCTTC CCGTTCCTCG ACCTCGGGAA GGACGAGGAA
CTGACCATGG CGCTGCGCGC GACGCTTGAG GTTGCCCTGA ACGAACATGC CGGGATGTAC
CAAGGCATCG GCCCCGTCCC GGCGCTTCTG GAACGTTTGC GCGAGTGGAT CTGGCGCGGC
GCTTCCGAAG AGGTCCGGCA CCGGGTGCTC GCCGCGCTGC AACCGGTGGT AGTCTCCACC
ACCAACCTCT TGCCGCCCAG GCTGGGCCGG GCGCTTGAGG CGCTTCTCAA AGACGGCGAG
GTCGTCTACC ACCCGCACGA TTACCACCAC GAACTCGCCT CACGCTACAG CGAGCTTACC
TGCTCCGGCA GAAGTGGCGA GGCCGCCGTC ATCGCCCGTT ACGCCTTGGT GCAAAGGGGA
GAGAACGGTC TTCTCCTCAA GCAGGCGCTC GAGGAGGCCG ACAGCGCAGT AGAGACGCCG
GCCCCGTCGG GCAGCCGCTG CGTGAGCAGG TGGCTCGACG GCCTGCGTGG CCTTGAGATA
GGCCCCTCCA GCCACAACGA CTTCGGGCTT AACGCCAAAA ACGTCGGGAT GAGGGACCCT
ATTTACGAGG AGGAGCAACT CCGCAGCACC GGCAGGGTGG CGCGGATCGA CATTGAGGCG
CTGGCGGACC GGATTCCGGT CCCGAACGAG TCGGAGGATT TCATCTTCTC CTCGCACGTG
ATCGAGCACT GCCCCGATCT GATCAATACC CTGCTGGAAT GGTTCCGTAT CGTGCGGCCC
GGGGGGATTA TTTACATGAT CGTCCCGGGC CGCAACGCCT CCCCCGGCGA CGTCGGCAAG
CAACTCACCA CCTGGGAGCA CGCAATGAAC GACTTCCAGA AGCGCGCTGA CCGATTCAGC
GAACCGGAAG CCGGACTCTT CGGGCATTGC CACTACCACG TCTTCGACCC GGATAGCATG
CATTGCATCG TGGAGCGGAT CTTCGACGCC AGGTTGGAAC TCGTGGAACG CCAGGATGTG
GACGACAAGG TGGGCAACGG CTTTGCGCTG GTGTACCGGA AGAGAAGCGG CATCGACTCC
TCGCTACCCT GGCCGCTCTG GGAGCAGTTC CTGTCGCGGC GAGTGGCAAA ATTTGTCAAC
ATCTGCATGG TCACCTACAA CCGGCTGGAA TTCACCCGCC AGTCTATCGA GTCCATCGCC
GGCCGCACCG ACTACCCCTA CGTCCTAACC GTCGTGGACA ACAACAGCCA GGACGGGACC
CGAGAATACC TGAAGCAGAT GAAGCAGCAG GGGGTGATCA AGAACCTGGT CCTGCTCGAC
GAGAACATCG GAGTAGCCAA GGGGAGTAAC CTCGCTTGGA GCCAGGAACC GGAGGCGGAG
TACTACCTGA AGCTCGACAA CGACATCGTA ATCAGGAAGA ACTGCTGGCT CTCTCCGATG
GTGGCTGCAC TGGAAGCGAT CCCGGAATTG GGTGCTGTCG GGTACAACTT TGAGCCGGTG
AGTTACCCCC TGCAGGTGGT CAACGGGCAG AGGATCAGGC CTAAGCAGGC GAACATCGGT
GGTGCTTGTT ACATGATATC TAAGCGGACC GAGCGGATCC TTGGCTTCTG GTGCGAGGAC
TACGGTCTTT ACGGCGAGGA GGACGGCGAA TACAGCTTAA GGCTTAGAGT AGCCGGGCTA
GCCAACGCGT ACCTGGAGGA TGAGGACATG GGTGTCCATC TACCAGGTGG CAAGGCAGGG
GCAATCGACG AAGTGACGAA GAAGGCGCTG GACGATGTAG AGCTCGGCAC CCATTTCGAG
TACCGCACCT GGAAGGACGA ACAGCGCCGG CACCTGCAAC AGCCAGGTGG GCTTTTGCAT
CGCAACTGCC ACCTCTACCA GTCCGGTGAG CGTTCAATCT ACGTAACCCG CGGCGAGTTC
CTGGGAAAAA TCTCGCCGGA GATCCAGCTT TTTGACCGCA AGAACTCCCT GGCCTTTCTC
CCCTTGAACG GGACCCTCTC CCCCTCGCAA CGTGAGGATA TTCGCGCCTG GTGCCTGCAG
AATTCCATCA CGTTCGCCAC CGTCACGCTC GCCGAGGAGA ACGGACGCGA GATCCTGGAG
GTGGCACACA AGGCGGAGCG GGTGCAGGTG ACGGCGTCGA TCGTGATCCC GGTCTTCAAC
AAGGTGGAGT TCACCAGGCT CTGCCTGGAG ACGCTCTACG TCAACACGCC GCGGGATCTC
TTCGAGCTGA TCATCATCGA TAACGCATCT AGCGACGGAA CCGCTGAATA CCTGGCGGCC
CAGGAAGGGC CCGGGGTGCA GATCATCTCC AACGCGAAGA ACGTCGGTTA CACCATCGCA
TGCAACCAGG GTGCGGCCAA AGCATCCGGT AAGTACATTG TTTTCCTAAA CAACGACACG
GAGCCCCAGA AGGGGTGGCT GGAAAACCTA GTGCTTATGG CCGAGCATGA CCTGCGCGTC
GGCGCCGTCG GGGCCAAGCT GATCTACCCC GACGGCCGAC TGCAGGAGGC AGGCGCGATC
ATCTTCAGCG ACGGCAGCAA CTACTCAATC GGCAGCTGCG AAGACCCCAA GGATCCCAGG
TTCAACACCC CGAGGGTGGT GGACTACTGC ACCGGCGCGT GTCTGATGGT TCGCCACGAC
CTGTTCAGGA AGATAGGCGG GTTCGACGAG CGCTATGCGC CGGCATACTA CGAAGATCCA
GATATCTGCT TCGCCATCGC CGATCTCGGC TACTACGTGG TCTACTGCCC GCAGTCCGAG
GTGATCCACC ACGAATCGGT TACCGCAGGT TTCGATTTGG TGAATGGGAT AAAGAAGCAC
TACTTCATCA ACAGGGAGAA ATTCGTTGAG AAATGGGGGC GGGTGCTGGC AGGCAAGCTG
GCCCGGCCAG CGCTCCCCGG CCAGGCGACC GTCAAAAGTT GCCAGGAACC GGAGGCCCCC
TCCCAGAAGC TTCCGTGCGT GATGGTGGAC GGAGTGTTCT ACCAGATGAA CGCCACCGGG
ATAGCGAGGG TCTGGACTTC ACTCCTGTCC CAGTGGGCCG GAACCGACTT CGGCCGTGGC
ATCGTGGTGC TGGACCGGGC CGGCACGGCA CCCAGGGTGC CGGGGATCCG CTACCGGAAC
ATACCAGGGT ACGTGCCTGG CGAGCCGGAC CGAGCCATGC TGCAGAAGGT CTGTGACGAT
GAGGGGGCGG ACCTTTTCAT CTCCACCTAC TACACGACGC CGCTCTCCAC GCCGTCGCTG
TTTCTGGCCC ACGACATGAT CCCCGAGTTC ACCGACTTCT ACGACCTCGC CGATGCCCAG
TGGCAGGAGA AGCACCATGC CATCAGCCGC GCCTCTGCCT TCGTGGCAGT CTCCCGCAGC
ACTGCCAAGG ACCTGGCCAA GCTCTACCCC GACCTGGAGG GGCGCATCGT AGTGGCGCAC
AACGGCATCG ACCACGACGT CTTCTCGTCC GCATCAAAGG AGGAGGTGCA GGACTTTCGC
AAGCTCCACG GGCTGGACAA GCCGTACTTC CTCTTCGTGG GGATGCGCCA AGCCTACAAG
AACGGGCTGC TCCCTTTGAA CGCGTTGAAC CTGCTCCCGA ACAAGGGGTC GTTCACCCTT
CTCTATGTGG GGGGTGGTCC GGCGCTGGAG CCCGAAGTGA AGCGGTTGGC CGGCGATCTT
GACGTGCGGC TGGCAAAGCT GTCGGACCGA GAGCTTATCT GCGCCTACAG CGGCGCCGCC
GCTCTCATCT ACCCCTCGAC CTACGAGGGG TTCGGGCTCC CCATCCTCGA GGCCATGGCA
TGCCGCTGCC CGGTCATTAC CTGCCAAAAC TCATCGATCC CCGAGGTGGC CGGCGCCGCG
GCGCTGTACG TCTCGGAGAG CGATCCCCGG GAGCTGTCGG AGGCGATGCT GCAGATCACC
GCCATCGAGG TACGCCAGCA CCTGATCCAC ATGGGAACCC GCCAAGTCGC GAAGTTCTCC
TGGAAGAAGA TGGCCGACAT CATTCAGGAG GTCATCTACA GAGAGTTCCC GGTGCGCTAA
 
Protein sequence
MNHSDEHSQQ QAQRGPGSTI TERRIIPEGG GTFRLSLGCQ LQAMVSNEEY HYGQGTNLFT 
RCGYLPALAE RMREWLARDA PAEVRDGVGE CCRQASCAFR ELLERVCAPK KEAYRPHDAL
EYLARQLELA KREGDRGEEE AIHRYLAMIG EQKDRKGATM IPGEAAAAAP FFSVLIPTYN
QEAFLPAALE SLLAQSLDDW EAVIVNDGST DGTAQVMERY AAADSRFRVF HQENGGVGAA
LNTALEKTHG KWLCWLSSDD LFESDALETF AAAIDADPVA RFFHSDFFEM DDATGIRTPS
PDNRHKALPP PGLQTLQFFN GNYVHGISIA VHRSVFAEIG EFNTDFSYGQ DVDMWLRASA
RFRLHHIRRR TCVSRQHSVM GTATFPQAGP MDVARSALEF LNRSPFPACF PFLDLGKDEE
LTMALRATLE VALNEHAGMY QGIGPVPALL ERLREWIWRG ASEEVRHRVL AALQPVVVST
TNLLPPRLGR ALEALLKDGE VVYHPHDYHH ELASRYSELT CSGRSGEAAV IARYALVQRG
ENGLLLKQAL EEADSAVETP APSGSRCVSR WLDGLRGLEI GPSSHNDFGL NAKNVGMRDP
IYEEEQLRST GRVARIDIEA LADRIPVPNE SEDFIFSSHV IEHCPDLINT LLEWFRIVRP
GGIIYMIVPG RNASPGDVGK QLTTWEHAMN DFQKRADRFS EPEAGLFGHC HYHVFDPDSM
HCIVERIFDA RLELVERQDV DDKVGNGFAL VYRKRSGIDS SLPWPLWEQF LSRRVAKFVN
ICMVTYNRLE FTRQSIESIA GRTDYPYVLT VVDNNSQDGT REYLKQMKQQ GVIKNLVLLD
ENIGVAKGSN LAWSQEPEAE YYLKLDNDIV IRKNCWLSPM VAALEAIPEL GAVGYNFEPV
SYPLQVVNGQ RIRPKQANIG GACYMISKRT ERILGFWCED YGLYGEEDGE YSLRLRVAGL
ANAYLEDEDM GVHLPGGKAG AIDEVTKKAL DDVELGTHFE YRTWKDEQRR HLQQPGGLLH
RNCHLYQSGE RSIYVTRGEF LGKISPEIQL FDRKNSLAFL PLNGTLSPSQ REDIRAWCLQ
NSITFATVTL AEENGREILE VAHKAERVQV TASIVIPVFN KVEFTRLCLE TLYVNTPRDL
FELIIIDNAS SDGTAEYLAA QEGPGVQIIS NAKNVGYTIA CNQGAAKASG KYIVFLNNDT
EPQKGWLENL VLMAEHDLRV GAVGAKLIYP DGRLQEAGAI IFSDGSNYSI GSCEDPKDPR
FNTPRVVDYC TGACLMVRHD LFRKIGGFDE RYAPAYYEDP DICFAIADLG YYVVYCPQSE
VIHHESVTAG FDLVNGIKKH YFINREKFVE KWGRVLAGKL ARPALPGQAT VKSCQEPEAP
SQKLPCVMVD GVFYQMNATG IARVWTSLLS QWAGTDFGRG IVVLDRAGTA PRVPGIRYRN
IPGYVPGEPD RAMLQKVCDD EGADLFISTY YTTPLSTPSL FLAHDMIPEF TDFYDLADAQ
WQEKHHAISR ASAFVAVSRS TAKDLAKLYP DLEGRIVVAH NGIDHDVFSS ASKEEVQDFR
KLHGLDKPYF LFVGMRQAYK NGLLPLNALN LLPNKGSFTL LYVGGGPALE PEVKRLAGDL
DVRLAKLSDR ELICAYSGAA ALIYPSTYEG FGLPILEAMA CRCPVITCQN SSIPEVAGAA
ALYVSESDPR ELSEAMLQIT AIEVRQHLIH MGTRQVAKFS WKKMADIIQE VIYREFPVR