Gene GM21_3823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3823 
Symbol 
ID8139197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4399675 
End bp4404246 
Gene Length4572 bp 
Protein Length1523 aa 
Translation table11 
GC content67% 
IMG OID644871440 
Productglycosyl transferase family 2 
Protein accessionYP_003023598 
Protein GI253702409 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value0.588357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGGGG CCGGGAAGAA ATACCTGGTC TCCGCCATCG TCTCCACCTA CAAGGCGGAG 
CGCTTCCTGC GCTGGAAACT GGAGGACCTG GAGGCGCAGA CCATTGCGGG GGAGCTGGAG
ATCGTGGTCA TCGACTCCGG GTCGCCGCAG AACGAGCGCG CCATCGTGGA GGAGTTCCAG
AAGCGCTACG ACAACATCCG GTACCTGCGG ACCGAAGAGC GCGAGACGGT GTACCAGGCC
TGGAACCGCG GCATCCGCAT GGCGACAGGC GAGTTCGTCA CCAACGCCAA CACGGACGAC
CGCCTGCGAA ACGACGCCTA CGAGGTCCTG GTGCGGACGC TCAGGGAGCA CCCGGAATGC
GTGCTCGCCT ACCCGGACAT GCGCATCACG CAAAAGGAGA ACGCCACCTT CGACCGGCAC
GCCTCCTTCG GCTTCCGCGA CTGGCCCGAG TTCAATCGGC TGAGCCTGTT GGAGCTCTGC
TGCGTCGGCC CCTTCCCGCT TTGGAGACGC TCGCTGCACG AAAGGATCGG CTACTTCGAC
GAGCGCTTCA AAAGCGCTGC GGACTACGAA TTCTGGCTGC GCGCCGCACT TAAGTACGAC
TTCATCCACG TCCCCGAGTT CCTTGGGCTT TACTGGCTTT CGGAGGAGAC CGTATCGCGG
AAGGGGGATC TCCCCACCCT CGAATACCTG GAGGTGCAGA AGGAGTACCG CGCGCGTTTT
GCGCCGGTCA CCCCTCCCCC TGTGGAGCCG ACGGCCGGGG AGTGGGACCG CTTCCACGCG
CTGACGGCGC GTCTGGAAGC TGGGGACGCC ACGGTGCTTC CCGAACTGGA ACGCTTCGAA
GCAAGCCACC CGCGCGCCGC CGCCCCCCAT TTGGAGCTGG CCCGGATCTA CTACCGCATG
GGGGAGATCG GCTACGCCAA GAAGCACTTC GAGAAGGCCG CCATCGTTGA CCCCTTTTCC
AAGACATACA GCGACAGCCT CATCTGCTTC ATGAAAAGCG AGTTGTACCA GGCGCTGCAG
CACCAGACCG CCGTGCTGAG TTCGAACCCC GACGACCTGG AGGCGCACCT TTGCGCCGGA
ATGATCCTCA TCCTGCTGGA TCGCTACCAG GCGGCCCTGG AGCACTACCG GCGCGCCCTG
GAAATCATTC CCGGAAACCC ACTCGCCGCC AGGAACATCT CCTTCGTGGA ACACCGCTTG
CTCCAGAAAA AGCCCGGCAG TTACTACACC TGCAAGCGCC CGGAGGTGCG GTGCATGGTG
AGCCGCCGGG CGCGCCGGGT GCTGGACGTG GGATGCGCCG CGGGAGAACT GGGACAGGCC
CTGAAGAAGC GACAGGGGGC CGAGGTCTGG GGGGTGGAGC CGAACAGCGA GGCCGCGGCG
ACAGCCAGGC AGGCACTGTA CCGAGTGCTG GAGGCGAGCG TCGAGGACGC CCTCTGCTCT
TTGCCCCAGA GGCACTTCGA CAGCATCGTC GCCGCCGATG TCCTGGAACA TCTGGTCGAC
CCGGAACGGG TGTTGCGCGA GCTCTCAGGG AAACTCACCA GGTCCGGCGA ACTCATTGTC
TCCCTCCCGA ACGTCAGGCA CTGGAGCGTG GTGCAGGGGC TTTTGGAGGG GTCGTGGGAG
TACGCCGACG CGGGAATCCT GGACCGGACC CATCTCAGGT TCTTCACCCG GAGGAGCGCG
CTCGCGCTCT TCGAGGCGGC CGGGTACGCG GTGGAAAGCG TGGAGCCGAT CGCCCTTTCC
GGGGACGAGG GGATGCCTAA GGCCCTTTTG CAGGCCCTTG CCGAGGGGGG TGTGCTGGAG
CCGACCCTGG AGGAGGAGAG TGCGGCCTAC CAGTACCTGT TCCGGCTGGT CCCCAAGGCC
TCCCGGCTCA CCTCCATCGT CATCCTCACC TGGAACGAGC TCTCCTGCAC CCGCGAGTGC
CTGGAATCGA TCGAGAGGCA CACCCCCGAG CCGCACGAGG TGATCCTGGT GGACAACGGC
TCCAGCGACG GGACCGTACC TTTCCTGCGC GAGTTCTGCG CGCAAAGGGA GAACTACCGC
CTCATCGAGA ACGGGAAAAA TCTCGGCTTC GCCGCGGGGT GCAACATCGG CATGCGCGAG
GCCGTGGGGG GGCATATCCT CCTTCTGAAC AACGACACGG TGGTGACCCG CGGCTGGCTT
TCCGGCATGA TCGAGGCCTT GGAGCGAGAC CCGAAGGCGG GGATCGTCGG CCCCATGACC
AACGAGATCG CGGGACCGCA GAAGCTCGCC AGAGTCCCCT ACCGCGACAT GGAGGAGCTG
CAAGCCTTCG CGGAGCGCTT CAGGGGCGAG CACTACGGGC GCCGCATCGA GGTGGACCGC
GTGGTCGGCT TCTGCATGCT CTTCACCCGC GAGGTGCTGG AGACGGTAGG GGAGCTGGAC
GAGCGCTTCG GCTCCGGCAA CTTCGAGGAC GACGATTTCT GCCTGCGGGG GGCTCTCGCA
GGCTACCGCT GCCTCATCGC CGGCGACGTC TTCATCCACC ACTACGGCAG CCGCTCCTTC
ACCGGCAACC GCGTCGACTA CGCGGCCGCC ATGTGGAAGA ACCGCAAGGC CTTCGACGCC
AAATGGGACC TGGCCGCCCT GGACGAGGTA ACGGCGGCGC GGGTGGTGAC CCACAACGCG
ACGCTCAGGG GCGCCAAGTT CGCCCGACGC GGGAAGCTGA ACGACGCGGT GGAACTGATG
CTGCAGGAGG GGATCCGCTT CTCCCCCGCC TCCCCCGCCC CCTACCTTGC GCTCGCCGGG
ATACTATGCG AGGCGGGGAA GTGGCGCGAG GCCCTGGAGG TCTTGGAACA GGTCCCGGCC
GGCTCCGAGC TAGCCGCGGC CCTGATGCGG GGGCGCGCCT TCAAGGAATC GGGGGAACCG
ACGCAGACTA TGGAAGCCGC CGGGGAGGCC GAGGCGATCG ACCCGGAGGC CCCGGGAACG
CTCCATCTGA AGGGGGTGCT CGCGCTGTGG CAGGGGGAAA CCGAAAGAGG CGAGGAGCTC
TTGCGGCGCG CCATCGCCCT CGACCCCGGC TTCGCCCTCC CCTACGGCGC CCTGGCCCAG
GCCGCCTGGG AGCGCGGGGA GCGCGAACAG GGGGTGAGGC TCGCCGAGCT TTGCTTCGTG
CTGGCCCCCT TGGAGCTCTC GGCGCTCGCG CGCTTCCACG AATTCGCCAC CGCCTGCGAC
CGGCTTCCCC GCGAGGAGGA GCTGTTGCGG GAAGCGCTGG AGATCCACCG GGACCACAAG
GGGCTTTGCT ACGGCCTGAT CGAGCTTTTG ATCCGCAGCG GCCGCTATGG CGAGGCGATG
ACGGAGATCG AGAAGGGGGC CGCCCGCTTC GGGCTCGACG ACGGGAGCAT CGACGCAGCG
CTGCAGATAA GGAAACTGGC CGGGCCTCCG CTTCCTTGCG CCAGCGGCAA GGGGAGCGTC
ACGCTCTGCA TGATCGTCAA GGACGAGGCG AGGCACCTCC CCGCCGTCCT CGACTCGGTG
CGGGGGCTCG CCGACGAACT GGTGGTGGTC GACACCGGCT CCAGCGACCG CAGCTGCGAC
ATCGCCCGGA TCTTCGGGGC CAGGCTCTTC AGCTTCCCCT GGAACGGGAG CTTCGCCGAC
GCCCGCAACT TCTCCCTCTC GCAGGCGCTG GGAGAGTGGA TCCTGGTGCT CGACGCGGAC
GAGGTGATAG CGGCAGACGA CGCGGTGGCC CTCAAGGAGC TGGCGCAAAG GACAGCCCTT
CCCACAGCGT TCTCCTTCAC CACGAGGAAC TACACCCACG AGGTGACCCG CCGCAACTGG
AGCGCCAACG CGGGGGAATA CCCCGCCGAG GAAGAGGGGC GCGGCTGGAC CCCGAGCGAC
AAGGTGAGGC TCTTCCCGAA CGACCCGGGT ATCCGCTTCG AGGGGGCGGT GCACGAGCTG
GTGGAGCCGT CGCTGCTCCG CCTCGGCCTC CCCATCCACG CCTGCGACGT CCCGGTGCAC
CACTACGGAA AGCTCGACGC CGAGCGCTGC GCCCAGAAGC AGGAGGCGTA CTACCTTCTG
GGGCTGAAGA AGCTGGAAGA GGACGGGGGG TCGGTGGAGG CGCTCACCGA GCTGGCGCGG
CAGGCGACCG AGCTGAACCG GGGCGAGGAG GCGCAAAGGC TGTGGCACCG GCTTTTGCAG
GTGCACCCGG AGAACGCGGA GGCCTATTTC AACCTGGGGT ACCTGCAGCT TTGCGCCGGA
GAGTACCCGA AGGCGCGGGA GAGCGCGCTC AAGGGGGCCC AGCTCGCGCC GGGGATGAAG
GAGGCCGCCT TCAACCTGGC CAAGTGCGAG CTCTTCTTGG GGAACACGGA AAAGGCGCGG
GAGAGCTGCC GCGAGATGCT GGAGAAATGG CCCGATTACC CCCCCGCCCT GTCGCTTTCC
TGCGTCTGCC TCCTCTTGCA GGGGGAGAAG GCCCAGGCGC AACACCTTTT GCAAAGGCTC
GCCGCCATGC GCTTTGACTG CGCCGATTTC CTGGAAGAGT ACGCGGCAGG GCTCAAGAAA
GGGGAGCATG CCGATCTCGC ACTCCCGCTT ATCGAGCTTG CCCGCGGCAT CTCCGGCGGG
GCCGCCCCGT GA
 
Protein sequence
MKGAGKKYLV SAIVSTYKAE RFLRWKLEDL EAQTIAGELE IVVIDSGSPQ NERAIVEEFQ 
KRYDNIRYLR TEERETVYQA WNRGIRMATG EFVTNANTDD RLRNDAYEVL VRTLREHPEC
VLAYPDMRIT QKENATFDRH ASFGFRDWPE FNRLSLLELC CVGPFPLWRR SLHERIGYFD
ERFKSAADYE FWLRAALKYD FIHVPEFLGL YWLSEETVSR KGDLPTLEYL EVQKEYRARF
APVTPPPVEP TAGEWDRFHA LTARLEAGDA TVLPELERFE ASHPRAAAPH LELARIYYRM
GEIGYAKKHF EKAAIVDPFS KTYSDSLICF MKSELYQALQ HQTAVLSSNP DDLEAHLCAG
MILILLDRYQ AALEHYRRAL EIIPGNPLAA RNISFVEHRL LQKKPGSYYT CKRPEVRCMV
SRRARRVLDV GCAAGELGQA LKKRQGAEVW GVEPNSEAAA TARQALYRVL EASVEDALCS
LPQRHFDSIV AADVLEHLVD PERVLRELSG KLTRSGELIV SLPNVRHWSV VQGLLEGSWE
YADAGILDRT HLRFFTRRSA LALFEAAGYA VESVEPIALS GDEGMPKALL QALAEGGVLE
PTLEEESAAY QYLFRLVPKA SRLTSIVILT WNELSCTREC LESIERHTPE PHEVILVDNG
SSDGTVPFLR EFCAQRENYR LIENGKNLGF AAGCNIGMRE AVGGHILLLN NDTVVTRGWL
SGMIEALERD PKAGIVGPMT NEIAGPQKLA RVPYRDMEEL QAFAERFRGE HYGRRIEVDR
VVGFCMLFTR EVLETVGELD ERFGSGNFED DDFCLRGALA GYRCLIAGDV FIHHYGSRSF
TGNRVDYAAA MWKNRKAFDA KWDLAALDEV TAARVVTHNA TLRGAKFARR GKLNDAVELM
LQEGIRFSPA SPAPYLALAG ILCEAGKWRE ALEVLEQVPA GSELAAALMR GRAFKESGEP
TQTMEAAGEA EAIDPEAPGT LHLKGVLALW QGETERGEEL LRRAIALDPG FALPYGALAQ
AAWERGEREQ GVRLAELCFV LAPLELSALA RFHEFATACD RLPREEELLR EALEIHRDHK
GLCYGLIELL IRSGRYGEAM TEIEKGAARF GLDDGSIDAA LQIRKLAGPP LPCASGKGSV
TLCMIVKDEA RHLPAVLDSV RGLADELVVV DTGSSDRSCD IARIFGARLF SFPWNGSFAD
ARNFSLSQAL GEWILVLDAD EVIAADDAVA LKELAQRTAL PTAFSFTTRN YTHEVTRRNW
SANAGEYPAE EEGRGWTPSD KVRLFPNDPG IRFEGAVHEL VEPSLLRLGL PIHACDVPVH
HYGKLDAERC AQKQEAYYLL GLKKLEEDGG SVEALTELAR QATELNRGEE AQRLWHRLLQ
VHPENAEAYF NLGYLQLCAG EYPKARESAL KGAQLAPGMK EAAFNLAKCE LFLGNTEKAR
ESCREMLEKW PDYPPALSLS CVCLLLQGEK AQAQHLLQRL AAMRFDCADF LEEYAAGLKK
GEHADLALPL IELARGISGG AAP