Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3823 |
Symbol | |
ID | 8139197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4399675 |
End bp | 4404246 |
Gene Length | 4572 bp |
Protein Length | 1523 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871440 |
Product | glycosyl transferase family 2 |
Protein accession | YP_003023598 |
Protein GI | 253702409 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 0.588357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGGGG CCGGGAAGAA ATACCTGGTC TCCGCCATCG TCTCCACCTA CAAGGCGGAG CGCTTCCTGC GCTGGAAACT GGAGGACCTG GAGGCGCAGA CCATTGCGGG GGAGCTGGAG ATCGTGGTCA TCGACTCCGG GTCGCCGCAG AACGAGCGCG CCATCGTGGA GGAGTTCCAG AAGCGCTACG ACAACATCCG GTACCTGCGG ACCGAAGAGC GCGAGACGGT GTACCAGGCC TGGAACCGCG GCATCCGCAT GGCGACAGGC GAGTTCGTCA CCAACGCCAA CACGGACGAC CGCCTGCGAA ACGACGCCTA CGAGGTCCTG GTGCGGACGC TCAGGGAGCA CCCGGAATGC GTGCTCGCCT ACCCGGACAT GCGCATCACG CAAAAGGAGA ACGCCACCTT CGACCGGCAC GCCTCCTTCG GCTTCCGCGA CTGGCCCGAG TTCAATCGGC TGAGCCTGTT GGAGCTCTGC TGCGTCGGCC CCTTCCCGCT TTGGAGACGC TCGCTGCACG AAAGGATCGG CTACTTCGAC GAGCGCTTCA AAAGCGCTGC GGACTACGAA TTCTGGCTGC GCGCCGCACT TAAGTACGAC TTCATCCACG TCCCCGAGTT CCTTGGGCTT TACTGGCTTT CGGAGGAGAC CGTATCGCGG AAGGGGGATC TCCCCACCCT CGAATACCTG GAGGTGCAGA AGGAGTACCG CGCGCGTTTT GCGCCGGTCA CCCCTCCCCC TGTGGAGCCG ACGGCCGGGG AGTGGGACCG CTTCCACGCG CTGACGGCGC GTCTGGAAGC TGGGGACGCC ACGGTGCTTC CCGAACTGGA ACGCTTCGAA GCAAGCCACC CGCGCGCCGC CGCCCCCCAT TTGGAGCTGG CCCGGATCTA CTACCGCATG GGGGAGATCG GCTACGCCAA GAAGCACTTC GAGAAGGCCG CCATCGTTGA CCCCTTTTCC AAGACATACA GCGACAGCCT CATCTGCTTC ATGAAAAGCG AGTTGTACCA GGCGCTGCAG CACCAGACCG CCGTGCTGAG TTCGAACCCC GACGACCTGG AGGCGCACCT TTGCGCCGGA ATGATCCTCA TCCTGCTGGA TCGCTACCAG GCGGCCCTGG AGCACTACCG GCGCGCCCTG GAAATCATTC CCGGAAACCC ACTCGCCGCC AGGAACATCT CCTTCGTGGA ACACCGCTTG CTCCAGAAAA AGCCCGGCAG TTACTACACC TGCAAGCGCC CGGAGGTGCG GTGCATGGTG AGCCGCCGGG CGCGCCGGGT GCTGGACGTG GGATGCGCCG CGGGAGAACT GGGACAGGCC CTGAAGAAGC GACAGGGGGC CGAGGTCTGG GGGGTGGAGC CGAACAGCGA GGCCGCGGCG ACAGCCAGGC AGGCACTGTA CCGAGTGCTG GAGGCGAGCG TCGAGGACGC CCTCTGCTCT TTGCCCCAGA GGCACTTCGA CAGCATCGTC GCCGCCGATG TCCTGGAACA TCTGGTCGAC CCGGAACGGG TGTTGCGCGA GCTCTCAGGG AAACTCACCA GGTCCGGCGA ACTCATTGTC TCCCTCCCGA ACGTCAGGCA CTGGAGCGTG GTGCAGGGGC TTTTGGAGGG GTCGTGGGAG TACGCCGACG CGGGAATCCT GGACCGGACC CATCTCAGGT TCTTCACCCG GAGGAGCGCG CTCGCGCTCT TCGAGGCGGC CGGGTACGCG GTGGAAAGCG TGGAGCCGAT CGCCCTTTCC GGGGACGAGG GGATGCCTAA GGCCCTTTTG CAGGCCCTTG CCGAGGGGGG TGTGCTGGAG CCGACCCTGG AGGAGGAGAG TGCGGCCTAC CAGTACCTGT TCCGGCTGGT CCCCAAGGCC TCCCGGCTCA CCTCCATCGT CATCCTCACC TGGAACGAGC TCTCCTGCAC CCGCGAGTGC CTGGAATCGA TCGAGAGGCA CACCCCCGAG CCGCACGAGG TGATCCTGGT GGACAACGGC TCCAGCGACG GGACCGTACC TTTCCTGCGC GAGTTCTGCG CGCAAAGGGA GAACTACCGC CTCATCGAGA ACGGGAAAAA TCTCGGCTTC GCCGCGGGGT GCAACATCGG CATGCGCGAG GCCGTGGGGG GGCATATCCT CCTTCTGAAC AACGACACGG TGGTGACCCG CGGCTGGCTT TCCGGCATGA TCGAGGCCTT GGAGCGAGAC CCGAAGGCGG GGATCGTCGG CCCCATGACC AACGAGATCG CGGGACCGCA GAAGCTCGCC AGAGTCCCCT ACCGCGACAT GGAGGAGCTG CAAGCCTTCG CGGAGCGCTT CAGGGGCGAG CACTACGGGC GCCGCATCGA GGTGGACCGC GTGGTCGGCT TCTGCATGCT CTTCACCCGC GAGGTGCTGG AGACGGTAGG GGAGCTGGAC GAGCGCTTCG GCTCCGGCAA CTTCGAGGAC GACGATTTCT GCCTGCGGGG GGCTCTCGCA GGCTACCGCT GCCTCATCGC CGGCGACGTC TTCATCCACC ACTACGGCAG CCGCTCCTTC ACCGGCAACC GCGTCGACTA CGCGGCCGCC ATGTGGAAGA ACCGCAAGGC CTTCGACGCC AAATGGGACC TGGCCGCCCT GGACGAGGTA ACGGCGGCGC GGGTGGTGAC CCACAACGCG ACGCTCAGGG GCGCCAAGTT CGCCCGACGC GGGAAGCTGA ACGACGCGGT GGAACTGATG CTGCAGGAGG GGATCCGCTT CTCCCCCGCC TCCCCCGCCC CCTACCTTGC GCTCGCCGGG ATACTATGCG AGGCGGGGAA GTGGCGCGAG GCCCTGGAGG TCTTGGAACA GGTCCCGGCC GGCTCCGAGC TAGCCGCGGC CCTGATGCGG GGGCGCGCCT TCAAGGAATC GGGGGAACCG ACGCAGACTA TGGAAGCCGC CGGGGAGGCC GAGGCGATCG ACCCGGAGGC CCCGGGAACG CTCCATCTGA AGGGGGTGCT CGCGCTGTGG CAGGGGGAAA CCGAAAGAGG CGAGGAGCTC TTGCGGCGCG CCATCGCCCT CGACCCCGGC TTCGCCCTCC CCTACGGCGC CCTGGCCCAG GCCGCCTGGG AGCGCGGGGA GCGCGAACAG GGGGTGAGGC TCGCCGAGCT TTGCTTCGTG CTGGCCCCCT TGGAGCTCTC GGCGCTCGCG CGCTTCCACG AATTCGCCAC CGCCTGCGAC CGGCTTCCCC GCGAGGAGGA GCTGTTGCGG GAAGCGCTGG AGATCCACCG GGACCACAAG GGGCTTTGCT ACGGCCTGAT CGAGCTTTTG ATCCGCAGCG GCCGCTATGG CGAGGCGATG ACGGAGATCG AGAAGGGGGC CGCCCGCTTC GGGCTCGACG ACGGGAGCAT CGACGCAGCG CTGCAGATAA GGAAACTGGC CGGGCCTCCG CTTCCTTGCG CCAGCGGCAA GGGGAGCGTC ACGCTCTGCA TGATCGTCAA GGACGAGGCG AGGCACCTCC CCGCCGTCCT CGACTCGGTG CGGGGGCTCG CCGACGAACT GGTGGTGGTC GACACCGGCT CCAGCGACCG CAGCTGCGAC ATCGCCCGGA TCTTCGGGGC CAGGCTCTTC AGCTTCCCCT GGAACGGGAG CTTCGCCGAC GCCCGCAACT TCTCCCTCTC GCAGGCGCTG GGAGAGTGGA TCCTGGTGCT CGACGCGGAC GAGGTGATAG CGGCAGACGA CGCGGTGGCC CTCAAGGAGC TGGCGCAAAG GACAGCCCTT CCCACAGCGT TCTCCTTCAC CACGAGGAAC TACACCCACG AGGTGACCCG CCGCAACTGG AGCGCCAACG CGGGGGAATA CCCCGCCGAG GAAGAGGGGC GCGGCTGGAC CCCGAGCGAC AAGGTGAGGC TCTTCCCGAA CGACCCGGGT ATCCGCTTCG AGGGGGCGGT GCACGAGCTG GTGGAGCCGT CGCTGCTCCG CCTCGGCCTC CCCATCCACG CCTGCGACGT CCCGGTGCAC CACTACGGAA AGCTCGACGC CGAGCGCTGC GCCCAGAAGC AGGAGGCGTA CTACCTTCTG GGGCTGAAGA AGCTGGAAGA GGACGGGGGG TCGGTGGAGG CGCTCACCGA GCTGGCGCGG CAGGCGACCG AGCTGAACCG GGGCGAGGAG GCGCAAAGGC TGTGGCACCG GCTTTTGCAG GTGCACCCGG AGAACGCGGA GGCCTATTTC AACCTGGGGT ACCTGCAGCT TTGCGCCGGA GAGTACCCGA AGGCGCGGGA GAGCGCGCTC AAGGGGGCCC AGCTCGCGCC GGGGATGAAG GAGGCCGCCT TCAACCTGGC CAAGTGCGAG CTCTTCTTGG GGAACACGGA AAAGGCGCGG GAGAGCTGCC GCGAGATGCT GGAGAAATGG CCCGATTACC CCCCCGCCCT GTCGCTTTCC TGCGTCTGCC TCCTCTTGCA GGGGGAGAAG GCCCAGGCGC AACACCTTTT GCAAAGGCTC GCCGCCATGC GCTTTGACTG CGCCGATTTC CTGGAAGAGT ACGCGGCAGG GCTCAAGAAA GGGGAGCATG CCGATCTCGC ACTCCCGCTT ATCGAGCTTG CCCGCGGCAT CTCCGGCGGG GCCGCCCCGT GA
|
Protein sequence | MKGAGKKYLV SAIVSTYKAE RFLRWKLEDL EAQTIAGELE IVVIDSGSPQ NERAIVEEFQ KRYDNIRYLR TEERETVYQA WNRGIRMATG EFVTNANTDD RLRNDAYEVL VRTLREHPEC VLAYPDMRIT QKENATFDRH ASFGFRDWPE FNRLSLLELC CVGPFPLWRR SLHERIGYFD ERFKSAADYE FWLRAALKYD FIHVPEFLGL YWLSEETVSR KGDLPTLEYL EVQKEYRARF APVTPPPVEP TAGEWDRFHA LTARLEAGDA TVLPELERFE ASHPRAAAPH LELARIYYRM GEIGYAKKHF EKAAIVDPFS KTYSDSLICF MKSELYQALQ HQTAVLSSNP DDLEAHLCAG MILILLDRYQ AALEHYRRAL EIIPGNPLAA RNISFVEHRL LQKKPGSYYT CKRPEVRCMV SRRARRVLDV GCAAGELGQA LKKRQGAEVW GVEPNSEAAA TARQALYRVL EASVEDALCS LPQRHFDSIV AADVLEHLVD PERVLRELSG KLTRSGELIV SLPNVRHWSV VQGLLEGSWE YADAGILDRT HLRFFTRRSA LALFEAAGYA VESVEPIALS GDEGMPKALL QALAEGGVLE PTLEEESAAY QYLFRLVPKA SRLTSIVILT WNELSCTREC LESIERHTPE PHEVILVDNG SSDGTVPFLR EFCAQRENYR LIENGKNLGF AAGCNIGMRE AVGGHILLLN NDTVVTRGWL SGMIEALERD PKAGIVGPMT NEIAGPQKLA RVPYRDMEEL QAFAERFRGE HYGRRIEVDR VVGFCMLFTR EVLETVGELD ERFGSGNFED DDFCLRGALA GYRCLIAGDV FIHHYGSRSF TGNRVDYAAA MWKNRKAFDA KWDLAALDEV TAARVVTHNA TLRGAKFARR GKLNDAVELM LQEGIRFSPA SPAPYLALAG ILCEAGKWRE ALEVLEQVPA GSELAAALMR GRAFKESGEP TQTMEAAGEA EAIDPEAPGT LHLKGVLALW QGETERGEEL LRRAIALDPG FALPYGALAQ AAWERGEREQ GVRLAELCFV LAPLELSALA RFHEFATACD RLPREEELLR EALEIHRDHK GLCYGLIELL IRSGRYGEAM TEIEKGAARF GLDDGSIDAA LQIRKLAGPP LPCASGKGSV TLCMIVKDEA RHLPAVLDSV RGLADELVVV DTGSSDRSCD IARIFGARLF SFPWNGSFAD ARNFSLSQAL GEWILVLDAD EVIAADDAVA LKELAQRTAL PTAFSFTTRN YTHEVTRRNW SANAGEYPAE EEGRGWTPSD KVRLFPNDPG IRFEGAVHEL VEPSLLRLGL PIHACDVPVH HYGKLDAERC AQKQEAYYLL GLKKLEEDGG SVEALTELAR QATELNRGEE AQRLWHRLLQ VHPENAEAYF NLGYLQLCAG EYPKARESAL KGAQLAPGMK EAAFNLAKCE LFLGNTEKAR ESCREMLEKW PDYPPALSLS CVCLLLQGEK AQAQHLLQRL AAMRFDCADF LEEYAAGLKK GEHADLALPL IELARGISGG AAP
|
| |