Gene GM21_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1650 
Symbol 
ID8136981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1919849 
End bp1921888 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content61% 
IMG OID644869263 
Productglycosyl transferase family 2 
Protein accessionYP_003021463 
Protein GI253700274 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones189 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC CATCCATCAG TGTCTGCATC CTTTTCTACG AGAAACCGGA ACAGACCATC 
CACTGCGTCG AAAGCCTGCT CCCGGCCGGA GCCCCGATCG TTGTCCTCAA CAACGGCTCC
TCGGCCGCGG CGACGGAAAG GTTCAGCGCA TGGGCCGCCG GCCACGCCCA AATCAGGGTG
CTGCACTCCG ACCGCAACCT GGGTGTGAGC GCAGGGCGCA ACCTTCTTGC CGCGCAGACT
AGTGAAGAGT GGCTGCTCTT TGTCGACAAC GACATCGTGA TGGAGACCCC GCAGTGGTTG
CACCGGCTGC AGCGCTACGC TGACCTGGAT GTCGAGGTCA TCGTGCCGCG GCTTTACAAC
CACCACGACG GTTGCTACGC GAAATTCATG GACATCGAGG TGCGGGGGGG GCGGGCTGGC
TTCGTGCCTC TCCGGGACAA CCGCCTGAAC AATTTCCCCG GTGGCGCTTC TTTTGTCAGG
AGGACTCTCT TCGATCGCCT GGGTGGGTAC GACGAGGAGC TTTTCGTCGG CGTAGAGGAT
TTCGAGCTGA GCCTGCGGGG GGTGCTCGCC GGGGCTCCGG TACGCGCGCT GCTCGTGCCG
GACATCGAAC TCACCCATAA CCACCTTGCC TCTCCGGGGA GCGACGCCGA CAGGATGGCG
GTGCGCAACC GTTACGACAA GGAGGTGCTC GGGAACTCCT ATCGGCGCAT CGCGGTGAAG
CACGGCCTGC TCTTCGAGGA TTTCTGGGAG GAATGGGTGG ACCAGCAGAT GGTTTCTTTG
GGAGTGTCGA CGTGGCCGGA CGAACAGGCA GAGGCGTTGC GGCGGACTGG CAGACCGAGG
ATCGCCCTAG TCGCCGATGT GGAGAACTGG GCCTTTGCCA ACATCTCGCG CAACGTGGTC
AGCCATCTCT CGGAAAGGTA CGACCTGCAG ATCGTCTACA GCTCCCATTA CGGGCATGAC
TACGACCGTC TCCTCTGCGA CCTGTACGAG CAGGGTTACG ACCTGGTGCA CTTCTTCTGG
CGCGGCACCA TCCTCTGCCT ATACCTGCAC CTGCTGCGCA CAATGCCGAC CCGCACGCCA
TCGGTCCCCG AGTCGTTCAT CAGGACAAAG CTCACCTTTT CCATCTATGA TCATTGTCTG
CTTGGGGAGA AGGACCTTCT GGAGTACCGG ATACTGTTCA AGTACCTGGC CGACGGCTAC
GTGGTCAGCT CGCAGCGATT GTTCCACACC TACACCATGC TGCCGCATTA CCCGTCCCCC
TGCCGTATCA TCGAGGACGG CGTCGACCTC GCGCGGTTCA CCCCGTGCAA CACGCAGCGA
CTGCTGGACC GGGACCGGCC TCTGGTCGTC GGGTGGGCGG GTAACAGCGA CTGGGGGATG
GACCTGGACG CCAACGACTT CAAGGGGTTC AATACCATAA TCAAACCCGC TCTAGCCCAG
CTCAGGCAGG AAGGGTACGA GGTAGTGGGG CGGTTCGCCG ACCGCAAGGT CAAGCAGGTC
CCCTATGACA AGATGGTGGA GTATTACAAC TCGATCGATG TCTATGTCTG CGCCTCCTCG
ATCGAGGGGA CGCCCAACCC CGTGCTGGAG GCGATGGCCT GCGGCGTCCC GGTGATCTCG
ACCGATGTGG GAATCGTGCC GCAACTTCTC GGCCGGGAGC AGAAGAGGTA CATCCTCATG
CACCGCTCCG TGGAGGCGTT GAAAGAAAAG CTGGCCGCTT TGGCGAAGGA CCCGGAGGAA
AGGATGAAAC TCTCCGTGGA AAACCTGGAG CGCATCAAGG GGTTCACGCG GGAGATGGAA
GTGCCGAAAT GGGACGATTT CTTCCAGGCC ATGTTGAGCC GGGACACCGC CTCCAGGGAG
CCGCTTAAGC GGGCGCTCCT CGAAGTTCCT TACAACTTCG GCATCGAGCA TACGGTCGAT
CTCTTCCTTC AAAGATCCCT GAGCTGGCGC ATTACCATGC CGCTGCGCCT TTTACAGTGG
CACTCAACTC GGCTGCAAAA CAAGCTGAGG GTTAAGCTGC GCAACCGCCG GGGAAGGTGA
 
Protein sequence
MSLPSISVCI LFYEKPEQTI HCVESLLPAG APIVVLNNGS SAAATERFSA WAAGHAQIRV 
LHSDRNLGVS AGRNLLAAQT SEEWLLFVDN DIVMETPQWL HRLQRYADLD VEVIVPRLYN
HHDGCYAKFM DIEVRGGRAG FVPLRDNRLN NFPGGASFVR RTLFDRLGGY DEELFVGVED
FELSLRGVLA GAPVRALLVP DIELTHNHLA SPGSDADRMA VRNRYDKEVL GNSYRRIAVK
HGLLFEDFWE EWVDQQMVSL GVSTWPDEQA EALRRTGRPR IALVADVENW AFANISRNVV
SHLSERYDLQ IVYSSHYGHD YDRLLCDLYE QGYDLVHFFW RGTILCLYLH LLRTMPTRTP
SVPESFIRTK LTFSIYDHCL LGEKDLLEYR ILFKYLADGY VVSSQRLFHT YTMLPHYPSP
CRIIEDGVDL ARFTPCNTQR LLDRDRPLVV GWAGNSDWGM DLDANDFKGF NTIIKPALAQ
LRQEGYEVVG RFADRKVKQV PYDKMVEYYN SIDVYVCASS IEGTPNPVLE AMACGVPVIS
TDVGIVPQLL GREQKRYILM HRSVEALKEK LAALAKDPEE RMKLSVENLE RIKGFTREME
VPKWDDFFQA MLSRDTASRE PLKRALLEVP YNFGIEHTVD LFLQRSLSWR ITMPLRLLQW
HSTRLQNKLR VKLRNRRGR