Gene GM21_2470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2470 
Symbol 
ID8137811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2888020 
End bp2890425 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content61% 
IMG OID644870080 
Productglycosyl transferase family 2 
Protein accessionYP_003022271 
Protein GI253701082 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones152 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATGGA CCGAAATCAC CATGCTGGCG CTACTTTTGC TCCTGGCCTA CATCTACCTG 
GGGTACCCGC TACTTCTTTG CCTGCTGGCG CGCCTCTTCC CCACGCCCCA CCATGTCGAT
GACGGTTTTC TTCCCACGGT CACTCTGGTC ATCTCCGCCT ACAACGAAAA AGCGGTTATC
AGGGCGAAAC TGGAAAATTC GCTGGCCCTT GACTACCCGC CGGAAAGGCT CTCCATCTTG
GTCGTGTCCG ACTGCTCCAG CGACGGGACT GACGCGGAGG TGCTCTCCTT CTCGGACAAA
GGGGTGAAGC TCATAAGGGC AGAGGAGCGC CGGGGGAAGA CGTCGGCCCT CAACACAGCG
CTTGCCGGTA TTGATAGTGA GATGGTGGTC TTCTCCGACG CCAACGCCAT CTACGACGCA
ATGGCCATCC GCCGCTTGGT GCGGCATTTC GCCGATCCCA ACATTGGCTA CGTCGTCGGT
TGCGCCCGTT ATATTGAGGA GACCGTCAGC GCCGCCGGCG GGAGCGAAGG GACTTACTGG
GACCTCGAGG TGATGCTGAA ACGGTGGGAA TCGCGCCTTT CGTCAGTGGT TGGAGGGGAC
GGCGCCATCT ATGCGATTCG CCGCTCGCTA TACGAGCCCC TGAGGGAAGA GGATATCAAC
GACTTCGTCA ACCCCCTTCA GATCGTCGCC AAGGGGTTCC GCGGCGTCTT CGACAATGAG
GCCTGGTGCG TGGAGCGTCC TGCCGGCGAC TTCCATAAGG AATTTTCCCG CAAGGTGCGC
ATCGTCAACC GCAGCTTCAG CGGATTTTTG CGGGTGCCCA AGGCTGCGAA CCCCTTTGTC
GTAGGTGCCT TTGCGTGGCA GCTCCTTTCG CACAAGGTCC TGCGCTGGTT TTCCCCTTAT
TTTCTGGGCC TGTTTCTGGG CTTGCTGGTG CTCGACCGGC TGTTGCATCC AGCCTCCCTC
ACCGGAGAGG TTTTGTTGGC CCTGACAGCC ATGGGTGCTT TCCTCTCCAT GCTAGGTGCG
CTCCTGAACC GTTTCTGGCG GCTCCCCCTG CCGCTGCTTC TTCCCTACTA CTTCGTACTG
GTGAATACGG CCTCCGCCCT CGGGGTGTTT TACCGCCTGT TGGGCCGGAC CATAGTGACC
TGGAGCACGG TAAGGGAGGG GGAGGCTGCG CCCCCAGCGC CGCTTTTGCC GCGCTTGGCC
CTTTTCTGCG CTGCGGTGCT GCTCGTGGTC ACGGCAAGCG GGTTGGCGCT GGATGAGGAT
GCCCTTGGCG CTGCCGCCAT CCTGCTGATG CTCCTGCTGG TTCACACCTT CGTCGGCTAC
CAGTTCCTTC TGCTGCCGCT GTCGCGTCTC TCACGCAGGA AACCGGCGCC GGACGAGAGC
TACACCCCCA CGGTCACTCT GCTGGTTGTC GCCTATAACG AGGCGAAGGT GATCGAGAAA
AAGCTCCTGA ACTCGCTGGC GCTTGAGTAC CCCAGCGACC GGTTGCGCAT CCTGGTGGCC
TCCGACGGCT CGAAAGACGA CACCGACCAG ATCGTCTCCC GATACCTGGA CAGGGGGGTC
GAGTTTATCT CCTTCCCAGT AAACCGCGGC AAGATTTCCG CCCTGAACGA CGCCATGCGG
CAGATCGACT CGGAGATCGT GGTTCTCTCC GATGCCAACG TCTACTACAT GCCCACCGCC
GTCAGGAACC TGGTGCGCAA CTTCGCCGAC CCGAGTGTCG GAGCGGTTTC GGGCAAGGTG
GTGCTTTTGA ACGACACCCT GAGCTACAGC GCCGCCGAGA AATCGTACTA CTCGATCGAG
CACCTGATAC AGGAACTGGA GGGAAGTCTC GGTGCCCTCA TCGGCGCCGA CGGCGCCATG
TACGCCATCC GCAGGCAGCT TTTCACCCCC CCAAGCCCCG ACACCATCCT CGACGACCTG
GTGATCGCGA TGGGGATCGC CAGGCAGGGG CATCTGGTGC TGCACGAAAA GGAGGCGCTT
GGCTTCGAGG AGAACCTTTT GGAGATCGTG GGAGAGTTCC GCCGCAAGGT GAGGATCATT
GCCGGCGGCT ACCAATGCCT CTTGCGCGGG GGGGTGATTC CCAGGCTGTC CCAGCCGCTG
CTCATGTTCT GTTTCATCTC CCACAAGCTG CTGCGCTGGG TGAGCGGTTA CCTTCTCATG
GCTCTTGTGG GGGTGCTGGT GCAGATCCAA CTGCGCAATG CATCACCTAC CTATGCCCTG
GTGCTGGCAG TGCTGCTCGC AGCTCTTGTC CTGGCTCTGC TGGTGCAACT GTTCCCGAAG
ATGAAGTCGA TCAAGATCGC GTCCCTTTGC CATTACTTCT ACATGCTGAT GGCCGCCTCG
CTAATCGGGG GGTACCGCGG CGTTACCGGG AGACAGCAGG TCACCTGGCG CAGGGAGGCA
GCCTAG
 
Protein sequence
MVWTEITMLA LLLLLAYIYL GYPLLLCLLA RLFPTPHHVD DGFLPTVTLV ISAYNEKAVI 
RAKLENSLAL DYPPERLSIL VVSDCSSDGT DAEVLSFSDK GVKLIRAEER RGKTSALNTA
LAGIDSEMVV FSDANAIYDA MAIRRLVRHF ADPNIGYVVG CARYIEETVS AAGGSEGTYW
DLEVMLKRWE SRLSSVVGGD GAIYAIRRSL YEPLREEDIN DFVNPLQIVA KGFRGVFDNE
AWCVERPAGD FHKEFSRKVR IVNRSFSGFL RVPKAANPFV VGAFAWQLLS HKVLRWFSPY
FLGLFLGLLV LDRLLHPASL TGEVLLALTA MGAFLSMLGA LLNRFWRLPL PLLLPYYFVL
VNTASALGVF YRLLGRTIVT WSTVREGEAA PPAPLLPRLA LFCAAVLLVV TASGLALDED
ALGAAAILLM LLLVHTFVGY QFLLLPLSRL SRRKPAPDES YTPTVTLLVV AYNEAKVIEK
KLLNSLALEY PSDRLRILVA SDGSKDDTDQ IVSRYLDRGV EFISFPVNRG KISALNDAMR
QIDSEIVVLS DANVYYMPTA VRNLVRNFAD PSVGAVSGKV VLLNDTLSYS AAEKSYYSIE
HLIQELEGSL GALIGADGAM YAIRRQLFTP PSPDTILDDL VIAMGIARQG HLVLHEKEAL
GFEENLLEIV GEFRRKVRII AGGYQCLLRG GVIPRLSQPL LMFCFISHKL LRWVSGYLLM
ALVGVLVQIQ LRNASPTYAL VLAVLLAALV LALLVQLFPK MKSIKIASLC HYFYMLMAAS
LIGGYRGVTG RQQVTWRREA A