Gene GM21_2467 details

Gene Information

Locus tag: GM21_2467
Symbol:
ID: 8137808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2884377
End bp: 2885615
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 644870077
Product: glycosyl transferase group 1
Protein accession: YP_003022268
Protein GI: 253701079
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
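
The start, end, gene length, and protein length fields can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2884377, 2885615)
print(length)                  # 1239
print(protein_length(length))  # 412
```

Both values agree with the Gene Length and Protein Length fields of this record.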


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 158
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCGA GACCGACACA AGGGGTGCCG CGGGTGATGG ACCTGCGGGG AACCTACAAG 
GGAGGGGGAG GGCCGGACAA GACGGTCCTG AACTCGGCGG CGCAGCACGA CCCGGCGCGG
GTCTACGTGC TGGTGACCTA TCTGCGCCAG CCTGACGACC ACGAGTTCCA GATCCCGGAG
ATGGCCAAAA AGCTCGGCAT CGACTACGTC GACCTCTGCG ACGGGAGCAC CCTCGACCTG
GCCTGCCTGC GCGGGCTCGC GGCGCTTTTG GACCGGCACC AGCTGGAGGT CGTGCACGCC
CACGACGACA AGACGCTCCT CTACGCCTAC ATCCTGAGGC TGATGCGCCC GGGTCTGCGC
ATCCTCTATA CCTGCCACTC CCACGCCGTG ATGCTGCGCG AAGATTTCCG CTCGCTTGCG
GCCTACCTGA AATTCCGGGC GCGCCAGAAG CTGCAGATCT GGCTCATGTG TCAGTACCTG
AAGCCGGTCA TCACCGTCTC CAACGACACC CGCGACCGGC TGGTGGCAAA CGGGGTGGAC
GAGGGCGGAG TCGCCGTGCT CCATAACGGC ATCGATACCT CCGTCTGGCA GCGCGCCGGG
AGCACCCCGG TGCTGCGCGA CGAGCTCAAG ATAGGCGAGG GGGGGCTATT GGTCGGGACC
GTCGCCCGCA TCACGCCGGA GAAGGATCTC GGCACCTTCT ACGAGGTGGC CAGGCGCGTG
GCCCTGGAAC TTCCCGAAGT GCGCTTCGCG ATCGTAGGGG ACGGCTACGG AGACGAGCTG
GAGCAGGCGC GGGGCGAAGT GGCGCGCCTG GGGTTAGAGA AGGTGGTGCA CTTCACCGGG
CACAGAAACG ACCTGCGCGA CGTCTACGTC TCCTTCGACG TCTTCCTGAT GACCTCCGTC
ACCGAAGGAC TCCCCAACAC GCTTTTAGAG GCGATGGCGC TAGGCGTTCC CTCCGTCTCC
ACCGACGTGG GCGGGATACC GGAGTTGCTG CAAGACGGCG AGGGGGGATA TCTCGCCCCT
GCCGGCGACG CGGAAAAACT GGCGCGGCGG GTGCTTGAGC TTTTGGGCTC GGCGGACCTG
CGGGAGCGCT TCTCGCGGCA GTGCCGCGAG CGGATCGAGC GGCATTTCTC CTTCGGGCGC
AGGGTCCGCC TCATGGAGGA TTACTACCAC TGGTTTGCCG GTTGCGGGAA TCGCCCGGAT
CAGGAAGCCG CCACCGAGGA ACTCCGCTAT GTCGGTTAA
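
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before its length or GC content can be computed. A minimal sketch follows, assuming GC content is defined as the percentage of G and C bases over the full sequence; how the database rounds the reported figure is not stated here. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence listing, used here only as a short example.
last_line = clean_sequence("CAGGAAGCCG CCACCGAGGA ACTCCGCTAT GTCGGTTAA")
print(len(last_line))  # 39
```

Passing the whole listing through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.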
 
Protein sequence
MEPRPTQGVP RVMDLRGTYK GGGGPDKTVL NSAAQHDPAR VYVLVTYLRQ PDDHEFQIPE 
MAKKLGIDYV DLCDGSTLDL ACLRGLAALL DRHQLEVVHA HDDKTLLYAY ILRLMRPGLR
ILYTCHSHAV MLREDFRSLA AYLKFRARQK LQIWLMCQYL KPVITVSNDT RDRLVANGVD
EGGVAVLHNG IDTSVWQRAG STPVLRDELK IGEGGLLVGT VARITPEKDL GTFYEVARRV
ALELPEVRFA IVGDGYGDEL EQARGEVARL GLEKVVHFTG HRNDLRDVYV SFDVFLMTSV
TEGLPNTLLE AMALGVPSVS TDVGGIPELL QDGEGGYLAP AGDAEKLARR VLELLGSADL
RERFSRQCRE RIERHFSFGR RVRLMEDYYH WFAGCGNRPD QEAATEELRY VG
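
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). It stops at the first stop codon. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 39 bases of the gene sequence listing.
print(translate("ATGGAGCCGA GACCGACACA AGGGGTGCCG"))               # MEPRPTQGVP
print(translate("CAGGAAGCCG CCACCGAGGA ACTCCGCTAT GTCGGTTAA"))     # QEAATEELRYVG
```

The two outputs match the first and last residues of the protein sequence listed in this record.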