Gene GM21_3834 details

Gene Information

Locus tag: GM21_3834
Symbol:
ID: 8139208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4420273
End bp: 4421331
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 64%
IMG OID: 644871451
Product: glycosyl transferase group 1
Protein accession: YP_003023609
Protein GI: 253702420
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
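
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single untranslated stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4420273
end_bp = 4421331
gene_length_bp = 1059
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which is not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_aa)
print("matches record:", span_bp == gene_length_bp and expected_protein_aa == protein_length_aa)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.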


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value: 0.00734924
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAGTAT CGATGCCTGC CGGCGCCATG CACGGGTGGG GGATCGCCGG CAGCTACCTG 
GAGCGCGAGA TCTCCAAACT CCCCGGCATC GAGGGGGTGA CGCTGCACTG CATGACCAAC
ACCCTGGCGC CGCTGCGCCC GGAGAGCTGG GACTCCATCA ACATCGGCTA CTGCTTCTTC
GAGGACAGCA TCGAGATCCT CAACTTCACC CGTGACGCCG CGCGCCAATG GGATTTCATC
GTCGCGGGTT CCAAGTGGTG CGAGTACCAA CTGAGGATCG GCGGGGTGAA AAACACCTGC
ACCATCCTGC AGGGTATCGA CCCGACCAAC TTCCACCCGG TCCCCTACCC GGCGGACGAC
CGCTTCGTGG TCTTCTCGGG GGGCAAATTC GAACTCCGCA AGGGTCAGGA CCTGGTGATC
GCCGCCATGA AGGTGATGAT GCAGCGGCAC CGCGACGTCT TCCTCTCCTG CAGCTGGACC
AACCAGTGGC CTTTTTCGCT CGCCACCATG CAGTCGTCCC CCTACATAAC CTACCGACAC
GACGAGGAGA ACTTCCTCGA CCTCCCGGGG AGATGCGTGC TCGACAACGG GCTGGACCCC
GCCCGGGTGG CGGTGCATCC CCTGGTGAAC AACGCCCTCA TGCGCGAAAT ATTCGCCGGG
AGCCACTTAG GCCTCTTCCC CAACCGCTGC GAGGGGGGGA ACAACATGGT GATGTGCGAG
TACATGGCCT GCGGCAGGAG CGTCATCGCC TCGGATACCA GCGGCCACGC CGACGTGATC
AACTCCGCCA TCGCCTACCC CCTTACCCGC TACCGCCCCA TGGTGGTGGC GACCCAGGGG
GTGCAGACCG GGGTCTGGGA GGAGCCGCAG GTGGAAGAGA TCATAGAGCT CCTGGAACTC
GCCTACCTAA ACCGCGACCA GCTTCCCGCC AAGGGGGCGC TGGCGGCCCG GGAGATGGAG
AAGCTAAGCT GGGGCGCCGC GGCGCGGCAG TTCCACTACA TCGCCACCAG GCTCGCCAAT
CAGGCGGAGC TCGCCAGGAT GCAGCAGGAT GCCTGCTAG
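
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is the fraction of G and C bases over all bases and that the spaces between 10-base blocks are only formatting. The function name `gc_content` is hypothetical, and the example uses only the first line of the sequence, so its result need not equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence, copied from the record (illustration only).
first_line = "ATGAAAGTAT CGATGCCTGC CGGCGCCATG CACGGGTGGG GGATCGCCGG CAGCTACCTG"
print(f"GC fraction of first 60 bases: {gc_content(first_line):.2f}")
```

To reproduce the figure for the whole gene, pass the full sequence as one string instead of `first_line`.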
 
Protein sequence
MKVSMPAGAM HGWGIAGSYL EREISKLPGI EGVTLHCMTN TLAPLRPESW DSINIGYCFF 
EDSIEILNFT RDAARQWDFI VAGSKWCEYQ LRIGGVKNTC TILQGIDPTN FHPVPYPADD
RFVVFSGGKF ELRKGQDLVI AAMKVMMQRH RDVFLSCSWT NQWPFSLATM QSSPYITYRH
DEENFLDLPG RCVLDNGLDP ARVAVHPLVN NALMREIFAG SHLGLFPNRC EGGNNMVMCE
YMACGRSVIA SDTSGHADVI NSAIAYPLTR YRPMVVATQG VQTGVWEEPQ VEEIIELLEL
AYLNRDQLPA KGALAAREME KLSWGAAARQ FHYIATRLAN QAELARMQQD AC
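
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. This is a hedged illustration, not the database's own method: it assumes the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) paired with
# the amino acid string used for the standard code and, by assumption, table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and the last line of the gene sequence from this record.
print(translate("ATGAAAGTAT CGATGCCTGC C"))
print(translate("CAGGCGGAGC TCGCCAGGAT GCAGCAGGAT GCCTGCTAG"))
```

The two fragments translate to the first seven and last twelve residues of the protein sequence listed above. Passing the full gene sequence as one string allows the whole protein to be compared in the same way.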