Gene GM21_3504 details

Gene Information

Locus tag: GM21_3504
Symbol:
ID: 8138876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4044819
End bp: 4046021
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 65%
IMG OID: 644871123
Product: glycosyl transferase group 1
Protein accession: YP_003023283
Protein GI: 253702094
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
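
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4044819
end_bp = 4046021

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that codes for no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1203 bp
print(protein_length, "aa")  # 400 aa
```

Both results agree with the Gene Length and Protein Length fields.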


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 3.0009700000000003e-28
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGAAGATCG TCTTCCTGGC TCCCTTCGGC ATCCGCCCCA AGGGCACTGT CATCGCCCGG 
ATGCTACCGC TGGCTGTAGA ACTGCAGGGG TTGGGGCACG AGGTCGTCAT CGTGGCGCCT
CCCTACACGA ACCCTGAGGA TTCGGGAAAG ACCGAAACGG TGCGGGGGGT ACGGCTGGTG
AACGTCCTTC TCGGGCCCAA GCACAAGGCA CTCGCCGCGC CCTTCCTCGC CTGGCGCATG
CTGCGCGCGG CGTTGGCCGA GCGCCCTGAC CTGATTCATC TCTTCAAGCC CAAGGGGTAC
GGCGGCATCG CCGGCATGCT CCTCATCTCG CTGCAGCGCC TGGGAATCAG GATGCCGCCG
CTTTTCCTCG ACACCGACGA CTGGGAAGGC GAGGGGGGGA TGAACGAACT GCACGACTAC
TCCGGCGTCG AGAAGCGCTT TTACCGGTTC CAGGAACAGT GGATCACGCA GCACGCGGTG
GGGGTGACGG CGGCGAGCCG GGAACTGGAG CGGCTGGTAA CGGAGATGGG TGTTCCGGGG
GGGCGGATGC TTTATCTTCC CAACTGCGTC GGTGCGGCGC CCGCCGTCGA CGGAGCCGGG
GCCCGAGCCC GGCTCGGCAT CGCTCCGGAC GCGCCGGTCG TCCTTCTCTA CACCCGCTTC
TTCGAGTTCA GCCAGGAAAA GCTGCACTAC CTTTTCGCCG AATTGTTCAA GCAGATGCCG
CAGGTCCGCT TCCTGGTGGT GGGGAAGGGG CGTCACGGGG AGGAGGACCT GCTTGCCAAG
GCGGCAAGGG AGTCTGGCTT CGACGCAGCG CTGGCCATGG CCGGATGGGT GGCCCCGGAG
GCGATCCCCG ACCTGCTGGC GGCCGGAAAC GTCGCCATCT ACCCCTTCGC ACAGAACCTG
GTGAACCGCA CGAAGTGCCC GGCAAAGCTT ACCGAGATCC TCCTGGCGGG GACTCCGGCC
GTCGGCGACC GCGTCGGGCA GTTGACCGAG TACATCGACG ACGGGCGCTC CGGCATCCTC
TGCGACCCGG ACGATTGGCG GCAGATGGCG GATGAGACCC TGGCGTTGCT CCGTTCGCCG
GAGAGACAGC GGCAGATGGG GGAGCACGCA CGCCTTTATC TGCAGGAAAA CTTCAACTGG
AAGGATGCGG CGCTTCGGCT CGATGACTTC TATCGCAGGA ACGCCGGCAC CTCGAAAAGT
TGA
 
Protein sequence
MKIVFLAPFG IRPKGTVIAR MLPLAVELQG LGHEVVIVAP PYTNPEDSGK TETVRGVRLV 
NVLLGPKHKA LAAPFLAWRM LRAALAERPD LIHLFKPKGY GGIAGMLLIS LQRLGIRMPP
LFLDTDDWEG EGGMNELHDY SGVEKRFYRF QEQWITQHAV GVTAASRELE RLVTEMGVPG
GRMLYLPNCV GAAPAVDGAG ARARLGIAPD APVVLLYTRF FEFSQEKLHY LFAELFKQMP
QVRFLVVGKG RHGEEDLLAK AARESGFDAA LAMAGWVAPE AIPDLLAAGN VAIYPFAQNL
VNRTKCPAKL TEILLAGTPA VGDRVGQLTE YIDDGRSGIL CDPDDWRQMA DETLALLRSP
ERQRQMGEHA RLYLQENFNW KDAALRLDDF YRRNAGTSKS
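
The gene sequence begins with TTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, which permits alternative start codons that are read as methionine at the first position. The following is a minimal sketch of that behavior, not the method used to produce this record: `translate_cds` is a hypothetical helper, the codon table is built by hand from the standard NCBI ordering, and the start-codon set is an assumption taken from the NCBI description of table 11.

```python
# Standard genetic code in NCBI order (bases T, C, A, G). Table 11 uses the
# same amino-acid assignments and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: start codons listed for NCBI translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # A permitted start codon in first position is read as methionine.
        if pos == 0 and codon in TABLE_11_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
first_block = "TTGAAGATCG TCTTCCTGGC TCCCTTCGGC"
print(translate_cds(first_block))  # MKIVFLAPFG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared with the listed protein.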