Gene GM21_3507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3507 
Symbol 
ID8138879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4048003 
End bp4049214 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID644871126 
Productglycosyl transferase group 1 
Protein accessionYP_003023286 
Protein GI253702097 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value5.89342e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTAC TATTGATAGC GCTTCCGGAC AGCGTTCATA CCGCCCGGTG GATTTCCCAG 
ATCTCGGACC TGGGCTGGGA CATTCACTTG TTCCCCAGCA AGGACCTCGG TTTGATCCAC
CCGGATATGG CTGGTGTAAA GGCCTATGTT CCTTTGTACG GGAAAAGGGG ATGCAGTCGA
ACCGTCAGGA TTTCCGGCAT CTCGATTTTC AACGATTTCC TTTCCAAGGG TATCAGCTCA
GCTGTTTCCA AACTTGAAGG ATATCGGCAG TCTCAGGTTA ACCGGCTCCT TAGGGTAATA
CGGAAGGTGA GGCCGGATGT CATCCATTGC CTTGAACTGC AGCAAGCAGG GTACCTGGCC
CTGGAAGCGA AGAAGCTTCA TGCCGGGAAG TTCCCCCCCT TAATCGTGAC GAACTGGGGG
AGCGACATCT ACCTGTTTGG GCGTCTGGCG GAGCACGAGC CGAAAATCAG GGCATTGCTG
GCTGCGAGTG ACTATTACTC GTGTGAGTGC CGGCGTGACG TCTGCCTTGC CAAGGCTTAC
GGATTCAACG GCACTGTCCT TCCCGTCTTT CCCAATGCCG GAGGATTCGA TCTCGAACAG
GTGAAGCGGT TGCGGCAGCC AGGTCCGGTA TCTGCTCGCC GCCTTATCAT GCTCAAGGGG
TACCAGCACT GGGCAGGGCG CGCTTTGGTG GGACTTCGCG CGCTGGAGCG GTGCGCGGAA
GCACTTACCG GGTACGAGGT CGTTATCTAT GGAGCATCCT CGGAAGTGGC CCTTGCAGCA
GAACTCTTTT CTACATCCAC CGGCATTGCC ACCAAGATCA TCCCGCCTAA CTCACCTCAC
GAAGAGATCA TGCGGCATCA CGGAGCGGCG CGTTTTTCTA TAGGGCTTAG CATAAGCGAC
GGTATCAGCA CTTCCCTGTT GGAGGCCCTG GTCATGGGAT CCTTGCCGAT CCAGTCGTGG
ACGGCATGTG CTGATGAGTG GATCGAAGAC GGCGTGACGG GTTTGCTGGT TCCACCCGAA
GACCCGGATG TAATTGAGCA GGCAATTCGG AGGGCTCTTG CAGATGACGC GCTGGTCGAA
GGCGCTGCCA GTTGCAACTT TAAACTTGCA GAAGATAAGC TCGCACAGTC GAGCTTAAAG
CAAAAGACGG TGGAACTATA CCAGACGGTG TTGAACGATC TGGAGCGATC AGATAGTCCT
TTGTCTTCTT GA
 
Protein sequence
MKLLLIALPD SVHTARWISQ ISDLGWDIHL FPSKDLGLIH PDMAGVKAYV PLYGKRGCSR 
TVRISGISIF NDFLSKGISS AVSKLEGYRQ SQVNRLLRVI RKVRPDVIHC LELQQAGYLA
LEAKKLHAGK FPPLIVTNWG SDIYLFGRLA EHEPKIRALL AASDYYSCEC RRDVCLAKAY
GFNGTVLPVF PNAGGFDLEQ VKRLRQPGPV SARRLIMLKG YQHWAGRALV GLRALERCAE
ALTGYEVVIY GASSEVALAA ELFSTSTGIA TKIIPPNSPH EEIMRHHGAA RFSIGLSISD
GISTSLLEAL VMGSLPIQSW TACADEWIED GVTGLLVPPE DPDVIEQAIR RALADDALVE
GAASCNFKLA EDKLAQSSLK QKTVELYQTV LNDLERSDSP LSS