Gene GSU1976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1976 
Symbol 
ID2686175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2166708 
End bp2167829 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content64% 
IMG OID637126667 
Productglycosyl transferase, group 1 family protein 
Protein accessionNP_953025 
Protein GI39997074 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT TGCACGTCAT CGACAGCGGT GGCCTGTACG GAGCCGAAGT CATGCTCCTC 
AATCTCGCTG CCGAGCAGGC CGCCATGGGG CTTGAACCGG TCATCGCCAG CATCGGCGAT
CCCCTCTGCG GCGAAAAACC GCTGGAAAAG GAGGCAGTAC GGAGGGGGTT GCGGGTGGAG
AGATTTCGCA TGAGGCCGGG GGCGAATATT GCCGGCGCCT TCAGCGTGCT TCGTTTCGCG
TGGCGTGAGC AATGCGACGT GCTCCATTCC CACGGCTACA AGGGGAATAT CCTGTTCGGC
TTCATGCCGC GGGCGCTCCG CCGGCTGCCA ATGGTCACCA CTCTTCATGG CTGGACCTGG
ACTGGCGGGA TGGACCGGAT GGGCCTCTAC GAATGGCTCG ACCGACTGAG CCTGCGCTTT
GTGGATGCGG TGGTGATGGT GAACGACGCC ATGCGCCGGA AGATCGACCT TCCCGGCATT
CACGTGGTGC CTAACGGCAT CCCGCTCGCC GGAGAGGCCG AGCGGCCCGC GGTGCCCCTC
GACCCCCGGA TCGTAGAGTT CTGCCGGGGA GGCATCACCC TGGGCGCAAT AGGCCGTCTG
TCCCCGGAAA AGGGGTTCGA TATCCTGCTG GACGCGGTCA GGGAGGTGGC GGAGACGAAT
CCCGGAGTCC GGCTGGCACT CCTCGGGGAG GGAGTCGAGC GAGACGCCCT GGAGGCGAAG
ATCCGGGAAC TGGGGCTGAC GGAAAGGGTG CTGCTGCCGG GATATGTGCC GGACGCCAAT
CGCTACCTGC CCCTGTTCCG GGCGTTTGTG CTCTCGTCGC TGACCGAAGG GCTTCCCATG
GTCATACTTG AAGCAATGCT GGCCGGGGTC CCGATTGTCG CCACAAGGGT AGGGGGCGTG
CCCGAAGTGC TGGATGGCGG TGCAGCCGGT CTTCTGGCTG AACCGCGCCA TGCTGGCAGC
CTTGCAGGGT GCGTGTCGCG CCTGATCGGA GACGACCTAC TGGCCGCGCG TCTCGCGGAG
CGGGGAAGAC ACTTGGTCGA AACACGCTAC GCAGCCGGCG CGATGGCCAT CAAATACAGC
GAAATCTATG ACGGTGTTCA TCCCGCCATA CATCGAAAGT GA
 
Protein sequence
MKVLHVIDSG GLYGAEVMLL NLAAEQAAMG LEPVIASIGD PLCGEKPLEK EAVRRGLRVE 
RFRMRPGANI AGAFSVLRFA WREQCDVLHS HGYKGNILFG FMPRALRRLP MVTTLHGWTW
TGGMDRMGLY EWLDRLSLRF VDAVVMVNDA MRRKIDLPGI HVVPNGIPLA GEAERPAVPL
DPRIVEFCRG GITLGAIGRL SPEKGFDILL DAVREVAETN PGVRLALLGE GVERDALEAK
IRELGLTERV LLPGYVPDAN RYLPLFRAFV LSSLTEGLPM VILEAMLAGV PIVATRVGGV
PEVLDGGAAG LLAEPRHAGS LAGCVSRLIG DDLLAARLAE RGRHLVETRY AAGAMAIKYS
EIYDGVHPAI HRK