Gene GSU0624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0624 
Symbol 
ID2687376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp661333 
End bp662484 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content66% 
IMG OID637125291 
Productglycosyl transferase, group 1 family protein 
Protein accessionNP_951682 
Protein GI39995731 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGCCC CCACGCCGTT TTTCGCCGAC CGCGGGTGCC ACGTCAGGAT TTACGAAGAG 
GCCCGGACGC TCATTGCCCG CGGGCACCAG GTGCGGATAG TCACCTACCA TCTGGGGCGG
GACATGGCGG GCATCCCCAC GGACCGCACC CTGCGCGTCC CCTGGTACAC GAAGCTTTCC
GCCGGACCCT CCTGGCACAA GCCCTACCTG GATGTTCTCC TCTGCGGCAC CGCCCTGCGC
ACCGCCCGGC GCCTGCGCCC CGACCTGATT CATGCCCATC TGCATGAAGG GGCGTTCTTC
GGCGTGTTCC TGAAAAAACT GATGGGTATC CCCCTTCTGT TCGACTGTCA GGGGAGCCTC
ACCATGGAGC TGGCCGACCA CGGGTTTGTC CGCGAAGGGT CGCTGCTCTA CCGTTTTTTC
GCCCTAATGG AGGGGGGAAT CAACCGCAGC GCCGATGCCA TCGTGACCAG TTCGGGGCCG
GGCAGGGACG ATCTCGTCAC GAAATGGGGG GTGCCGGCTG TAAAGGTGAC GGCCCTCATG
GACGGGGTCG ATACCGCGGT GTTCCGTCCC CATGACCGCA CGGAGGTTCG CCGCCGGCTC
GGCATCGCGC CGGACGTGCC GCTGGCGGTC TATCTAGGAG TGCTGAACCG CTACCAGGGG
ATCGATCTTC TCCTGTCGGC CATGGTGATC CTCAAATCCC GGGGGAACCC GCTCCGGCTC
CTGGTCATGG GATTTCCGGA AGAGGGGTAC CGGCAGAAGG CGCGCGACCT GGGTATTGCC
GACATGGTGA CCTTCACCGG GCGGATCGAC TACGGCAAGG CGCCCCTCTA CCTCTCGGCG
GGGGATATGG CCGTGTCGCC CAAGGTGTCC CTCACCGAGG CCAACGGCAA GCTCTTCAAC
TACATGGCGT GCGGGCTCCC TACGATTGCG TTCGATACGC CGGTGAACCG GGAAATCCTG
GGTGAGACGG GCATCTACGC CCGTTATGGC GACGCGGCGG ATCTGGCCGC GCACCTGGCC
GGCCTGGCCG GCGATGCGGC GGCTCGGGCC GAGCGTGCTC GCCTGGCGCG GGAGCGGGCC
GAGCGCGAAC ATTCCTGGCA GGCGCGGGCC GATGTCCTCG AAGCGGTCTA CCGCCGGATG
AAACGCGCAT AA
 
Protein sequence
MLAPTPFFAD RGCHVRIYEE ARTLIARGHQ VRIVTYHLGR DMAGIPTDRT LRVPWYTKLS 
AGPSWHKPYL DVLLCGTALR TARRLRPDLI HAHLHEGAFF GVFLKKLMGI PLLFDCQGSL
TMELADHGFV REGSLLYRFF ALMEGGINRS ADAIVTSSGP GRDDLVTKWG VPAVKVTALM
DGVDTAVFRP HDRTEVRRRL GIAPDVPLAV YLGVLNRYQG IDLLLSAMVI LKSRGNPLRL
LVMGFPEEGY RQKARDLGIA DMVTFTGRID YGKAPLYLSA GDMAVSPKVS LTEANGKLFN
YMACGLPTIA FDTPVNREIL GETGIYARYG DAADLAAHLA GLAGDAAARA ERARLARERA
EREHSWQARA DVLEAVYRRM KRA