Gene GSU1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1980 
Symbol 
ID2688172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2170504 
End bp2171385 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content61% 
IMG OID637126671 
Productpolysaccharide deacetylase domain-containing protein 
Protein accessionNP_953029 
Protein GI39997078 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID[TIGR03006] polysaccharide deactylase family protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGGATT ACTTCCAGGT AAGCGCCTTT GAAGGGTGCT CTCCTCCGGA ACAGTGGGAC 
TGTTTTCCCC TTCGGGTGGA GAAGAATACG TCCCGTATCC TCGACATGCT CGATGCCGGG
GGCGTTAAGG CCACGTTCTT CGTCCTCGGC TGGGTGGCGG AACGCGCGCC GGAACTGGTA
AAAGAGATAG CACGACGGGG CCACGAGGTT GCCAGCCACG GCTATGGTCA CCGCCGGGTT
TCTACCCAGA CCAGGCAGGA ATTCCGGGCC GACATCCGCA GAAGCAAGGC GCTGATCGAG
AATCTTACCG GCAGTCCCGT CCACGGCTAT CGCGCGCCCA GCTATTCGAT CTCACGAAAA
GTCCTCTGGG CCTTCGACGA ACTGCTCGAC GCCGGTTACT GTTACGACTC CAGCGTCTTT
CCCGTCCGCC ATGACCTCTA CGGCATCCCT GACTGGCCGA GCCACCCCTT TCGGGTCGTG
AAGGGGAACG GTGGGTGGGA GCCGTCCGCA ACTGCCCCTG ACAGCGAAGA CGGCATCACC
GGCCAGATGC CGTCCATCCT TGAAATGCCG ATAACCACCC TTACCCTCGG GGGCAGGAAC
ATTCCCATTG CCGGGGGCGG CTATTTCCGC TTTTTCCCCT ATGCCTTCAC CCGGTGGGGA
CTGCGGCGCA TCAACCGGCG TGAGAAACGG TCGTTCATCT TCTATCTCCA TCCCTGGGAG
ATGGACCCCG ACCAGCCCAG AATGGCCGGT GCTCCGGCCA AGAGCCGCTT TCGGCATTAC
CTGAACCTGC ACCGGACCGA AGAGCGGTTC CGCCGACTGT TGGGAGAATT TCGCTTCACC
CCCGTGATGG ACCTTCTGGC CGTAGGGATG GTTGAACCAT GA
 
Protein sequence
MEDYFQVSAF EGCSPPEQWD CFPLRVEKNT SRILDMLDAG GVKATFFVLG WVAERAPELV 
KEIARRGHEV ASHGYGHRRV STQTRQEFRA DIRRSKALIE NLTGSPVHGY RAPSYSISRK
VLWAFDELLD AGYCYDSSVF PVRHDLYGIP DWPSHPFRVV KGNGGWEPSA TAPDSEDGIT
GQMPSILEMP ITTLTLGGRN IPIAGGGYFR FFPYAFTRWG LRRINRREKR SFIFYLHPWE
MDPDQPRMAG APAKSRFRHY LNLHRTEERF RRLLGEFRFT PVMDLLAVGM VEP