Gene GSU2011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2011 
Symbol 
ID2688069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2203783 
End bp2204958 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content60% 
IMG OID637126702 
Productcysteine desulfurase 
Protein accessionNP_953060 
Protein GI39997109 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.079673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAGA TCTACCTGGA CAACAACGCC ACCACCAAGG TGGACGAGGC TGTTTTCGAG 
GAGATGCGTC CCTATTTCTG TGACCTCTAC GGCAACCCCA GCTCCATGCA CTATTTCGGC
GGTCAGGTCC AGAGGAAGGT GGACGAGGCC CGGAATCGGG TCGCAGCGCT CCTGGGCGCC
CTGCCCGAAG AGATCATCTT CACCGCCTGT GGTACCGAGA GCGACAACGC GGCCATCCGT
TCGGCCCTTG AAGTTTTCCC GGAGCGCCGC CACATCATTA CCACCCGTGT GGAGCATCCG
GCAGTCCTTA CCCTCTGCCG CAACCTGTCA AAGCGGGGGT ACCGGGTCAC CGAACTGGGT
GTTGACGGGG AAGGCCGGCT TGACCTGAAT GAACTGCGGA GCGCGATTGA CGAGGATACC
GTCGTGGTAT CAGTTATGTG GGCCAACAAC GAAACAGGCG TGATCTTCCC GGTTGAGGAG
ATCAGCCGGA TCGTCAAGGA GAAAGGAAAG GGGGCGCTGT TCCATACCGA CGCGGTTCAG
GCCGTGGGGA AAATACCCAT CAACATGGCC ACGTCGTCCA TCGACATGCT TTCCATTTCG
GGACACAAGC TCCATGCTCC TAAGGGTACG GGGGTTCTTT ACCTGCGCAA AGGGGTACCG
TTCAGGCCGT TCATGGTCGG CGGGCACCAG GAGCACAGTC GCCGGGCCGG CACCGAAAAT
ACCGCAGGCA TCATCGCGCT CGGCAAGGCA TGCGAACTGG CAGGACACTG GATGGAAGAC
GAAAATACGC GGGTCAAGGC CCTGCGCGAC CGGCTCGAGG CAGCGCTGCT CGAACTGATC
CCCCGGGCCA GGATCAACGG GGGCGAAGCT GAGCGCCTCC CCAATACCCT CTCCATTGCC
TTCGAGTTTG TGGAAGGGGA GGCGATCCTT ATGCTGATGT CCGAGAAGGG GATTTGTGCA
TCGTCGGGGA GCGCCTGCAC CTCCGGCTCC CTTGAGCCGT CCCACGTGCT GCGGGCCATG
GGGGTTCCGT TTACCTGTGC CCACGGTTCC ATCCGATTCT CGTTGTCTCG TTACACCACG
GACGAAGAGA TCGACACCAT AATCCGCGAG TTGCCGCCCA TCATCCGTCG GTTGCGGGAA
ATGTCGCCCT TTGGTCGGGA GTTTCTCAAC GCCTGA
 
Protein sequence
MKEIYLDNNA TTKVDEAVFE EMRPYFCDLY GNPSSMHYFG GQVQRKVDEA RNRVAALLGA 
LPEEIIFTAC GTESDNAAIR SALEVFPERR HIITTRVEHP AVLTLCRNLS KRGYRVTELG
VDGEGRLDLN ELRSAIDEDT VVVSVMWANN ETGVIFPVEE ISRIVKEKGK GALFHTDAVQ
AVGKIPINMA TSSIDMLSIS GHKLHAPKGT GVLYLRKGVP FRPFMVGGHQ EHSRRAGTEN
TAGIIALGKA CELAGHWMED ENTRVKALRD RLEAALLELI PRARINGGEA ERLPNTLSIA
FEFVEGEAIL MLMSEKGICA SSGSACTSGS LEPSHVLRAM GVPFTCAHGS IRFSLSRYTT
DEEIDTIIRE LPPIIRRLRE MSPFGREFLN A