Gene GSU2786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2786 
Symbol 
ID2686974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3064891 
End bp3066027 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content70% 
IMG OID637127476 
Productcysteine desulfurase 
Protein accessionNP_953830 
Protein GI39997879 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03235] cysteine desulfurase DndA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTCT ACCTGGACTG TAATGCCACG ACCCCCCTGG AGCCGGCCGT CATGGCCGTG 
GTGACCCGCT TCATGGAGCG GGATTACGGC AACGCGGCGA GCCCCATTCA TGATTTCGGA
GTGTTCGCGC GGCTGGCGGT TGAGCATGCC CGGGGCCAGG TGGCTGAGGT GGCGGCTGCC
CGGCGGGACG AGGTGATCTT CACCAGCGGG GCTACCGAGG CCGACAACCT GGCGCTCCTG
GGCCTGGCGG ACCACGGCCT GGCATGCGGG CGACGTCATG TGATCAGCAC GGCCGTCGAG
CACAAGGCGG TGCTCGAACC GCTGGAAGAG CTGGCGCGCC GCGGATTCCA GGTGGAGCTC
CTCCCCGTGG GGGCGTCGGG GCGGCTGGAC CCTGACCGGC TGCGTGCGGC GCTCCGGCCT
GACACCCTTC TCGTTTCCAC CATGCACGTC AACAACGAAA CCGGCGTGGT CCAGCCCCTG
GCCGAACTGG CTGAGATCCT GGCCGGCCAC GGCGCCTACT GGCACGTGGA CGCGGCCCAG
GGCTTCGGCA AGGAGATCGA CGGTCTGCGC AATCCGCGGA TCGACCTGAT CGCCGTGAGC
GGCCACAAGA TCTACGCCCC CAAGGGGGTG GGCGCCCTCA TCGCCCGCAA GCGGGACCGC
GCCTTTCCGC CGCTGCGGCC CCTGATGCTG GGCGGCGGCC AGGAGCAGGG GCTGCGGCCC
GGAACCCTGC CCGTCCCCCT CATTGCCGGT TTCGGCGAGG CAGCCAAGCT GGCGGTGCGC
ACCCACGAGG CGCGCTCCGC CGCCAACCGC GCCTTCCGGG AAAAACTCCT GGCCGCCCTG
GCCCCACTGG AGCCGACCCT CAACGGCGAC CAGGAGCACG TCCTTCCCCA TGCCGTGAAC
CTTTCCCTGG CCGGGATCGA GGCCGACCGG GCCATCACCG CCCTCAAGGG GGTCATTGCC
GTGTCGAGCA CCTCGGCCTG CACCTCCCAC ACCCGGGCGC CGAGCCACGT TCTCACCGCC
ATGGGGCTTT CCCCGGAGCG GGTCGAGACG TCGCTGCGGC TTTCCTGGTG CCACCTGACC
CCGGCCGTGG ACTGGGACGA GGTCGTCTCC ATCCTGCGCG GACTCCGCGC ATCATGA
 
Protein sequence
MTVYLDCNAT TPLEPAVMAV VTRFMERDYG NAASPIHDFG VFARLAVEHA RGQVAEVAAA 
RRDEVIFTSG ATEADNLALL GLADHGLACG RRHVISTAVE HKAVLEPLEE LARRGFQVEL
LPVGASGRLD PDRLRAALRP DTLLVSTMHV NNETGVVQPL AELAEILAGH GAYWHVDAAQ
GFGKEIDGLR NPRIDLIAVS GHKIYAPKGV GALIARKRDR AFPPLRPLML GGGQEQGLRP
GTLPVPLIAG FGEAAKLAVR THEARSAANR AFREKLLAAL APLEPTLNGD QEHVLPHAVN
LSLAGIEADR AITALKGVIA VSSTSACTSH TRAPSHVLTA MGLSPERVET SLRLSWCHLT
PAVDWDEVVS ILRGLRAS