Gene Avin_40400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_40400 
SymboliscS 
ID7762926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4090290 
End bp4091504 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content64% 
IMG OID643806899 
Productcysteine desulfurase 
Protein accessionYP_002801151 
Protein GI226946078 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR02006] cysteine desulfurase IscS 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTGC CGATTTATCT GGATTATTCC GCCACCACTC CGGTCGACCC GCGGGTGGCG 
CAGAAGATGT GCGAGTGCCT GACCATGGAG GGCAATTTCG GCAATCCGGC CTCGCGTTCC
CACGTCTTCG GCTGGAAGGC CGAGGAGGCC GTGGAGAACG CCCGCCGTCA GGTGGCGGAA
CTGGTCAACG CCGATCCGCG GGAGATCGTC TGGACTTCCG GCGCCACCGA GTCCGATAAC
CTGGCGATCA AGGGCGTCGC GCACTTCTAC GCGAGCAAGG GCAAGCACAT CATCACCTCG
AAGATCGAAC ACAAGGCGGT GCTGGATACC ACCCGCCAGC TCGAGCGCGA AGGTTTCGAG
GTGACCTACC TCGAGCCCGG CGAGGATGGC CTGATCACTC CGGCGATGGT CGCGGCGGCG
CTGCGCGAGG ACACCATCCT GGTCTCGGTG ATGCACGTCA ACAACGAGAT CGGCACCGTC
AACGACATCG CAGCCATCGG CGAACTGACC CGTTCGCGCG GCGTGCTCTA TCACGTGGAT
GCCGCCCAGT CGACCGGCAA GGTGGCCATC GACCTAGAGC GCATGAAGGT CGACCTGATG
TCCTTCTCCG CCCACAAGAC TTACGGCCCC AAGGGGATCG GCGCGCTCTA CGTGCGGCGC
AAGCCGCGCG TACGCCTGGA GGCGCAGATG CACGGCGGCG GCCACGAGCG CGGCATGCGT
TCCGGTACCC TGGCGACCCA CCAGATCGTC GGCATGGGCG AGGCCTTTCG CATCGCCAGG
GAAGAGATGG CCGCGGAAAG CCGGCGTATC GCCGGGCTCA GCCATCGCTT CCACGAGCAG
GTCAGCACCC TCGAAGAGGT CTACCTGAAC GGCAGCGCCA CGGCACGGGT GCCGCACAAC
CTCAATCTCA GCTTCAACTA CGTGGAAGGC GAGTCGCTGA TCATGTCGCT CAGGGATCTG
GCGGTTTCCT CTGGGTCGGC CTGCACCTCG GCGTCCCTGG AGCCGTCCTA CGTGCTGCGC
GCTCTGGGTC GCAACGACGA ACTGGCGCAC AGCTCGATCC GCTTCACTTT CGGTCGTTTC
ACCACCGAGG AAGAGGTCGA TTACGCTGCG CGGAAGGTAT GCGAGGCGGT CGGCAAGCTG
CGCGAGCTGT CGCCGCTCTG GGACATGTAC AAGGATGGGG TCGATCTGTC CAAGATCGAG
TGGCAGGCCC ACTGA
 
Protein sequence
MKLPIYLDYS ATTPVDPRVA QKMCECLTME GNFGNPASRS HVFGWKAEEA VENARRQVAE 
LVNADPREIV WTSGATESDN LAIKGVAHFY ASKGKHIITS KIEHKAVLDT TRQLEREGFE
VTYLEPGEDG LITPAMVAAA LREDTILVSV MHVNNEIGTV NDIAAIGELT RSRGVLYHVD
AAQSTGKVAI DLERMKVDLM SFSAHKTYGP KGIGALYVRR KPRVRLEAQM HGGGHERGMR
SGTLATHQIV GMGEAFRIAR EEMAAESRRI AGLSHRFHEQ VSTLEEVYLN GSATARVPHN
LNLSFNYVEG ESLIMSLRDL AVSSGSACTS ASLEPSYVLR ALGRNDELAH SSIRFTFGRF
TTEEEVDYAA RKVCEAVGKL RELSPLWDMY KDGVDLSKIE WQAH