Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_39100 |
Symbol | |
ID | 7762799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3961166 |
End bp | 3962515 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643806773 |
Product | cysteine desulfurase, sufS |
Protein accession | YP_002801025 |
Protein GI | 226945952 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0233253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTGGAAA ACGCTGCGCG GTTTTCCACC CTACGGATCG CGGACGAATC CGTTTCTTAC AAAAACGGTG TTTACAACGA ATCGCATGAC GACAAACGTC GCCGCCCCGA AGGCCTCCCC ATCATTCACG ACGACGCACC GACCATGTCC CTTCCCTCCC CCTGGCGGGC CGACTTCCCG GCCTTCGCCG CCTTCGCCCG GGACGGCCAG ACCTACCTGG ACAGTGCCGC CACCGCGCAG AAGCCACAGG CGATGCTGGA CGCCCTGCTC GGCCACTACG CCGGCGGCGC GGCCAACGTG CACCGCGCCC AGCACTGCCC CGGCGAGCGC GCCACGCGGG CCTTCGAGGC GGCGCGGGCG AAGGTCGCCG CCTGGCTGAA CGCCGGAAGC GCCGAGCGGA TCGTCTTCAC CCGCGGCGCC ACCGAGTCCC TCAACCTGCT CGCCTACGGA CTCGAACACC TGTTCGCGCC GGGCGACACG ATCGCCGTCG GCGCCCTGGA GCACCACGCC AACCTGCTGC CCTGGCAACG CCTGGCGCAG CGCCGCGGTC TCGACCTGGT GGTGCTGCCG CTGGATGGCG CCGGCGACAT CGACCTGGAG CAGGCCGAGC GGCTGATCGG CCCGCGCACC CGGCTGCTCG CGGTCAGTCA GTTGTCCAAC GTGCTCGGCC GCTGGCAGCC GCTCGACGCA CTGCTCGCCC TGGCCAGTTC CCGGGGCGCG CTGACGGTGG TCGACGGCGC CCAGGGCGCG GTCCACGGTC GCCACGACCT GCGGTCGCTG GCCTGCGACT TCTACGTGTT CTCCGCGCAC AAGCTCTACG GCCCGGACGG CCTCGGCGTG CTCCACGGCC GTCCGCAGGC CCTGGAACGC CTGCAGCACT GGCAGTTCGG CGGCGAGATG GTGCAGCAGG CCGACTACCA CGAGGCGCGC TTTCGCCCGG CGCCGCTCGG CTTCGAGGCC GGCACCCCGG CGATCGGCGC GGCCATCGCC TTCGGCGCCA CCCTGGACTA CCTGCAGAGC CTGGACGGCA CGGCGGTCGC CGCCCACGAG GCGGCTCTGC ACCGGCGCCT GCTCGCCGGA CTGCGCGCTG TCGCCGGTCT GCGCCTGCTC GCCGAGCCGC ACAACGCGCT GGCCAGTTTC GTGATCGAAG GGGTGCACAA CGCCGATCTG GCCCATCTGC TCGCCGAACA GGGCATCGCC GTGCGCGCCG GCCAGCACTG CGCCATGCCG CTGCTGCGCC GCCTCGGCCT GCCCGGCGCG CTGCGCGTGT CGCTCGGCCT GTACAGCGAC GGGAACGATC TGGAGCGCTT CTTCGCCGCC CTCGAACGCG CCCTGGAACT GCTGCGATGA
|
Protein sequence | MVENAARFST LRIADESVSY KNGVYNESHD DKRRRPEGLP IIHDDAPTMS LPSPWRADFP AFAAFARDGQ TYLDSAATAQ KPQAMLDALL GHYAGGAANV HRAQHCPGER ATRAFEAARA KVAAWLNAGS AERIVFTRGA TESLNLLAYG LEHLFAPGDT IAVGALEHHA NLLPWQRLAQ RRGLDLVVLP LDGAGDIDLE QAERLIGPRT RLLAVSQLSN VLGRWQPLDA LLALASSRGA LTVVDGAQGA VHGRHDLRSL ACDFYVFSAH KLYGPDGLGV LHGRPQALER LQHWQFGGEM VQQADYHEAR FRPAPLGFEA GTPAIGAAIA FGATLDYLQS LDGTAVAAHE AALHRRLLAG LRAVAGLRLL AEPHNALASF VIEGVHNADL AHLLAEQGIA VRAGQHCAMP LLRRLGLPGA LRVSLGLYSD GNDLERFFAA LERALELLR
|
| |