Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_21000 |
Symbol | |
ID | 7761025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2095267 |
End bp | 2097081 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643804995 |
Product | Cysteine desulfurase, SufS-like protein |
Protein accession | YP_002799276 |
Protein GI | 226944203 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.382512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGATC TGGCGCGGCT GGCCAACGCC TTCTTCGCCA GCCTGCCGGG TAGCGCTCCG TCGGACGGCG TTCCGTCGGG CGGCGCGCCC TTCGGCAAGG GTCTGTCGGA CAGCGTTCTG CCGGGCGGCG TTTCGGCGCT GGCCGGCGTG TCGCCCGTCG AGCCGTTTCC GAGTGGGGCC GCCGCCTCGC TCGGGGCGGC CGAACGGCAC GCCGGGGAGA CGCTTCCGGC GGGCATCGCC GACGGCCTGG AGCTCGGCTC GCCCGAGGCC TACGCCGCCG CGTTGCCGAC GCTGTTCCCG GCTGCCGGCG GCCTCGCGCC GTTGGCCGGG GGCGCGCCGT CCACGCCGTA CTACTTCCTC GGCGAGGGCA GCGCCTACAG CGGCGAGCCG GAGCGCTTCG CGGACCTTCC CGTCGAGCCG GACAGCCGCG CGGTGCCGGG CGGGGACGCG CTGGGCGAGG TCCTCCGCTC GATCCTCGCC GAGCCGTCCG CGCCGGCGGC GCCGGCGGGT CCGGGGGCCG GACAGTTCTA TTTCCTCGAA CGGAGCGGTC CGGGCCTGGA GCAGGCGCCC GATCCCGTCG TCCAGCCGCA GGTCCGCAGC GGTTTCGACG TGCAGGCGGT GCGCCGGGAC TTCCCGATTC TCGCCGAACG GGTCAACGGC AAGCCGCTGG TGTGGTTCGA CAACGCCGCC ACCACGCAGA AGCCCAAGGC GGTGATCGAT CGCCTGGCGT ACTTCTACGA GCACGAGAAC TCCAACATCC ACCGCGCCGC CCACGAGCTG GCGGCGCGGG CGACGGACGC CTACGAGGGC GCGCGCAGCA AGGCGGCGCG CTTTCTCGGC GCGAAGTCGA CGGACGAGAT CGTCTTCGTG CGCGGGGCGA CCGAGGGCAT CAACCTGCTG GCCAACACCT TCGGCCGCCG GTTCATCGGC GAGGGGGACG AGATCATCGT CTCCCACCTG GAGCACCACG CCAACATCGT GCCCTGGCAG TTGTTGGCCA ACGCGGTGGG CGCCAGGCTC AAGGTGATCC CGGTGGACGA CTCCGGGCAG ATCATCCTGG AGGAGTACGC CAGGCTGCTC GGTCCGCGCA CCCGGCTGGT GAGCATCACC CAGGTGTCCA ACGCCCTCGG CACGGTCACC CCGGTGGCGG AGGTGATCGC CCTGGCGCAT GCCGCCGGTG CGCGGGTACT GGTGGACGGC GCGCAGTCGG TGTCGCACCT GAAGGTCGAC GTGCGGGCGC TGGACGCGGA CTTCTTCGTG TTCTCCGGGC ACAAGGTCTT CGGGCCGACC GGCATCGGTG TGGTCTACGG CAAGCAGGAA CTGCTCGACG AGCTGCCGCC CTGGCAGGGC GGCGGCAACA TGATCGCCGA TGTGACCTTC GAGAAGACCC AGTACCAGGG TGCGCCGGCG CGCTTCGAAG CCGGTACCGG CAACATCGCC GATGCGGTGG GGCTCGGTGC GGCGCTGGAC TATGTCGAGC GCATCGGCCT GGAAGCCATC GCCCGCTACG AGCACGAACT GCTGGAGTAC GCCACCCGGG GCCTCGCGAC GATTCCCGGC CTGCGCCTGA TCGGTACCGC GGCGAACAAG GCCAGCGTGC TGTCCTTCGT GCTGCAGGGC TACCGCACCG AGGAGATCGG CGCCGCCCTC AATCGCGAGG GTGTCGCCGT GCGTTCCGGC CACCACTGCG CGCAGCCGAT CCTGCGCCGT TTCGGGGTGG AAACCACGGT GCGGCCGTCG TTGGCGTTCT ACAACACCTT CGACGAGATC GACCTGCTGG TGAGCACCGT GCACCGTCTG GCGACCCGGC GCTGA
|
Protein sequence | MADLARLANA FFASLPGSAP SDGVPSGGAP FGKGLSDSVL PGGVSALAGV SPVEPFPSGA AASLGAAERH AGETLPAGIA DGLELGSPEA YAAALPTLFP AAGGLAPLAG GAPSTPYYFL GEGSAYSGEP ERFADLPVEP DSRAVPGGDA LGEVLRSILA EPSAPAAPAG PGAGQFYFLE RSGPGLEQAP DPVVQPQVRS GFDVQAVRRD FPILAERVNG KPLVWFDNAA TTQKPKAVID RLAYFYEHEN SNIHRAAHEL AARATDAYEG ARSKAARFLG AKSTDEIVFV RGATEGINLL ANTFGRRFIG EGDEIIVSHL EHHANIVPWQ LLANAVGARL KVIPVDDSGQ IILEEYARLL GPRTRLVSIT QVSNALGTVT PVAEVIALAH AAGARVLVDG AQSVSHLKVD VRALDADFFV FSGHKVFGPT GIGVVYGKQE LLDELPPWQG GGNMIADVTF EKTQYQGAPA RFEAGTGNIA DAVGLGAALD YVERIGLEAI ARYEHELLEY ATRGLATIPG LRLIGTAANK ASVLSFVLQG YRTEEIGAAL NREGVAVRSG HHCAQPILRR FGVETTVRPS LAFYNTFDEI DLLVSTVHRL ATRR
|
| |