Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_15030 |
Symbol | |
ID | 7760438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1480337 |
End bp | 1481470 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643804400 |
Product | cysteine desulfurase protein |
Protein accession | YP_002798693 |
Protein GI | 226943620 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCCA TCCACGACGA ATTTCCGCAG CTGGCCGGGC TGCGCTACCT CAACCACGCC GCCGTCGCCC CCTGGCCGAT GCGGGCCGGC AAGGCGGTGA GCGCCTTCGC CGAGCAGAAC GTGCACATCG GTGCGCGCGA TTACCCTCGC TGGCTGGGCA TCGAACGGCG CCTGCGCGAA CGCCTGACGC GCCTGCTCAA CGCACCGACC ACGGCGGACA TAGCCTTGGT CAAGAACACC TCGGAAGCCC TGTCCTTCGT CGCCTTCGGC CTGGACTGGC AGGCCGGCGA GCGGATCGTC ATCAGCGACG AGGAGTTCCC CTCCAACCGC ATCGTCTGGG AGGCGCTCAA GCCCCAGGGC GTCGAGATCG TCACGGTCGG CCTCAAGGGG GCCGATCCGG AAGCCGATCT GTTGGCGGCC TGCGGCCCGC GCACGCGCCT CCTGTCGATC AGCGCCGTGC AATACGCCAG CGGCCTGCGC CTGGATCTGG AGCGGCTCGG CCAGGGTTGC CGGCAGCTCG GCGTGCTGCT CTGCGTCGAC GCCATCCAGC AGCTCGGCGC CCTGCCCTTC GACGTGCGCG CCTGCGACTG CGCCTTCGCC ATGGCCGACG GACACAAGTG GATGCTCGGC CCGGAGGGGC TCGGCGTGTT CTACTGCCGC AGCGACCTGC GCGAACGGCT CAAGCTGCAC GAATACGGCT GGCACATGCT GGAGAACGCC GGCGACTACG ACCGCCTCGA CTGGGAACCG GCGCGCACCG CCCGGCGCTT CGAATGCGGC AGCCCGAACA TGCTCGGTGC CGTGGCACTG GAAAGCAGCC TGTCGCTGCT GGAGCAGGTC GGCATGGCCA CGGTCGCCCA GGGACTGCGC GAACGGGTGG CCTGGCTCGA AGAAGGCCTC CTGCGGATTC CCGGCCTGCA CCTGCGCAGC CCGCGCGATC CCGAGCGGCG CGCCGGCATC CTGACCTTCG CACTCGACGG CTGGGACAAC CAGCAGTTGT TCGAGCGCCT CAAGCACGAG CAGATCGTCT GCGCCCAGCG TGGCGGCGGA ATCCGTCTGT CGCCGCATTT CTACACGCCG ACCGTGGTGC TCGACGAAAC CCTCGCGGTC CTGCGCCAGC TGGCATCCGA TTAG
|
Protein sequence | MHAIHDEFPQ LAGLRYLNHA AVAPWPMRAG KAVSAFAEQN VHIGARDYPR WLGIERRLRE RLTRLLNAPT TADIALVKNT SEALSFVAFG LDWQAGERIV ISDEEFPSNR IVWEALKPQG VEIVTVGLKG ADPEADLLAA CGPRTRLLSI SAVQYASGLR LDLERLGQGC RQLGVLLCVD AIQQLGALPF DVRACDCAFA MADGHKWMLG PEGLGVFYCR SDLRERLKLH EYGWHMLENA GDYDRLDWEP ARTARRFECG SPNMLGAVAL ESSLSLLEQV GMATVAQGLR ERVAWLEEGL LRIPGLHLRS PRDPERRAGI LTFALDGWDN QQLFERLKHE QIVCAQRGGG IRLSPHFYTP TVVLDETLAV LRQLASD
|
| |