Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_28070 |
Symbol | |
ID | 7761712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2894260 |
End bp | 2895588 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643805686 |
Product | cysteine desulfurase-like protein |
Protein accession | YP_002799954 |
Protein GI | 226944881 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.622503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTCCGC GCGATAGTCA CCTGTTTTTG CACTCAAATT GCGCTAGAGG AAATGTAGGA TATGCCTTGT CGATTCAATG TTCTTTTATG GGAGTCTTAG CAATGCCTAC CTTGGACTTG TCTTTTGTAC GTAGCCAATT CCCCGCCTTT GCAGAACCCT CTCTCGATGG GCAGGGCTTT TTCGATAATG CCGGCGGCTC CTACACCTGC CAACAAGTGA TCGATCATCT TAACCGATAC TATCGGCAAA TGAAGATCCA ACCCTACGGT GCTCATCCAG CTTCGCGTAG GGCTGGGGAA GAGATGGACC ATTCACACGT AGAGATTGCC AGTTACTTGG GGGTATCTGC AGAAGAAGTC CATCTTGGAC CCTCTACATC CCAGAATACT TATGTGCTCG CAGAGGCATT CGGTAACCGT CTCCAACCAG GGGATGAAAT CATTGTCACG AACCAAGACC ACGAAGCAAA CAACGGTGTA TGGCGTCGCC TCGCCAAGCG AGGCATTGTG ATCAAAGAGT GGCAGGTCAA TCCTTCGACG GGTTCGCTGA ATACCGAAGC ACTTCAAGCC CTGCTGTCGA CGCGTACAAA GGTAGTGGCA TTCACTCACT GCTCGAACAT CCTGGGTGAA ATCAACCCAG TTCGTAAAAT CTGCGAGATC GTCCACGCGG CTGGTGCACT CGCTGTGGTT GACGGAGTTT CCTATGCCGG TCATGGGCTG CCCGATGTCT CTCAACTAGG TGCCGATATC TACCTTTTCT CACTGTACAA AACATTTGGT CCCCATCTCG GCGCAATGGT AATCCGACGC CCCTTGATGG ACTTTCTAGG CAATCAAGCT CACTATTTCA ACGAAGCGTA TCTGGCCAAA TGGTTTGTTC CAGCCGGTCC TGACCATGCT CAAGTAGCTG CGGCTAGCGG AGTCACGGCT TACTTCGACA AGGTCCATGA GTATCACTTC CCTAGTGCAG ACAGTAAACA TCGCGCTGAA AATGTGAGGA CACTGTTTAG AGAAGCAGAA GCCTCTCTAC TTCCAACACT GCTTGAGTTC TGTGGCGGCG ATTCACGAGT TCGCCTCCTC GGCCCGATTG AAGCAATAAA GCGCTCGGCG ACGGTATCCA TTCAAACACT GAGCGCCACT CCACGAGAAA TCTCTGAGCG GTTGGCAAAG AAAGGAATTA TCGCGGGTGC AGGTCACTTC TATGCAGTTC GTCTACTCGA GGCGCTGGGG GTAAATTCTG CTGAGGGGGT GCTGCGCCTG TCTTTCGCTC ACTACTCCAG CCCAGAAGAG ATCCAACAGC TCATTAACGC TTTGAACGAA GCACTTTAA
|
Protein sequence | MSPRDSHLFL HSNCARGNVG YALSIQCSFM GVLAMPTLDL SFVRSQFPAF AEPSLDGQGF FDNAGGSYTC QQVIDHLNRY YRQMKIQPYG AHPASRRAGE EMDHSHVEIA SYLGVSAEEV HLGPSTSQNT YVLAEAFGNR LQPGDEIIVT NQDHEANNGV WRRLAKRGIV IKEWQVNPST GSLNTEALQA LLSTRTKVVA FTHCSNILGE INPVRKICEI VHAAGALAVV DGVSYAGHGL PDVSQLGADI YLFSLYKTFG PHLGAMVIRR PLMDFLGNQA HYFNEAYLAK WFVPAGPDHA QVAAASGVTA YFDKVHEYHF PSADSKHRAE NVRTLFREAE ASLLPTLLEF CGGDSRVRLL GPIEAIKRSA TVSIQTLSAT PREISERLAK KGIIAGAGHF YAVRLLEALG VNSAEGVLRL SFAHYSSPEE IQQLINALNE AL
|
| |