Gene Information |
Locus tag | Ava_4077 |
Symbol | |
ID | 3681600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5065776 |
End bp | 5067080 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719428 |
Product | O-acetylhomoserine/O-acetylserine sulfhydrylase |
Protein accession | YP_324576 |
Protein GI | 75910280 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2873] O-acetylhomoserine sulfhydrylase |
TIGRFAM ID | [TIGR01326] OAH/OAS sulfhydrylase |
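The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5065776
end_bp = 5067080
reported_gene_length = 1305      # bp
reported_protein_length = 434    # aa


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

# Both derived values should agree with the reported fields.
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the coordinates give 1305 bp, and 1305 / 3 - 1 gives 434 residues, which matches the reported gene and protein lengths.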
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.978495 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.255888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA AATACCGTTT TGAAACTTTG CAAGTCCATG CTGGGCAAGA ACCAGCCTTA GGAACTAATG CCCGTGCTGT ACCGATTTAT CAAACAACTT CATACGTTTT TGATGATGCC GATCATGGAG CGCGATTGTT TGCTCTGCAA GAGTTCGGCA ACATTTACAC AAGGATAATG AATCCGACAA CAGACGTGTT TGAAAAGCGT ATTGCAGCTT TAGAAGGGGG TGTGGCAGCA TTAGCCACTT CTAGCGGTCA AGCGGCGCAA TTCTTAGCTA TCAGCACGAT CGCTCAAGCT GGAGATAATA TTGTTTCCAC CAGTTTTTTA TACGGGGGAA CCTATAACCA ATTTAAAGTT TCTATACCAC GTTTAGGAAT AAATGTCAAA TTTGTCGAAG GCGATGATGT AGAAACTTTT CGTCAGGCGA TCGACGATCG CACAAAAGCA TTGTACGTGG AAACTATTGG CAATCCCCAA TTCAATATTC CCGACTTTGC GGCATTAGCC CATATTGCCC ATGAACATGG TATTCCCTTG ATTGTTGATA ATACCTTTGG GGCTGGGGGC TATTTGGCTC GACCAATTGA ACACGGTGCA GATATTGTAG TTGAGTCTGC AACTAAATGG ATTGGTGGAC ATGGCACTTC TATCGGTGGC GTAATTGTTG ATTCTGGTAA ATTTAACTGG GGTAACGGCA AATTTCCTGT ATTTACTGAA CCATCACCTG GTTATCATGG GCTGAATTTT CAAGAGGTAT TTGGTGTAGG TAGTCCCTTT GGGAATATTG CCTTTATTAT TCGCGCCAGA GTAGAAGGAT TAAGAGATTT CGGCCCCTCC TTAAGTCCAT TTAACGCATT TTTACTATTG CAAGGATTAG AAACACTTTC TTTGCGTGTA GATCGCCATG TCTCCAACGC CTTAGAATTA GCCCAGTGGC TAGAACAACA ACCCCAAGTA GCATGGGTAA ATTATCCCGG ACTTCCCCAT CACCCCTATC ATGAAAGAGC CAAAAAATAT CTGAGACACG GATTTGGTGG GGTATTGAAT TTTGGCATCA AAGGGGGACT AGAAGCAGGT AAAACCTTTA TTAATCATGT TAAATTGGCA AGTCACTTAG CAAACGTAGG TGATGCTAAA ACCCTTGTCA TTCATCCTGC TTCTACCACT CATCAACAAC TCAGTGATAC AGAACAACTT TCCGCCGGTG TGACACCTGA TTTGGTGCGT GTATCTGTGG GAATTGAACA CATCGACGAT ATTAAAGAAG ATTTTGAGCA AGCGTTTCAG AAGATTAGTC AATAG
|
Protein sequence | MSEKYRFETL QVHAGQEPAL GTNARAVPIY QTTSYVFDDA DHGARLFALQ EFGNIYTRIM NPTTDVFEKR IAALEGGVAA LATSSGQAAQ FLAISTIAQA GDNIVSTSFL YGGTYNQFKV SIPRLGINVK FVEGDDVETF RQAIDDRTKA LYVETIGNPQ FNIPDFAALA HIAHEHGIPL IVDNTFGAGG YLARPIEHGA DIVVESATKW IGGHGTSIGG VIVDSGKFNW GNGKFPVFTE PSPGYHGLNF QEVFGVGSPF GNIAFIIRAR VEGLRDFGPS LSPFNAFLLL QGLETLSLRV DRHVSNALEL AQWLEQQPQV AWVNYPGLPH HPYHERAKKY LRHGFGGVLN FGIKGGLEAG KTFINHVKLA SHLANVGDAK TLVIHPASTT HQQLSDTEQL SAGVTPDLVR VSVGIEHIDD IKEDFEQAFQ KISQ
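The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that step, plus two simple checks on a coding sequence. The helper names (`join_blocks`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop-codon set is the one assumed for translation table 11 (TAA, TAG, TGA).

```python
# Stop codons assumed for translation table 11 (bacterial/archaeal code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(display_text: str) -> str:
    """Remove the spaces between the 10-character display blocks."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the final codon is one of the assumed stop codons."""
    return len(dna) >= 3 and dna[-3:] in STOP_CODONS


# The first two blocks and the last codon of the gene sequence shown above.
head = join_blocks("ATGTCTGAAA AATACCGTTT")
print(head)                   # joined blocks with the spaces removed
print(head.startswith("ATG")) # the record begins with an ATG start codon
print(ends_with_stop("TAG"))  # the record ends with TAG
```

Passing the full gene sequence from the record through `join_blocks` and then `gc_fraction` is one way to compare against the reported GC content field.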