Gene Ava_4077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4077 
Symbol 
ID3681600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5065776 
End bp5067080 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content43% 
IMG OID637719428 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_324576 
Protein GI75910280 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.978495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.255888 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA AATACCGTTT TGAAACTTTG CAAGTCCATG CTGGGCAAGA ACCAGCCTTA 
GGAACTAATG CCCGTGCTGT ACCGATTTAT CAAACAACTT CATACGTTTT TGATGATGCC
GATCATGGAG CGCGATTGTT TGCTCTGCAA GAGTTCGGCA ACATTTACAC AAGGATAATG
AATCCGACAA CAGACGTGTT TGAAAAGCGT ATTGCAGCTT TAGAAGGGGG TGTGGCAGCA
TTAGCCACTT CTAGCGGTCA AGCGGCGCAA TTCTTAGCTA TCAGCACGAT CGCTCAAGCT
GGAGATAATA TTGTTTCCAC CAGTTTTTTA TACGGGGGAA CCTATAACCA ATTTAAAGTT
TCTATACCAC GTTTAGGAAT AAATGTCAAA TTTGTCGAAG GCGATGATGT AGAAACTTTT
CGTCAGGCGA TCGACGATCG CACAAAAGCA TTGTACGTGG AAACTATTGG CAATCCCCAA
TTCAATATTC CCGACTTTGC GGCATTAGCC CATATTGCCC ATGAACATGG TATTCCCTTG
ATTGTTGATA ATACCTTTGG GGCTGGGGGC TATTTGGCTC GACCAATTGA ACACGGTGCA
GATATTGTAG TTGAGTCTGC AACTAAATGG ATTGGTGGAC ATGGCACTTC TATCGGTGGC
GTAATTGTTG ATTCTGGTAA ATTTAACTGG GGTAACGGCA AATTTCCTGT ATTTACTGAA
CCATCACCTG GTTATCATGG GCTGAATTTT CAAGAGGTAT TTGGTGTAGG TAGTCCCTTT
GGGAATATTG CCTTTATTAT TCGCGCCAGA GTAGAAGGAT TAAGAGATTT CGGCCCCTCC
TTAAGTCCAT TTAACGCATT TTTACTATTG CAAGGATTAG AAACACTTTC TTTGCGTGTA
GATCGCCATG TCTCCAACGC CTTAGAATTA GCCCAGTGGC TAGAACAACA ACCCCAAGTA
GCATGGGTAA ATTATCCCGG ACTTCCCCAT CACCCCTATC ATGAAAGAGC CAAAAAATAT
CTGAGACACG GATTTGGTGG GGTATTGAAT TTTGGCATCA AAGGGGGACT AGAAGCAGGT
AAAACCTTTA TTAATCATGT TAAATTGGCA AGTCACTTAG CAAACGTAGG TGATGCTAAA
ACCCTTGTCA TTCATCCTGC TTCTACCACT CATCAACAAC TCAGTGATAC AGAACAACTT
TCCGCCGGTG TGACACCTGA TTTGGTGCGT GTATCTGTGG GAATTGAACA CATCGACGAT
ATTAAAGAAG ATTTTGAGCA AGCGTTTCAG AAGATTAGTC AATAG
 
Protein sequence
MSEKYRFETL QVHAGQEPAL GTNARAVPIY QTTSYVFDDA DHGARLFALQ EFGNIYTRIM 
NPTTDVFEKR IAALEGGVAA LATSSGQAAQ FLAISTIAQA GDNIVSTSFL YGGTYNQFKV
SIPRLGINVK FVEGDDVETF RQAIDDRTKA LYVETIGNPQ FNIPDFAALA HIAHEHGIPL
IVDNTFGAGG YLARPIEHGA DIVVESATKW IGGHGTSIGG VIVDSGKFNW GNGKFPVFTE
PSPGYHGLNF QEVFGVGSPF GNIAFIIRAR VEGLRDFGPS LSPFNAFLLL QGLETLSLRV
DRHVSNALEL AQWLEQQPQV AWVNYPGLPH HPYHERAKKY LRHGFGGVLN FGIKGGLEAG
KTFINHVKLA SHLANVGDAK TLVIHPASTT HQQLSDTEQL SAGVTPDLVR VSVGIEHIDD
IKEDFEQAFQ KISQ