Gene Ava_4060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4060 
Symbol 
ID3681681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5046694 
End bp5048577 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content39% 
IMG OID637719411 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_324559 
Protein GI75910263 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0156752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.194218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGTTA AATTTACTCC AATAAATAAC AATATTATTC ATGGAGTTAC TAGCAATGGT 
AACTCTTTTC CAGAACATAA TGATGCTTAT AGCTCTGAAC TCAGGATTGA CGATATATTA
AGCTCAAACT TTCATAGTAG TTTTTCAACT CAGCAAGATA ATACATTATT GTCGGCAACT
GTCTCAAATA CTAATACTCA TCTCACTGCT CAAACAATTG ATGTTACATC TAGTCAACCA
GACTTAATCA TCCAAAATGC TGTAGTTCCT AGTACAGCAT CAATTGGAAC TACTATCCAA
GTAAACTATG TAGTTAAGAA CCAAGGCTCG GAGAATACTT TCTCTAGCTA TACTATGTTT
TTCTTATCGA GAGATAGAAA TGTGAGTGAT GATGATTATT ACTTAGGTTC AGACTATGTT
GACGGTATTG CAGCAGGCGC TTACAGTTCA GAATCAAGCA CACTCAGAAT TGATAATGGT
ATTGTCGCTG GTAGCTATTA TTTATTGTGC CAAGCTGATG GCAATGGAGA TTTTATTGAA
AGTAATGAAA CCAACAACAT TTTAGCGACA GCCATCAATA TTAATCTGAT TCAGACAGAT
TTAGTTGTTC AGAATCCTGT AGCACCTAGT TCAGTAACTG TTGGGTATAG CTTTAGAATT
AGTTATCGAG TTAAAAACCA GAGTGTGGGT AATGCTTTTC CTAGCTCTAC GATGTTTTAT
ATATCTAAAG ATAAAACTAT CAGCAATGAT GATCTATATT TAGGTTCCCA GGATGTTGGT
AGTATTCCAG CAGGTGCTTA TAGTTCACAA ACAACTTCAC TGAGGATTGT GAATAATATC
ACCGCAGGTA AATATTATCT GCTGTACAAA GCTGATGGTA ACAACAATTT GATTGAAACC
AATGAAGGCA ATAACATTGT TGCTAAAGCT ATCAACATCA AGAATAGTTT TCGCTCTACC
AATGGTTATG GCTTAATTAA TGCCGCCGCC GCAGTAGCTC AAGCCTTAGG TCAGACAACC
TTTGGTGATG TTGTTAATTT AGGTGGTAAC AATTGGGGTG CAGACCTCAT TAACGCACCA
GAAGTCTGGG CAAAAGGATA TACAGGTCAA GGAATTACTG TCGCTGTTGT AGATGGTGGG
GTTGACCGCA ACCATACCGA TTTGAGCAGT AATATCTGGA AAAATCTTAA AGAAATTGCT
GGTAACGGTA AGGATGATGA TGGCAATGGC TATATTGATG ATGTTTACGG CTGGAACTTT
GTTGACAACA ACAACAATAC CTTAGACAAA AATGGACATG GGACTCATGT AGCTGGGACT
ATTGCCGGGG TAAGAAATAG CTTTGGTGTT ACAGGTATTG CCTATAATGC CAAGATTATG
CCAGTGAAGG TTTTGGCTGA TAATGGTTCA GGTGCTGATA ATGCTATCGC TCAAGGTATT
CGTTATGCGG CAAACAATGG AGCCAATGTG ATTAACTTGA GTTTAGGTAA AGAGCAACCT
AGTATTAATA TCCAATCGGC TATTCAATAT GCTAGCAGCA AAGGCGCGAT CGTGGTTATG
GCAGCAGGAA ACGGTGGTCA GCTAACACCA TACTACCCTG CTAGATATGC CACAGACTGG
GGGCTAGCGG TAGGCGCAGT TGATAAGTCT GGAATTATGG CTAGCTTCTC TAACCTCGCC
GGAAATGAGC TATTGAGCTA TGTCACAGCC CCTGGTGTTG GTATTTATTC GACACTTCCT
GGTAACAAAT ATGCTTCTTG GAATGGAACA TCTATGTCCA CTCCTTATGT TGCTGGGGTA
GTCGCTTTGA TGCTGAGTGC TAATAAAAAT TTAACTGATT CCCTCGTGCG TCAAATCCTT
ACATCTACCG CAGCCAATAG ATAA
 
Protein sequence
MQVKFTPINN NIIHGVTSNG NSFPEHNDAY SSELRIDDIL SSNFHSSFST QQDNTLLSAT 
VSNTNTHLTA QTIDVTSSQP DLIIQNAVVP STASIGTTIQ VNYVVKNQGS ENTFSSYTMF
FLSRDRNVSD DDYYLGSDYV DGIAAGAYSS ESSTLRIDNG IVAGSYYLLC QADGNGDFIE
SNETNNILAT AININLIQTD LVVQNPVAPS SVTVGYSFRI SYRVKNQSVG NAFPSSTMFY
ISKDKTISND DLYLGSQDVG SIPAGAYSSQ TTSLRIVNNI TAGKYYLLYK ADGNNNLIET
NEGNNIVAKA INIKNSFRST NGYGLINAAA AVAQALGQTT FGDVVNLGGN NWGADLINAP
EVWAKGYTGQ GITVAVVDGG VDRNHTDLSS NIWKNLKEIA GNGKDDDGNG YIDDVYGWNF
VDNNNNTLDK NGHGTHVAGT IAGVRNSFGV TGIAYNAKIM PVKVLADNGS GADNAIAQGI
RYAANNGANV INLSLGKEQP SINIQSAIQY ASSKGAIVVM AAGNGGQLTP YYPARYATDW
GLAVGAVDKS GIMASFSNLA GNELLSYVTA PGVGIYSTLP GNKYASWNGT SMSTPYVAGV
VALMLSANKN LTDSLVRQIL TSTAANR