Gene Ava_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3504 
Symbol 
ID3679606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4347015 
End bp4349498 
Gene Length2484 bp 
Protein Length827 aa 
Translation table11 
GC content37% 
IMG OID637718856 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_324006 
Protein GI75909710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.227841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.192563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAGTC AGTTTGAACA CCTTAAATTA CCGAGAATAA TCAATATCGA ATTACCACGA 
CGATCTCATG GTGGAGGTGG TGGTGGAAAA CGCGCTGATT TTATTGAACA TGGTAAGCAT
TTATTAGATC AACTTTCTGG ACTGACGGAA CGTACCAAGC AAAAGAGCAA CCCTTTCCGC
CTTGATCCAA AATTAATTTT TAAGATTAAA GTTACTAAAA AACTTTCAGA CGATCTAGTT
AATCAGACAG GATTAGATAT CCTAGCATTT GAGCCTGATA AAGCCATAGT TGTCTTCTCG
TCTGATCTGG AGTTAAAAGA ATTTAGGAGA CGTTTAGAAA ATTACAGTCA TATTACAGAA
GGACACGAAT ATTCATATTT AGGAGCGATT GATGAGTTAG TCCCCCTCGA ACGTGAAGAT
CGCATTGGTC GTTTGTTAGA GTTAAAGCCT GTACAACTAG GTGAACTTGC AGCTTTAGAT
TTAGAATTAT GGCACACAGG CGATCGCCAA GAAATGAAAG TTTCTCTAGA ACATATTGCT
GAAACAATAG AGTACTTTTC CAGCGATACT GCTCCTATGA GAATGAGTGA TAGTTACGTT
GGTGAATATC TGTGTATAGC TAGAATTAAA GTCACTCACG AAGTTTTAGA GTTTTTACTG
GAACTCGAAA CTGTCAAAGA AATTGATCGT CCTCCCCAAC CTGCATTTGA GAGAACTGCC
GATTATAATT TACCAATTTC CCGTATACCC GAAGTTATTT CTCCACCTGA AGATAATTGC
GGCATTCTTG TTATTGACTC AGGTGTACAA AGAGGTCATC CCTTAATTGC TCGTGTACTC
GGTGAAGCTG ATGTATTTCC AGATCCAGCA CAGCAATTAA TAAGAGGTGG TGCAGATGAT
GTACATGGAC ATGGTACAAA TGTTGCTGGC ATTGCTATAT ATGGAGATGT CGAAAATTGT
ATTAAAAAAC TGTCATTTGA TCCAACAGTT TGGTTATTTT CTGCTCGTGT AACAGATGAA
AACTGCGAAT ATTATGAGGA TCTTCTCGTA GAAACTCAAC TAGATCAAGC CATTCGTGCT
TTTGTAGATC AGTACCCTAA CTGCAAAGTC ATAAATATTT CATTAGGTAA TGCTAAACAA
ATCTATAGAG ATGGAATGAA GCAGTTTCGA TTAGCAGCAA AAATAGATGA AATTGCTTAT
CAATACCAAA ACCAAAACAA AAATATTATT TTTGTGATTT CAGCAGGAAA TTCTTATCAT
GAAGAGTTAG GGTATGAACA ATTACGAACT GAATACCCAA ATTATTTACT GAATAAGAAG
GCCCGAATTA TTGATCCAGC AACTTCTGCG ATCGCACTAA CTGTAGGTTC TTTATCTTAT
GGACGTGGTA GTATGACAGA ACCTGGTGAT GTGCGTCGTC AGGCGATCGC AAAATTACGA
GGATACCCTT CTCCCTTTAC CAGAACTGGT TTTGGAGTAG ATGGTATGAT TAAGCCTGAT
GTTGTAGATT TTGGTGGAGA TTTAGCATTA GACCTTAGTT ATCGAGAAGC GTTAGGTTTG
CCTAAAGTTA GTCAATTAGA AGATAATGTA GCTGGGATTT CTGTTGTTAC TTTCTCCAAA
AATTTCCAAA GTTCTTTATT TAATATCTGT AGCGGTACAA GTTTTGCTGC ACCTCGTGTA
GCTAACATTG CTGCTCAACT CTTCACAAAA TATCCAAATG CCAGTTCTAA CCTCATTCGA
GCATTAATTG TCAATTCTGC GGTGCTTCCC AAAGAAATTC CAGATGAATT TAGTAAGGGT
ACAGAATCTA AAAAAATTAA AAAGCAGCTA CAAATCTATG GCTATGGACA GACTGATTTA
GAACGTGCAA TGTATTCTGC TGAAAACTAT GTTGTTCTAT CTGAAGATAA TATTTTTATT
CCAGTGGGCA AGTTCCATAT TTATGAAATC CCCCAACTAC CAGAGGAATT TTTTGATATA
GAAGGTACCC GCACATTATC AGTTACCTTG GCCTTTGATC CTCCAACTCG TCCAACTCGT
GGCGACTCAT ATTTAGGGGT AACTATGGAA TTTAATATTT TTAAAGGTAT TGACAAGGAA
AGTGTCGTAA ATGCCTATGT AGATGCAAGT AGAACAGATA AGCCTGGTGA ATTTGCAGAA
ATACCAATAA AAAATTTGAA GAAAAAATAT CCAAAACGTA GTATTACTAT TGATTTATCC
CCAGGTTCCA ATCTTCGTAA GAAAGGAACT GTACAAAGAG GTCAAACACA ACTAAAGTCA
GGAGCTAAGA AATACAATAA TTTACCGATG ACTTTGGTGG TGAGTTGTAA TCGTAAATGG
GCAAATCCAG ATGAAATTGA AATCCAACGT TATGCTTTAG TTGTTAGCGT CAGTCATTCC
GATCCGCAAG TTAATTTATA TAATCGTCTA AAACTAAAGG TTGATGAGAT TGATCTAAGA
GAAAGAAGCC GAGCAAGGAT TTAA
 
Protein sequence
MVSQFEHLKL PRIINIELPR RSHGGGGGGK RADFIEHGKH LLDQLSGLTE RTKQKSNPFR 
LDPKLIFKIK VTKKLSDDLV NQTGLDILAF EPDKAIVVFS SDLELKEFRR RLENYSHITE
GHEYSYLGAI DELVPLERED RIGRLLELKP VQLGELAALD LELWHTGDRQ EMKVSLEHIA
ETIEYFSSDT APMRMSDSYV GEYLCIARIK VTHEVLEFLL ELETVKEIDR PPQPAFERTA
DYNLPISRIP EVISPPEDNC GILVIDSGVQ RGHPLIARVL GEADVFPDPA QQLIRGGADD
VHGHGTNVAG IAIYGDVENC IKKLSFDPTV WLFSARVTDE NCEYYEDLLV ETQLDQAIRA
FVDQYPNCKV INISLGNAKQ IYRDGMKQFR LAAKIDEIAY QYQNQNKNII FVISAGNSYH
EELGYEQLRT EYPNYLLNKK ARIIDPATSA IALTVGSLSY GRGSMTEPGD VRRQAIAKLR
GYPSPFTRTG FGVDGMIKPD VVDFGGDLAL DLSYREALGL PKVSQLEDNV AGISVVTFSK
NFQSSLFNIC SGTSFAAPRV ANIAAQLFTK YPNASSNLIR ALIVNSAVLP KEIPDEFSKG
TESKKIKKQL QIYGYGQTDL ERAMYSAENY VVLSEDNIFI PVGKFHIYEI PQLPEEFFDI
EGTRTLSVTL AFDPPTRPTR GDSYLGVTME FNIFKGIDKE SVVNAYVDAS RTDKPGEFAE
IPIKNLKKKY PKRSITIDLS PGSNLRKKGT VQRGQTQLKS GAKKYNNLPM TLVVSCNRKW
ANPDEIEIQR YALVVSVSHS DPQVNLYNRL KLKVDEIDLR ERSRARI