Gene Information |
Locus tag | Ava_5068 |
Symbol | |
ID | 3683213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 6364426 |
End bp | 6365676 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637720429 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_325560 |
Protein GI | 75911264 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
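The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the record does not state either convention. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 6364426
END_BP = 6365676
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming a terminal stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1251
    print(expected_protein_length(GENE_LENGTH_BP))  # 416
```

Under these assumptions both results agree with the Gene Length and Protein Length fields reported in the record.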
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.705563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACAT ATAATGGCGC ACATAACAAG ATTTATACTT GGAGTCTGCC CCACAAGGGT GGCAGTAGTG CGGTTTTGAT GCTGCTGGGT GGGGTAACAG CTATGTTTTT AGGAAGCTGC TCCCTTCTAC CCACAAGGAC AATACAATCC CAAGCTAATC AATCTCAGCC CCAGTCGAAT GATAATAGTC CAGCAATTGT CCCCCCAGCT ATTTTTTCAT CCACTGGCGA CCCTAATTTT GTCGTCGGAG TAGTACAAAA AGTGGGAGGG GCTGTAGTGC GGATTGATTC TGCCAGAACA GTAACTTCCA GAGTTCCAGA TGGATTTAAT GATCCTTTTT TCCGCCGTTT TTTCGGAGAT GGAGTGCAAG CACAACCAAG ACAGCGCGTA GAAAGGGGTA GCGGTTCAGG TTTTATTATT AGTTCCTCTG GTCAAATTTT AACTAATGCT CATGTTGTCG ATGGTGCTGA TGAGGTAACA GTTACCCTCA AAGATGGTAG GACTTTTGAT GGTAAGGTAC TTGGTGAAGA CCCAGTAACG GATGTAGCTG TTATTAAAAT AAACGCTAAT AACTTGCCAA CTGTTGCTGT CGGTAATTCT GAAGTTTTAC AACCAGGTGA AGCGGTTATT GCGATCGGTA ATCCTCTAGG CTTGAATAAT AGTGTTACGT CAGGAATTAT CAGCGCCACA GGTCGTTCTA GTACTGATAT TGGCGCAAGT GATAAGCGCG TTGACTATCT GCAAACAGAT GCGGCGATTA ATCCTGGTAA CTCTGGCGGC CCCCTGCTCA ATGCTCGCGG TCAGGTAATT GGGATGAACA CAGCTATTAT CCAAGGCGCT CAAGGTTTGG GATTTGCTAT TCCTATTAAT ACTGTGCAGA AAGTTGCTCA GGAATTAATC ACTCAAGGTA AGGTAGATCA TCCCTATTTG GGTGTACAGA TGGCAACCCT CACGCCACAA GTTAAGGAAA GAATTAACGA AAGATTGGGC GATCGCATCA ATATTACAGC AGATAGAGGC GTTTTATTAG TTCGTATCGT CCCTGGTTCT CCCGCCGCCA ATGCCGGACT CAGACCAGGA GATATTATTC AAAGTATTAA TAACCAATCT GTCACAACCG TTGAAGAAGT CCAAAGAATT GTGGAAAATA GCCAAATAGG TAACCCTTTA CAAGTCCAAA TAGAACGCAA TGGTCGAACA ACACAGGTAG CCGTCAGTCC AGCACCTTTA CCTGTGCAAC GAGAAGGGTA G
|
Protein sequence | MKTYNGAHNK IYTWSLPHKG GSSAVLMLLG GVTAMFLGSC SLLPTRTIQS QANQSQPQSN DNSPAIVPPA IFSSTGDPNF VVGVVQKVGG AVVRIDSART VTSRVPDGFN DPFFRRFFGD GVQAQPRQRV ERGSGSGFII SSSGQILTNA HVVDGADEVT VTLKDGRTFD GKVLGEDPVT DVAVIKINAN NLPTVAVGNS EVLQPGEAVI AIGNPLGLNN SVTSGIISAT GRSSTDIGAS DKRVDYLQTD AAINPGNSGG PLLNARGQVI GMNTAIIQGA QGLGFAIPIN TVQKVAQELI TQGKVDHPYL GVQMATLTPQ VKERINERLG DRINITADRG VLLVRIVPGS PAANAGLRPG DIIQSINNQS VTTVEEVQRI VENSQIGNPL QVQIERNGRT TQVAVSPAPL PVQREG
|
| |
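The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to join the blocks, compute a GC fraction, and translate the gene sequence. It assumes the gene sequence is given 5' to 3' in coding orientation (it begins with ATG), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the start codons it permits. The helper names are hypothetical.

```python
# Codon table in the usual TCAG ordering; amino-acid assignments are the
# same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an unambiguous DNA sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    prefix = join_blocks("ATGAAGACAT ATAATGGCGC ACATAACAAG")
    print(translate(prefix))  # MKTYNGAHNK
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence through `join_blocks` and `gc_fraction` gives a value that can be compared with the GC content field.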