Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2462 |
Symbol | |
ID | 3682852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3056252 |
End bp | 3058081 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637717805 |
Product | peptidase S49, protease IV |
Protein accession | YP_322972 |
Protein GI | 75908676 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.67824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACT TCTTTAAACA AACTCTTGCT AGCTTACTAG GCACATTATT AGGACTGTTT CTGTTCGCTG GTGCAGGTAC TACAGGACTT TTATTATTAT TATTTGCTGC TGCTAGTTCC AGCAATACCA GCCCAGTAGT TAAAGATAAA TCAGTAGTGG TTCTAGACTT GTCCATGAAC ATCACCGATA GAGAACCAAG TGCTGGCGAG GAACTACAAA ATAGATTTTC CGGTGTAACA CAAGAGCGGA TGACACTTCG TAATGTTATC GATAGTTTAG AAAAAGCACA ACGAGATAAA CGCATCGTTG CTATTTACCT AGACGGTAGT CGTGGAGGAA ATAATCTGGG TTTTGCATCG TTGAAAGAAA TCCGCAAAGC TCTAGAAGAA TTCCGTAAAT CCGGGAAAAA GGTAATCGCC TATGGTGTGT CTTGGAACGA GCGGGAATAT TACCTCAGTT CGGTAGCTGA TACTATTGCA CTTAACCCCT TGGGTGGACT AGAAATCAAC GGTTTGAGTA GCCAACCAAT GTTCGTCGCC GGAGCCTTGC AAAAGTATGG TATTGGCGTT CAGGTTGTAA GAGCCGGAAA ATTTAAGGGA GCAGTTGAAC CATTCGTGTT AAACAAACTA AGTCCCGAAA ACCGCGAACA AACTCAAAAA TTATTGGACG ATGTTTGGGG TGAATGGCGT ACCACCGTTG GCAATAGCCG TAAAATTAAC CCACAAAAAC TACAAGCGAT CGCCGATAAT CAATCATTAT TGGAACCAAC AGAAGCTAAA ACTAACGGCT TAGTCGATCA GGTAGCGTAC AACGATCAAG TAGTTGCTGA CCTGAAGAAA TTAACAGGTA GCGATAAAAA GGATAATACA TTTACTCAAA TTAGCCTGCG CCGATATGCT CAAGTACCCG GACAGTCTTT AGGTGTGGCA AGAAACTCTA AAAATAAGAT TGCCGTAGTT TATGCCGAAG GCGATATTGT CGATGGTAAA GGCGATGATG GGCAAATAGG AGGCGATCGC TTTGCCCGCA TCTTTAATAA AATTAGACAA GATGAAAATG TCAAAGCAGT CGTTTTACGC ATCAATAGTC CTGGTGGTAG TGCTACAGCA TCGGAGGTAA TGCAGCGAGA AATACGCCTG ACTCGTGAAA GTAAACCCGT TGTTGTATCT ATGGGTGATT ATGCCGCTTC TGGTGGTTAC TGGATAGCCA CCGACTCTAA CCGAATTTTT GCCGAACCTA ATACAATCAC AGGTTCCATC GGTGTGTTTG GCGTGCTATT TAACGGGCAA AAGCTAGCAA ACGATAATGG TATCACTTGG GATGCTGTCA AAACGGCACG TTATGCAGAT TCTCAAACCG TCGCTCGTCC CAAATCACCC CAAGAGATAG CAATTTACCA GCGCAGTGTT GACCGCATTT ACAATATGTT TGTCAACAAA GTTGCTCAAG GTCGCAAGTT ACCCACACAA AAAGTAGCCC AAATCGCCCA AGGTAGAGTT TGGTCTGGTG TAACTGCAAA ACAAATTGGT TTAGTTGATG AAATCGGTGG TCTAAACGCC GCCATAGAAT ACGCAGCTAA AGCAGCGAAA CTAGGTAAAG ATTGGCAATT AAGAGAATAT CCCAGAGAAA GCTCATTTGA AGAGCGCTTT TTTGGCGGAG TAGTCGAGGA AATTAGTACG ACTTTGGGAA TTGAGAAGTT AGACGTTAAA CCAAATGACC CATTAACAGT GCAAATCCGA AAAATCCAAC AGGAAATCTC TGTTTTACAG ACCATGAACG ACCCACAAGG TGTTTATGCG CGGTTACCAG TTAACTTAAA AATAGAGTAA
|
Protein sequence | MSNFFKQTLA SLLGTLLGLF LFAGAGTTGL LLLLFAAASS SNTSPVVKDK SVVVLDLSMN ITDREPSAGE ELQNRFSGVT QERMTLRNVI DSLEKAQRDK RIVAIYLDGS RGGNNLGFAS LKEIRKALEE FRKSGKKVIA YGVSWNEREY YLSSVADTIA LNPLGGLEIN GLSSQPMFVA GALQKYGIGV QVVRAGKFKG AVEPFVLNKL SPENREQTQK LLDDVWGEWR TTVGNSRKIN PQKLQAIADN QSLLEPTEAK TNGLVDQVAY NDQVVADLKK LTGSDKKDNT FTQISLRRYA QVPGQSLGVA RNSKNKIAVV YAEGDIVDGK GDDGQIGGDR FARIFNKIRQ DENVKAVVLR INSPGGSATA SEVMQREIRL TRESKPVVVS MGDYAASGGY WIATDSNRIF AEPNTITGSI GVFGVLFNGQ KLANDNGITW DAVKTARYAD SQTVARPKSP QEIAIYQRSV DRIYNMFVNK VAQGRKLPTQ KVAQIAQGRV WSGVTAKQIG LVDEIGGLNA AIEYAAKAAK LGKDWQLREY PRESSFEERF FGGVVEEIST TLGIEKLDVK PNDPLTVQIR KIQQEISVLQ TMNDPQGVYA RLPVNLKIE
|
| |