Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2411 |
Symbol | |
ID | 3683093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2999314 |
End bp | 3000519 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637717756 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_322923 |
Protein GI | 75908627 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000241562 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000145022 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAATTTAT CCCTAAAGCA ACTGGCCGTT TATCTGTCTC TACTAGTAGT TGGTGGTAGT GCAGGTTTGT TAGGCAGTCG CTATCTCCTC CCACAAAATC GCTCGTTCCA ACAACTCAAA AATGTCACAG TCGGTTTGCC TTCGGAATCT GTAGCGTCTA ATCCTGTCAT AGGTTCTGCG GCAAATAATG GGGGGGATAA TGTCAATTTT ATTGCTAGTG CTGTGCAGAA AGTTGGCCCG GCTGTAGTGC GAATTAATGC CACCCGTAAA GTTGCCAATC CTATCTCTGA TGTTTTAAAG AATCCTCTAT TACGTCGATT TTTCGGTGAA GATGAACAGC CAATTCCGCA AGAACGAATT GAGCGGGGTA CAGGTTCGGG GTTTATTTTG AGTGAAGATG GGCAACTACT AACTAATGCC CATGTCGTAG CTGATACAGA CACCGTACAA GTAACTCTTA AGGATGGTCG GACTTTTGAG GGGAAGGTAC TGGGAGTTGA CCAGATTACA GATGTAGCTG TTGTCAAAAT CCCTGGAAGA AACTTGCCGA CAGTGAACTT GGGGAATTCG CAAAACCTCA TTCCAGGACA ATGGGCGATC GCTATTGGCA ATCCTCTCGG TTTAGATAAT ACTGTCACTA TCGGCATTAT CAGCGCCACC GACCGCACCA GCGCCCAAGT TGGAGTTCCC GATAAGCGAG TCAGCTTTAT TCAAACCGAT GCAGCAATCA ACCCCGGTAA TTCTGGCGGG CCTTTATTAA ACGCTCAAGG GGAAGTAATT GGCGTTAACA CTGCTATTCG TGCAGATGCT CAAGGTCTTG GCTTTGCCAT TCCCATAGAA ACAGCTGCCC GTGTCGCTAA TGAGCTTTTT ACTAAGGGGA GTGTACAACA TCCGTTTTTA GGGATTGAAA TGACAGACTT GTCCCCTAGC AAAAAACAGC AAATTAATAT TGAAAACAAG TTAAATATTC GACAAGACAC TGGGGTGGTA ATTAAAGGTG TCTTGGATGA TTCTCCAGCC AAAGAAGCAG GCTTGCTCCC TGGTGATGTG ATTCAAAAAA TTAACGGTAA AACAGTGAAA ACATCAGCCC AGGTACAAAA ATCGGTGGAA TCCAGCACAG TTGGAGATAT TCTAACCGTC GAAGTTAACC GCAGTGGTGA AATTCTCACC TTAAAGGTTC AGTCGGGAGT TTATCCCAAC AGATAG
|
Protein sequence | MNLSLKQLAV YLSLLVVGGS AGLLGSRYLL PQNRSFQQLK NVTVGLPSES VASNPVIGSA ANNGGDNVNF IASAVQKVGP AVVRINATRK VANPISDVLK NPLLRRFFGE DEQPIPQERI ERGTGSGFIL SEDGQLLTNA HVVADTDTVQ VTLKDGRTFE GKVLGVDQIT DVAVVKIPGR NLPTVNLGNS QNLIPGQWAI AIGNPLGLDN TVTIGIISAT DRTSAQVGVP DKRVSFIQTD AAINPGNSGG PLLNAQGEVI GVNTAIRADA QGLGFAIPIE TAARVANELF TKGSVQHPFL GIEMTDLSPS KKQQINIENK LNIRQDTGVV IKGVLDDSPA KEAGLLPGDV IQKINGKTVK TSAQVQKSVE SSTVGDILTV EVNRSGEILT LKVQSGVYPN R
|
| |