Gene Information |
Locus tag | Ava_1479 |
Symbol | |
ID | 3682522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1824112 |
End bp | 1825152 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637716818 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_321997 |
Protein GI | 75907701 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
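The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; names are illustrative.
start_bp = 1824112
end_bp = 1825152
reported_gene_length = 1041      # bp
reported_protein_length = 346    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon,
# which does not contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print("gene length:", gene_length, "matches record:", gene_length == reported_gene_length)
print("protein length:", protein_length, "matches record:", protein_length == reported_protein_length)
```

Under these assumptions both derived values agree with the reported 1041 bp and 346 aa.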
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000398571 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACTG TTTTAGCAAT CGAAACTAGC TGTGATGAAA CTGCCGTGGC AATTGTTAAC AATCGTCAAG TTTGCAGCAG TATCATAGCT TCACAAATTC CAGTCCACCA GCAGTACGGT GGAGTGGTAC CGGAGGTAGC CTCACGAGCG CACTTGGAAA CGATAAATGA CGCGATCGCT CAAGCGATGG ATCAAGCTCA ACTAGGTTGG GATAAAATCG ATGGGATCGC TGCCACTTGT GCGCCTGGAC TTGTAGGAGC GCTGTTAGTG GGGCTAACTG CTGCCAAAAC TTTAGCAATC TTACACAATA AACCATTTTT GGGAGTTCAC CACCTTGAAG GTCACATCTA CGCGACTTAT TTGAGTGAGC CAACTTTAGA TCCCCCTTTT CTTAGCTTAC TGGTTTCTGG GGGACATACA AGCTTAATTT ACGTTAAGGA ATGTGGCAGG TATGAAAGTC TGGGTGAAAC TCGTGATGAT GCTGCTGGGG AAGCTTTTGA CAAAGTAGCT AGGCTATTAA AGCTGGGTTA TCCTGGCGGC CCAGTCATTG ATAAACTAGC GCAAACAGGT AATTCTCAAG CCTTTGCGTT GCCGGAAGGA AAAGTGTCCT TAGCTGGTGG GGGATATCAT CCCTATGATG GCAGTTTTAG CGGCTTAAAG ACGGCTGTAC TGCGTTTAGT GCAGCAATTA GAGAGGGATG GAGATCCATT GCCTATAGAG GACATTTCCG CCAGCTTCCA GGCGACAGTA GCCAAGGCAT TAACCAAGAG AGCGATCGCC TGTGCTTTAG ACTATGGTCT AGATACCATT GCTGTAGGTG GTGGGGTAGC CGCTAACAGT GGTTTGAGAC AGCACCTACA AGCAGCAGCC ACCGCAAATA ATCTCCGCGT CCTCTTTCCC CCCCTAAAAT TCTGTACCGA TAACGCCGCC ATGATAGCCT GCGCTGCCGC CGATCACTTA TCACGGGGTC ATCTATCCCC CATCACCTTA GGCGTAGAGT CACGCCTCAG TCTTAGCCAA GTGATGAAGT TATATCAGTA A
|
Protein sequence | MTTVLAIETS CDETAVAIVN NRQVCSSIIA SQIPVHQQYG GVVPEVASRA HLETINDAIA QAMDQAQLGW DKIDGIAATC APGLVGALLV GLTAAKTLAI LHNKPFLGVH HLEGHIYATY LSEPTLDPPF LSLLVSGGHT SLIYVKECGR YESLGETRDD AAGEAFDKVA RLLKLGYPGG PVIDKLAQTG NSQAFALPEG KVSLAGGGYH PYDGSFSGLK TAVLRLVQQL ERDGDPLPIE DISASFQATV AKALTKRAIA CALDYGLDTI AVGGGVAANS GLRQHLQAAA TANNLRVLFP PLKFCTDNAA MIACAAADHL SRGHLSPITL GVESRLSLSQ VMKLYQ
|
| |
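The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that spacing, translates DNA with the amino-acid assignments of NCBI translation table 11 (the table listed in the record), and computes a GC percentage. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino-acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 until the first stop codon or the last full codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: not part of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two blocks and last three blocks of the gene sequence in the record.
print(translate("ATGACAACTG TTTTAGCAAT"))    # MTTVLA
print(translate("GTGATGAAGT TATATCAGTA A"))  # VMKLYQ
```

The two printed fragments correspond to the first six and last six residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the reported GC content to be checked in the same way.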