Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4785 |
Symbol | |
ID | 3679438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6016505 |
End bp | 6017626 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637720141 |
Product | peptidase M50 |
Protein accession | YP_325277 |
Protein GI | 75910981 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.384856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGAA CAATTCGCGT TGGTAATCTC TTCGGTATTC CTTTTTATAT CCATCCGTCG TGGTTTTTAG TTCTGGGTTT AGTTACCTGG AGTTATGGCG GTGGACTCTC AGCAGAATTT CCCCAACTAT CTGGGGTGAT GGCTTTGGGA CTGGGACTGA TAACGGCGTT GTTATTGTTT GCTTCTGTCG TCGCTCATGA ATTAGGACAT AGCTTTGTCG CCATCCGTCA AGGAATTAAC GTTAATTCCA TCACACTATT TATCTTTGGT GGCTTGGCTA GCTTAGAAAA AGAGTCCAAA ACACCAGGTG GAGCCTTTTG GGTGGCGATC GCCGGGCCTC TAGTCAGTTT ATTATTGTGT GGTATCGTCA CGGCAATTGG TGTGACTACG GCAGTTACAG GGCCATTGGC AGCAATTCTG GGAGTTCTGG CTTCTGTAAA CTTAGCTTTG GCATTGTTTA ACCTGATTCC TGGCTTACCG TTGGATGGTG GAAACGTCCT TAAAGCCATT GTTTGGAAAG TAACAGGTAA TCCCTATAAA GGTGTCACTT TTGCTAGTCG TGTAGGACAA GTATTTGGTT GGGTGGCGAT CGCTTCTGGT ATTTTCCCCA TACTATATTT TGGTAGCTTC GCCAACGTGT GGAATCTGTT AATTGGCTTC TTCTTGCTAC AAAATGCTGG TAACGCAGCC CAATTTGCCA GAGTGCAAGA AAAACTCACA GGCTTAACAG CAGCCGACGC TGTAACGACC GATAGCCCTA TAGTTTCTGC CCATCTTAGC CTGAGAGAAT TTGCTGATGA TCAAATCGTT CAAGGACAGA ACTGGCGACG GTTTTTAGTT ACCAACAACG CAGGACAATT GGTAGGTGCG ATCGCTCTTG ATGACTTGCG AAACATCCCC ACTACATCCT GGACAGAAAC TCAAATTCAA CAGGTGATGC GGCCAATTCA ATCTACCACC ATCAAATCTA GTCAACCATT GTTAGAAGTA GTGCAATTAC TAGAACAACA AAAATTGTCT GCCCTCCCCG TAATTCTCGA CAATGGTGTA CTACTAGGCA TTTTAGAAAA AGCCGCTATC ATCCAGCTAT TGCAAAACGG AACCCAACCT AGCCCTGCAT AG
|
Protein sequence | MNGTIRVGNL FGIPFYIHPS WFLVLGLVTW SYGGGLSAEF PQLSGVMALG LGLITALLLF ASVVAHELGH SFVAIRQGIN VNSITLFIFG GLASLEKESK TPGGAFWVAI AGPLVSLLLC GIVTAIGVTT AVTGPLAAIL GVLASVNLAL ALFNLIPGLP LDGGNVLKAI VWKVTGNPYK GVTFASRVGQ VFGWVAIASG IFPILYFGSF ANVWNLLIGF FLLQNAGNAA QFARVQEKLT GLTAADAVTT DSPIVSAHLS LREFADDQIV QGQNWRRFLV TNNAGQLVGA IALDDLRNIP TTSWTETQIQ QVMRPIQSTT IKSSQPLLEV VQLLEQQKLS ALPVILDNGV LLGILEKAAI IQLLQNGTQP SPA
|
| |