Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2994 |
Symbol | |
ID | 3681229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3713261 |
End bp | 3714211 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637718340 |
Product | prolyl aminopeptidase |
Protein accession | YP_323499 |
Protein GI | 75909203 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0267164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.7335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGAAC TTTACCCACT CATCGAACCT TATAAAGAAG GTAAATTAAA GGTTTCCCAA TTACACACCA TTCATTTTGA AGAATCAGGA AACCCCCAAG GTAAACCCAT AGTGTTATTG CATGGTGGCC CTGGTGGGGG CTGTCCTCCA GTTTATCGGC AATATTTTCA CCATGAAAAA TGGCGATTAG TCATGTTTGA TCAACGTGGC TGCGGTAAAA GTCAACCCCA TGCCGAATTG AGGGAAAATA CCACTTGGGA TTTAGTCAGT GATATTGAAA AACTCCGAGA ACATTTAGGA ATAGAAAAGT GGGTGGTTTT TGGTGGGAGT TGGGGCAGTA CTTTATCTTT AGCTTACAGT CAAACTCACC CTGAGCGTTG TTTAGGCTTG ATTTTACGCG GGATATTTTT GCTCAGACAA AAAGAGTTAC GCTGGTTTTA TCAAGAAGGT GCTAGTTATA TTTTTCCTGA TGCTTGGGAG GAATATCTGC AACCAATTCC TGTAGATGAA CGTGATGATT TACTCACGGC TTATTACCAA CGTTTAACTA GTCCAGATTC ACAAGTTAGA CAAGAAGCGG CTCGTGCTTG GTCAATTTGG GAAGCTAGCA CTAGCAGATT ATTTCCTGAT ACCCAACTAA AGCAAACTTT TGCTGAGGAT AAATTTGCAG AAGCTTTTGC CCGGATTGAA TGCCATTATT TTATAAATAA AGGCTTTTTA AATTCTGACC ATCAACTATT ATTAAATGTT GACTGCATTC GCCATATCCC TAGTGTAATT GTCCAGGGGC GTTATGATGT AGTTTGCCCA ATGACATCAG CTTGGGAATT ACATCGTGCT TGGCCGGAAG CTGAATTTAT TGTAGTTCCT GATGCTGGTC ATTCTATGAG TGAAGTGGGG ATTCGTAGTG CTTTGATTGA GGCGACGGAT AGGTTTGCTG ATGCAGGCTA G
|
Protein sequence | MRELYPLIEP YKEGKLKVSQ LHTIHFEESG NPQGKPIVLL HGGPGGGCPP VYRQYFHHEK WRLVMFDQRG CGKSQPHAEL RENTTWDLVS DIEKLREHLG IEKWVVFGGS WGSTLSLAYS QTHPERCLGL ILRGIFLLRQ KELRWFYQEG ASYIFPDAWE EYLQPIPVDE RDDLLTAYYQ RLTSPDSQVR QEAARAWSIW EASTSRLFPD TQLKQTFAED KFAEAFARIE CHYFINKGFL NSDHQLLLNV DCIRHIPSVI VQGRYDVVCP MTSAWELHRA WPEAEFIVVP DAGHSMSEVG IRSALIEATD RFADAG
|
| |