Gene Ava_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0462 
Symbol 
ID3682467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp587109 
End bp589178 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content40% 
IMG OID637715791 
Productprolyl oligopeptidase 
Protein accessionYP_320983 
Protein GI75906687 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.14887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTACT CTGAAAAATC CCTGAACTAT CCACTCAGCC ACAAAATTGA TCAAGTTGAT 
GATTACCACG GTACCTTAGT AGCAGATCCT TATCGTTGGT TAGAAGATCC TGACTCTGAA
ACAACTAGGG CTTGGATCGA GGCACAAAAT CAAGTTACTT TTGCGTATTT GGGCGAAGTT
TCGACTAGAG AGAAAATTCA ACAACGTCTT AACAAACTTT GGGATTATGA AAAGTATGGA
ATTCCTTTTA AAGAAGGAGA AAATTATTTT TATTTCAAAA ATGATGGCTT GCAAAATCAA
AGTGTTCTGT ATACGTTGAA ATCTCTGGAT TCTGAACCAA GAGTTTTACT AGACCCCAAC
AAACTTTCGG ATGATGGGAC TGTTGCTCTT TCAGGTTTAG CTATCAGCGA TAATGGCAAG
CTTTTGGCGT ACGGTATAGC AACTTCTGGT TCTGATTGGC AAGAATGGAA GGTTATTGAT
GTAGAAACTG GTGCAGATTT TCCAGATCAT CTCAATTGGG TTAAATTTTC CGGTGCATCT
TGGACGAATG ATAATCAAGG TTTTTTCTAT AGTCGTTATG ACGAACCAAA TGAAAAAACT
AAATTGGAAG ATGTTAACTA TTACCAAAAG CTATATTATC ATCAGTTGGG TACACCGCAA
TCTGAAGACA TACTAATTTA CCAACGTCTT GACCAAAAAG AATGGGGTTT TAATGGCGTT
GTCACGGAAG ATGGTTGCTA TCTGATAATT TCAGTTTGGT TGGGTACTGA TTCCAGAAAT
CTAGTTTTTT ATAAAGACTT AACTAACCCC AATACTGAAG TCGTAGAACT AATTGATCAA
TTTGAAGCAG ATTATAGTTT CATTGATAAT GATGAGAGTG TTTTCTACTT CCGTACTGAC
TTGGATGCAC CACGGGGTAG AGTAATTGCT ATTGATATAG CTAATCCGGC TAAAGAAATA
TGGCGAGAAA TTATCCCCCA AGCTGAAGCA ACTTTAGAAA GTGTCAATAT CTTAAATAAT
CAGTTTATTG CAGGTTATTT AGAAGATGCC CGATCGCAAG TGAAAATTTT TGACCTCAAC
GGTACATTAG TTCGGAATGT AGAATTACCA GGATTGGGTG CTGTAGATGG CTTTGGTGGT
AAGCGTGGCG ATACAGAGAC TTTTTACAAA TTCACGAGTT TTACTACACC AGGAACTATT
TACCGATATA ACTTAGTAAC AGGCAAAAGC GAGGTTTTTA GAGAAACAAA TGTAGATTTT
AATCCTGATA ATTACGAAAC TAAACAAGTT TTTTATCAAA GTCAAGATGG TACACAAGTA
CCCATGTTTA TTACTCATAA GAAAGGCATT CAATTAGATG GGAATAACCC CACTTATCTT
TATGCCTACG GTGGTTTTAA TGTCTCACTC ACGCCCAATT TTTCTGTAAG TATGTTGGTA
TGGATGGAAA TGGGTGGTGT TTATGCCATG CCAAATATAC GCGGTGGCGG AGAGTATGGC
GAAGAATGGC ATCAAGCAGG GATGAAGGAT AAAAAGCAAA ACGTTTTTGA TGACTTTATT
GCTGCTGCTG AGTGGTTGAT GGCAAATAAT TATACAAAAC CGGAAAAACT AGCGATCGCT
GGTGGTAGTA ATGGTGGTTT ATTAGTGGGT GCTTGCATGA CTCAGCGTCC TGAGTTATTC
GGTGCAGCTT TACCAGCCGT TGGTGTGATG GATATGTTAC GGTTTCACAA ATTTACCATT
GGTTGGGCTT GGACTGCTGA ATATGGTTCC CCAGATAACC CACAAGAGTT TCCTGCAATC
TACGCTTATT CGCCACTGCA TAATCTCAAA TCAGGTACAG CATACCCCGC AACCTTGATT
ACCACCGCCG ATCACGACGA TCGCGTTGTC CCCGCCCACA GCTTCAAATT TGCCGCAGCT
TTGCAAACTG CTCACAATGG TAATGCACCT GTATTAATTA GAATTGAAAC TAAAGCTGGA
CATGGTGCAG GTAAACCTAC GGCAAAGATT ATCGAAGAAG CCGCAGATAA ATGGGCATTT
TTAGTGCGGG CTTTAGCTGT TGAGGTTTAG
 
Protein sequence
MPYSEKSLNY PLSHKIDQVD DYHGTLVADP YRWLEDPDSE TTRAWIEAQN QVTFAYLGEV 
STREKIQQRL NKLWDYEKYG IPFKEGENYF YFKNDGLQNQ SVLYTLKSLD SEPRVLLDPN
KLSDDGTVAL SGLAISDNGK LLAYGIATSG SDWQEWKVID VETGADFPDH LNWVKFSGAS
WTNDNQGFFY SRYDEPNEKT KLEDVNYYQK LYYHQLGTPQ SEDILIYQRL DQKEWGFNGV
VTEDGCYLII SVWLGTDSRN LVFYKDLTNP NTEVVELIDQ FEADYSFIDN DESVFYFRTD
LDAPRGRVIA IDIANPAKEI WREIIPQAEA TLESVNILNN QFIAGYLEDA RSQVKIFDLN
GTLVRNVELP GLGAVDGFGG KRGDTETFYK FTSFTTPGTI YRYNLVTGKS EVFRETNVDF
NPDNYETKQV FYQSQDGTQV PMFITHKKGI QLDGNNPTYL YAYGGFNVSL TPNFSVSMLV
WMEMGGVYAM PNIRGGGEYG EEWHQAGMKD KKQNVFDDFI AAAEWLMANN YTKPEKLAIA
GGSNGGLLVG ACMTQRPELF GAALPAVGVM DMLRFHKFTI GWAWTAEYGS PDNPQEFPAI
YAYSPLHNLK SGTAYPATLI TTADHDDRVV PAHSFKFAAA LQTAHNGNAP VLIRIETKAG
HGAGKPTAKI IEEAADKWAF LVRALAVEV