Gene Ava_4484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4484 
Symbol 
ID3680340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5615080 
End bp5617188 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content46% 
IMG OID637719839 
Productoligopeptidase A 
Protein accessionYP_324977 
Protein GI75910681 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGA ATACTACCAT TACTCAAAAT CCTTTACTTA AAGGTTCTGG CTTGCCTCCA 
TTTACGGAAA TTCAGCCAGA ACAAGTACAA CCAGCCTTTG AACAATTATT GGCGGAACTG
GAACAAGAAC TGACTACCTT GGAAGCGAAT GTACAGCCTA CTTGGGATGG CTTGGTCGAA
CCTTTGGAAA AGTTAAGCGA TCGCCTTACC TGGAGTTGGA GTATAGTAAA TCATCTAATG
GGTGTGAAAA ATAGCCCAGA ACTGCGTACA GCCCATGAAG CTGTGCAGCC ACAAGTAGTG
CAGTTTGCTA ACAAGTTGGG ACAAAGCCAA CCCATCTACA ATGCTTTTAA ACAACTCCGC
GCCAGTGATA GTTGGCAAAC TTTAGAATCA GCTCAACAGC GAATTGTGGA AGCAGCCATT
CGGGATGCAG AACTGTCCGG TGTGGGCTTG CAAGGAGAAG CACGGGAACG TTTCAATGCT
ATTCAAATGG AATTGGCAGA ATTGGCTACC AAGTTCTCTA ACCATGTCCT TGACGCAACC
ACAGCATTTA GCCTCACCCT CACTACTCAA GCAGAAGTTG ACGGCTTACC CCAAAGTTTA
CTTAGTTTAG CAGCACAAGC CGCACGGGCA ACTGGTGCAG AAAATGCCAC ACCAGAAAAT
GGCCCTTGGC GTATTACTTT AGATATCCCC AGTTACACCC CCTTCATCCA GCACAGCACC
AGGCGTGATT TGCGCGAACA GCTTTACAAA ACTTATATTA CCCGCGCCTC ATCTGGTGAA
TTAGATAACA ACCCAATTAT CGAACGGACT TTAGAGTTAC GCCAAGAACT GGCAAATTTA
TTGGGCTTCC AAAATTATGC CGAGTTGAGC CTAGCCAGTA AAATGGCTCC CAATGTGGAA
GCAGTTGAAG CACTATTAGA AGAATTACGC AGTGCTAGTT ACGATGCATC TGTTAAAGAT
TTAGAAACCT TGAAAGCCTT TGCAGCCTCT AAAGGAGCGC CAGAAGCCGC AGATTTACGA
CACTGGGATA TTAGCTTTTG GGCTGAACGC CAACGAGAAG AAAAATTTGC CTTCACCGCC
GAAGAACTAC GTCCTTACTT CCCCCTACCC CAAGTCCTCG ATGGCTTATT TGGACTGGTG
CATCGGCTGT TTGGCATTAC TGTTACCCCA GCAGATGGAC AAGCCCCAGT TTGGCACGAA
GATATCCGTT ATTTCCAAAT AGCTGATGAA ACTGGCAATC CCATCGCCTA TTTCTACTTA
GACCCTTACA GCCGTCCAGG TGAAAAGCGT GGCGGTGCAT GGATGGATAC CTGTATCAAT
CGCGCCATAA TTACAGAAAA TGGTGTAAAA ACTGTACGTT TACCTGTGGC TTATTTGGTG
TGTAACCAAA CTCCGCCAGT GGATGGTAAA CCTAGTTTGA TGACATTCTA TGAAGTAGAA
ACGCTGTTCC ATGAATTTGG TCATGGGCTA CATCATATGC TCACCAAGGT TAACTATGCT
GGAGCCGCAG GCATCAATAA TGTAGAGTGG GATGCAGTGG AACTACCAAG TCAGTTTATG
GAAAACTGGT GTTATGAACG TACCACCTTA TTTAGCTTGG CTAAACATTA CGAAACTGGG
GAAACACTAC CAGAACATTA TTATCAAAAG CTCCTAGCCG CCCGTAATTA TATGAGTGGT
AGCGGTATCT TGCGCCAACT CCACTTTAGT AGCGTTGATT TGGAATTACA CTCTCACTAT
CACCCAGGTG GTCAGGAAAC CGCCGCCGAT GTGCGTCATC GAGTTGCCCA AAAAACTACT
GTCTTACCAC CACTACCAGA AGATGCTTTC CTCTGCGCTT TTGGACACAT CTTTGCAGGT
GGTTATGCTG CGGGATACTA CAGTTACAAG TGGGCTGAGG TACTTAGTGC TGATGCTTTC
GCCGCTTTTG AAGATGCTGG GCTAGAAGAT GAAGTTGCCA TTAAAGCTAC GGGTAGACGT
TATCGAGATA CTGTATTAGC TCTTGGTGGT AGCAAGCATC CGATGGAGGT GTTTGCAGCG
TTCCGAGGAC GGGAACCGAG TACAGCTTCT TTGCTCAAGC ATAATGGGTT GCTGCCCAAT
GCTGCTTAA
 
Protein sequence
MSANTTITQN PLLKGSGLPP FTEIQPEQVQ PAFEQLLAEL EQELTTLEAN VQPTWDGLVE 
PLEKLSDRLT WSWSIVNHLM GVKNSPELRT AHEAVQPQVV QFANKLGQSQ PIYNAFKQLR
ASDSWQTLES AQQRIVEAAI RDAELSGVGL QGEARERFNA IQMELAELAT KFSNHVLDAT
TAFSLTLTTQ AEVDGLPQSL LSLAAQAARA TGAENATPEN GPWRITLDIP SYTPFIQHST
RRDLREQLYK TYITRASSGE LDNNPIIERT LELRQELANL LGFQNYAELS LASKMAPNVE
AVEALLEELR SASYDASVKD LETLKAFAAS KGAPEAADLR HWDISFWAER QREEKFAFTA
EELRPYFPLP QVLDGLFGLV HRLFGITVTP ADGQAPVWHE DIRYFQIADE TGNPIAYFYL
DPYSRPGEKR GGAWMDTCIN RAIITENGVK TVRLPVAYLV CNQTPPVDGK PSLMTFYEVE
TLFHEFGHGL HHMLTKVNYA GAAGINNVEW DAVELPSQFM ENWCYERTTL FSLAKHYETG
ETLPEHYYQK LLAARNYMSG SGILRQLHFS SVDLELHSHY HPGGQETAAD VRHRVAQKTT
VLPPLPEDAF LCAFGHIFAG GYAAGYYSYK WAEVLSADAF AAFEDAGLED EVAIKATGRR
YRDTVLALGG SKHPMEVFAA FRGREPSTAS LLKHNGLLPN AA