Gene Ava_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1797 
Symbol 
ID3681964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2240305 
End bp2241498 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content46% 
IMG OID637717137 
Productpeptidase M50 
Protein accessionYP_322314 
Protein GI75908018 
COG category[R] General function prediction only 
COG ID[COG1994] Zn-dependent proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACAA ATTGGAGAAT CGGGTCTTTA TTTGGTATTC CCCTATTTTT AGATCCTTTG 
TGGTTTGTAA TTTTGGGCTT AGCAACACTG AACTTTGGTG TGGCTTATCA AGAATGGGGA
ACGGCGACAG CATGGACGGC TGGACTAATT ATGGCGTTGT TGCTATTTGG TTCGGTGTTA
TTGCATGAGT TAGGTCATAG TTTGGCAGCG CGATCGCAAG GGATTAAAGT TAACTCTATT
ACCTTATTTT TGTTTGGTGG GATTGCCGCC ATTGAGGAAG AATCCAAAAC TCCGGGTAAA
GCGTTTCAAG TAGCCATTGC TGGGCCTTTG GTGAGTATTG GCTTATTTTT ACTCTTACGG
TTGGGGAGTA CTGTTGTATC TGATAGTAGC CCGGTTAGTC TGATGGTGGC TGATTTGGCA
CGAATTAACC TAATTGTTGC CTTATTTAAC CTAATTCCCG GCTTACCTTT AGATGGGGGT
CAAGTGTTGA AGGCTGCACT CTGGCAAATT ACCGGAGACC GTTTTCAAGC AGTACATTGG
GCAGCCAAAG CCGGACAAAT TTTAGGTTAT GGTGCGATCG CCTTAGGTTT TGCTGTAGAT
TTCTTTACTA GAGAATTAGT TACAGGCTTG TGGATTGTGC TGTTAGGTTG GTTTGGGGTT
CGCAATGCCA ACAGCTACGA CCGCGTAACC ACATTACAAG AAACCTTGCT AAAGGTAACG
GCTGCTGATG CCATGACTCG TGATTTTCGT GTCATTGATG CCGACCAGAC CTTACGTTCC
TTTGCTGATT CCTACCTATT GGCAACTACC AACCCAGAAG TTTATTTTGC TGCTTCCGAT
GGTCGTTATC GCGGTATGGT CGCCATTGAG GATTTACGCC TGGTGGAAAG AAGCGCATGG
GAAACTCAAA CTCTCCACAG CATCGCCCAT CCCCTCACAG AAATACCTAC AGTTGCAGAA
TCCACTGTCA TCGCTGAAGT CATCAACAAG CTAGAAAATG AACAGTTACC CCGTGTCACC
GTACTTACTC CGGCTGGCGC TGTTGCGGGA ATAATTGACC GGGGAGATAT TGTCAGCGCA
TTGGCACAAA AATTAGGTTT GCGTATGACT GACGCAGAAA TTAAGCGTAT CAAAGAAGAA
GGTAGTTATC CGCCGGGGCT GCAATTGGGG GTAATTGCCA AGTCTATTAA TTAG
 
Protein sequence
MQTNWRIGSL FGIPLFLDPL WFVILGLATL NFGVAYQEWG TATAWTAGLI MALLLFGSVL 
LHELGHSLAA RSQGIKVNSI TLFLFGGIAA IEEESKTPGK AFQVAIAGPL VSIGLFLLLR
LGSTVVSDSS PVSLMVADLA RINLIVALFN LIPGLPLDGG QVLKAALWQI TGDRFQAVHW
AAKAGQILGY GAIALGFAVD FFTRELVTGL WIVLLGWFGV RNANSYDRVT TLQETLLKVT
AADAMTRDFR VIDADQTLRS FADSYLLATT NPEVYFAASD GRYRGMVAIE DLRLVERSAW
ETQTLHSIAH PLTEIPTVAE STVIAEVINK LENEQLPRVT VLTPAGAVAG IIDRGDIVSA
LAQKLGLRMT DAEIKRIKEE GSYPPGLQLG VIAKSIN