Gene Ava_4496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4496 
Symbol 
ID3680197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5632206 
End bp5633300 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content43% 
IMG OID637719852 
Productpeptidase C56, PfpI 
Protein accessionYP_324989 
Protein GI75910693 
COG category[R] General function prediction only 
COG ID[COG0693] Putative intracellular protease/amidase 
TIGRFAM ID[TIGR01382] intracellular protease, PfpI family 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC ATAATAATTC TGGGAAAAAG AAAGTAGCTA TTCTCATCGA ACAAGCAGTA 
GAAGATACCG AGTTTATTAT TCCTTGTAAT GGTTTAAAAC AAGCAGGATT TGAGGTGGTT
GTCCTTGGTT CGCGGATGAA TGAAAAATAT AAGGGAAAAC GAGGCAGACT TTCCACCCAA
GCTGATGGTA CTACAACAGA AGCGATCGCC TCTGAATTTG ATGCGGTAGT AATTCCTGGT
GGAATGGCTC CCGATAAAAT GCGTCGCAAC CCCAATACAG TCCGCTTTGT ACAAGAAGCG
ATGGAGCAAG GAAAATTGGT AGCGGCTGTT TGTCACGGGC CACAAGTCTT AATTGAAGGC
GACTTACTCC GAGGTAAACA AGCCACCGGT TTTATCGCTA TCAGCAAAGA CATGATGAAT
GCTGGTGCTG ATTATCTCGA TGAAGCGCTA GTTGTTGACG GTAACTTGAT TACATCCCGT
GAACCTGGAG ATTTGGCAAT TTTCACCACA GCGATTTTGA GTCGTCTTGG TTATGGCGGT
AAAGATGCAG CCTTACCAGA TGAAAAAGAT AGGAATGCAG AATGGTGGAA ACTGGCTGAT
GCTTGGGGCG GTTCAACCAA AGGTGATATT GTCAGAGGTC TGAACACTGC TTTAGGTGGG
GAACGTTATT CTCTGGAAGC GCTGGAGAAG TACACGGAAA AAGAATCTGA TGTAGAAGCA
AAAGCGCTGT TCCAAGAAAT GATTACCAAT AAACAGCGTC ATATTGAATA TCTAGAAACT
TATTTGACTA GATTGGGTGA AAAACCGTCC CTCAGTGCAA ATATCGCTAA TCAATACGCC
AAAGTAAAAA CCGCTTTAAC TGGTAGCGAT GACATATATC AAATTCGCAG CGCCTTAGGC
GATATACAAA CAGGTATTGG TGATATTGGT AATCTCTGCG CCATGTACAC TGACCCCATA
GCCACCGCTA TTTTCAAAGA AATCTACAAA GACTTGGTCA AATACGAACA GCGATTAGTA
TCACTATACC GTACACGTAC AAATGCTACA GTCCAGCCGC CTAAGCCAAC AACAGGGGCA
GCTGTATCGA TGTAA
 
Protein sequence
MTNHNNSGKK KVAILIEQAV EDTEFIIPCN GLKQAGFEVV VLGSRMNEKY KGKRGRLSTQ 
ADGTTTEAIA SEFDAVVIPG GMAPDKMRRN PNTVRFVQEA MEQGKLVAAV CHGPQVLIEG
DLLRGKQATG FIAISKDMMN AGADYLDEAL VVDGNLITSR EPGDLAIFTT AILSRLGYGG
KDAALPDEKD RNAEWWKLAD AWGGSTKGDI VRGLNTALGG ERYSLEALEK YTEKESDVEA
KALFQEMITN KQRHIEYLET YLTRLGEKPS LSANIANQYA KVKTALTGSD DIYQIRSALG
DIQTGIGDIG NLCAMYTDPI ATAIFKEIYK DLVKYEQRLV SLYRTRTNAT VQPPKPTTGA
AVSM