Gene Ava_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2947 
Symbol 
ID3681317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3652615 
End bp3653697 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content42% 
IMG OID637718294 
Productpeptidase M4, thermolysin 
Protein accessionYP_323453 
Protein GI75909157 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGAA ATAAAAATAA ATCATCTAGG TTGGAATGTA ATCATACAGT CAATACTAGA 
TGTCCTATTT GCTGTGTCAT TCCACCGCAT ATATTAGAAA ATATTGCCCT TAATGGTACG
CCACTGCAAA GGAATTGGGC TTTTCAAACA TTAAATGTTT CAGCCCAATT ACGCGGACGG
AGAAACATCG TTGGCAACGT CTTTCTATCT CCTTCTCCTG GGGAAAAAAG TCGTACTATT
TACGATGCTC AAAACAGTGA ACAATTGCCA GGAAAAATAG TACGTGTTGA AGGTGATCCC
CCTAGTAGCG ATGTGGCAGT AAATGAAGCT TATGATGGGG CTGGAGCTAC TTATGACCTG
TTCTACGAAA TATTTGAGCG CAATTCTATT GACGACAAAG GATTACGTTT AGATTCTACA
ATCCATTTTG GTGTTAAATA TGAAAACGCC TTTTGGAATG GCGACCAGAT GGTTTATGGC
GATGGTGATG GTGAATTATT CGATCGCTTT ACCAAGTCAA TTGATGTTAT TGCCCATGAA
CTAACTCATG GTATAACTCA ACACGAGGCA GGACTGATAT ATTACGGCGA ACCAGGGGCT
TTGAATGAAT CTTTTTCTGA TGTGTTTGGT GCTTTGGTCA AACAAAGGGT AAAAAATCAG
AAAGCGGAAG AAGCTGATTG GCTGATTGGT GATGCTTTAT TAATGCCCAA CGTCAAGGGT
GTAGGTGTCC GTTCCATGAA AGAGCCGGGG ACAGCTTATG ATGATCCTGT TTTAGGAAAA
GACCCCCAAC CAGGTCATGT AAAAGACCAA TATACAGGCT GGGCTGATAA TGGAGGGGTA
CATATCAATT CAGGGATTCC CAATCGAGCT TTTTATCTAG CAGCAGTAGA GATTGGTGGC
TATGCTTGGG AGAAAGCAGG TAAAATTTGG TATATTGCTT TGCGCGATCG CTTACGTGCT
AAAGCTGATT TTACAAAGGC TGCTGATGTC ACTATCCAAG TTGCTAGCGA ACTCTATGGC
GACGGTAGTC TAGAACATAA AGCTGTACAG AATGCTTGGA AACAGGTGGG AGTTTTGAGT
TAA
 
Protein sequence
MARNKNKSSR LECNHTVNTR CPICCVIPPH ILENIALNGT PLQRNWAFQT LNVSAQLRGR 
RNIVGNVFLS PSPGEKSRTI YDAQNSEQLP GKIVRVEGDP PSSDVAVNEA YDGAGATYDL
FYEIFERNSI DDKGLRLDST IHFGVKYENA FWNGDQMVYG DGDGELFDRF TKSIDVIAHE
LTHGITQHEA GLIYYGEPGA LNESFSDVFG ALVKQRVKNQ KAEEADWLIG DALLMPNVKG
VGVRSMKEPG TAYDDPVLGK DPQPGHVKDQ YTGWADNGGV HINSGIPNRA FYLAAVEIGG
YAWEKAGKIW YIALRDRLRA KADFTKAADV TIQVASELYG DGSLEHKAVQ NAWKQVGVLS