Gene Ava_4610 details


Gene Information

Locus tag: Ava_4610
Symbol:
ID: 3679960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5766563
End bp: 5767849
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 45%
IMG OID: 637719965
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_325102
Protein GI: 75910806
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
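
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the coordinates are 1-based and inclusive (the record does not say so explicitly); the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5766563
end_bp = 5767849
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N codons encodes N - 1 residues, because the stop codon
# is not translated.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)
print(remainder == 0 and codons - 1 == protein_length_aa)
```

Under that assumption both checks print `True`, so the reported lengths agree with the coordinates.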


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.861127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTATCTA AAGTAATAAT AAACATAGTT TCCTCGCTCA AGGCCATGCA AAACCAAGTA 
CACGATGAAT CTCAACCCTT AAATCCTAAA AACCAGAATC ACGCACCTTG GAAAAAAGCG
GCTGCATCTC TATCACTAGT ACTGCTGGGA TCTGGTATGA CTTTAGCTGG TGGATATTTA
GCAGGAAACC AGCAACAATT GGCACAAAAA GCATCTGACT TGGCCGTGAG CCGAGTAGAT
GCAGCACCGC CATTAGGAAA TAACACAGAC CCCAACTTTG TAACTCAAGT AGTACAGAAA
GTTGGGCCGG CTGTTGTGCG TATTGAAGCG TCTCGGACTG TCACATCTCG ATTACCAGCC
GAATTTAACG ATCCATTTTT CCGTCGCTTC TTCGGTTCCC AACTACCTCA ACAACAAGAG
AGAGTACAAC GGGGTACTGG TTCCGGGTTT CTTATTAGTG CTGATGGTAG TATTCTTACC
AATGCTCACG TTGTTGATGG TGCAGATACG GTGCGAGTCA TCCTCAAAGA TGGGCGCAGT
TTTCAGGGTA AGGTATTAGG TACAGATAAT TTAACAGATG TAGCTGTTGT CAAAATTCAG
GCAAATAACT TGCCGACCTT AGCAGTGGGT AATTCTGACC AATTACAACC TGGACAATGG
GCGATCGCTA TTGGTAATCC TTTAGGTTTA GATAACACCG TCACCACAGG TATAATTAGT
GCAACTGGAC GTACCAGCAA TCAAATTGGC GCACCAGATA AGCGTGTAGA ATATATTCAA
ACTGACGCAG CAATTAATCC AGGTAACTCT GGTGGCCCCT TGCTGAACTA TCGTGGTGAA
GTTATTGGGA TGAATACCGC CATTATTCAA GGCGCGCAGG GTCTAGGTTT TGCCATCCCT
ATCAAAACAG CACAGCGTAT TTCTAATCAA CTTATAGCCA CAGGTAAAGT ACAGCATCCT
TACCTGGGTA TTCAAATGGT AGGATTAACA CCCCAAGTCA GACAAAACAT TAACTCTGAC
CCCAATAGTG GTTTGAGTGT TGATACAGAC AAGGGTGTTT TAGTGGTCAG AGTCATGCCA
AATTCGCCAG CCGCAAGAGC AGGGTTACGC GCTGGCGATG TTATTCAAAA GCTGAACGGC
CAATCTGTTA CGGATGCTAG TAATGTACAA AGAGCCGTAG AGAACGCTCA AGTCGGTGGA
CAATTGCAGC TAGAATTATG GCGCAATGGT CGAAATGTTA ACTTAGCTGT ACAAGCAGGC
GCTTTCCCGA CTCAACAGGT GGAATAG
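
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any analysis. The sketch below shows one way to strip the layout and compute a GC percentage. It assumes GC content is the plain fraction of G and C bases, which may not be exactly how the record's 45% figure was derived; `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste all lines to analyse
# the complete gene.
raw = "ATGCTATCTA AAGTAATAAT AAACATAGTT TCCTCGCTCA AGGCCATGCA AAACCAAGTA"
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq), 1))
```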
 
Protein sequence
MLSKVIINIV SSLKAMQNQV HDESQPLNPK NQNHAPWKKA AASLSLVLLG SGMTLAGGYL 
AGNQQQLAQK ASDLAVSRVD AAPPLGNNTD PNFVTQVVQK VGPAVVRIEA SRTVTSRLPA
EFNDPFFRRF FGSQLPQQQE RVQRGTGSGF LISADGSILT NAHVVDGADT VRVILKDGRS
FQGKVLGTDN LTDVAVVKIQ ANNLPTLAVG NSDQLQPGQW AIAIGNPLGL DNTVTTGIIS
ATGRTSNQIG APDKRVEYIQ TDAAINPGNS GGPLLNYRGE VIGMNTAIIQ GAQGLGFAIP
IKTAQRISNQ LIATGKVQHP YLGIQMVGLT PQVRQNINSD PNSGLSVDTD KGVLVVRVMP
NSPAARAGLR AGDVIQKLNG QSVTDASNVQ RAVENAQVGG QLQLELWRNG RNVNLAVQAG
AFPTQQVE
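
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal example, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. This gene starts with ATG, so no special start handling is included. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 27 bases of the gene sequence given in this record.
first_block = "ATGCTATCTA AAGTAATAAT AAACATAGTT"
last_block = "GCTTTCCCGA CTCAACAGGT GGAATAG"
print(translate(first_block))  # MLSKVIINIV
print(translate(last_block))   # AFPTQQVE
```

The two outputs match the first ten and last eight residues of the protein sequence above. Passing the full gene sequence to `translate` should yield the complete protein.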