Gene Ava_5068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_5068 
Symbol 
ID3683213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6364426 
End bp6365676 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content45% 
IMG OID637720429 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_325560 
Protein GI75911264 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.705563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACAT ATAATGGCGC ACATAACAAG ATTTATACTT GGAGTCTGCC CCACAAGGGT 
GGCAGTAGTG CGGTTTTGAT GCTGCTGGGT GGGGTAACAG CTATGTTTTT AGGAAGCTGC
TCCCTTCTAC CCACAAGGAC AATACAATCC CAAGCTAATC AATCTCAGCC CCAGTCGAAT
GATAATAGTC CAGCAATTGT CCCCCCAGCT ATTTTTTCAT CCACTGGCGA CCCTAATTTT
GTCGTCGGAG TAGTACAAAA AGTGGGAGGG GCTGTAGTGC GGATTGATTC TGCCAGAACA
GTAACTTCCA GAGTTCCAGA TGGATTTAAT GATCCTTTTT TCCGCCGTTT TTTCGGAGAT
GGAGTGCAAG CACAACCAAG ACAGCGCGTA GAAAGGGGTA GCGGTTCAGG TTTTATTATT
AGTTCCTCTG GTCAAATTTT AACTAATGCT CATGTTGTCG ATGGTGCTGA TGAGGTAACA
GTTACCCTCA AAGATGGTAG GACTTTTGAT GGTAAGGTAC TTGGTGAAGA CCCAGTAACG
GATGTAGCTG TTATTAAAAT AAACGCTAAT AACTTGCCAA CTGTTGCTGT CGGTAATTCT
GAAGTTTTAC AACCAGGTGA AGCGGTTATT GCGATCGGTA ATCCTCTAGG CTTGAATAAT
AGTGTTACGT CAGGAATTAT CAGCGCCACA GGTCGTTCTA GTACTGATAT TGGCGCAAGT
GATAAGCGCG TTGACTATCT GCAAACAGAT GCGGCGATTA ATCCTGGTAA CTCTGGCGGC
CCCCTGCTCA ATGCTCGCGG TCAGGTAATT GGGATGAACA CAGCTATTAT CCAAGGCGCT
CAAGGTTTGG GATTTGCTAT TCCTATTAAT ACTGTGCAGA AAGTTGCTCA GGAATTAATC
ACTCAAGGTA AGGTAGATCA TCCCTATTTG GGTGTACAGA TGGCAACCCT CACGCCACAA
GTTAAGGAAA GAATTAACGA AAGATTGGGC GATCGCATCA ATATTACAGC AGATAGAGGC
GTTTTATTAG TTCGTATCGT CCCTGGTTCT CCCGCCGCCA ATGCCGGACT CAGACCAGGA
GATATTATTC AAAGTATTAA TAACCAATCT GTCACAACCG TTGAAGAAGT CCAAAGAATT
GTGGAAAATA GCCAAATAGG TAACCCTTTA CAAGTCCAAA TAGAACGCAA TGGTCGAACA
ACACAGGTAG CCGTCAGTCC AGCACCTTTA CCTGTGCAAC GAGAAGGGTA G
 
Protein sequence
MKTYNGAHNK IYTWSLPHKG GSSAVLMLLG GVTAMFLGSC SLLPTRTIQS QANQSQPQSN 
DNSPAIVPPA IFSSTGDPNF VVGVVQKVGG AVVRIDSART VTSRVPDGFN DPFFRRFFGD
GVQAQPRQRV ERGSGSGFII SSSGQILTNA HVVDGADEVT VTLKDGRTFD GKVLGEDPVT
DVAVIKINAN NLPTVAVGNS EVLQPGEAVI AIGNPLGLNN SVTSGIISAT GRSSTDIGAS
DKRVDYLQTD AAINPGNSGG PLLNARGQVI GMNTAIIQGA QGLGFAIPIN TVQKVAQELI
TQGKVDHPYL GVQMATLTPQ VKERINERLG DRINITADRG VLLVRIVPGS PAANAGLRPG
DIIQSINNQS VTTVEEVQRI VENSQIGNPL QVQIERNGRT TQVAVSPAPL PVQREG