Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_12950 |
Symbol | algW |
ID | 7760237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1258768 |
End bp | 1259919 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804197 |
Product | Htr-like protease |
Protein accession | YP_002798496 |
Protein GI | 226943423 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAGG CCCTGCGTTT TCTTGGCTGG CCCCTGGCGG TCGGAGTGCT GCTGGCCCTG CTGATCATCC AGCGTTATCC GGAGTGGGTC GGCTTGCCCC GGCAGCTTGC CGATGAGCAG CAACTGTCCC GCTCCATCCT CGCTCCCCAA GGCCCCGTCT CCTATGCCAA TGCCGTGAGC AGCGCCGCAC CGGCGGTAGC CAACCTGTAC ACCACCAAGG TCGTCAAGAA ATCGAACCAG CCGCTGTTCG ACGATCCACT GCTGCAACAG TACTTCGGCA ACTCCCTGCC CAGTCAGCGG CGCCTGGAAT CCAGCCTGGG GTCTGCGGTG ATCATGCGCC GGGATGGCTA CCTGCTGACC AACAACCACG TCACCGCCGG TGCCGACCAG ATCGTCGTCG CCCTGCGGGA CGGACGGGAA GTCCTCGCCC GGGTGATAGG CAACGATTCG GAAACCGATC TGGCCGTGCT CAAGATCGAT CTGGACGAAT TGCCGGTCAT GCATCTCGGA CGCTCCGACA GCATCCGCAT CGGTGATGTC GCCCTGGCCA TCGGCAACCC CTTCGGCGTC GGCCAGACCG TGACCATGGG CATCATCAGC GCCACCGGGC GCAACCAACT GGGCTTGAAT ACCTACGAGG ACTTCATCCA GACCGATGCA GCGATCAATC CGGGCAATTC GGGCGGCGCG CTGATCGATG CCAATGGCTA TCTGATCGGC ATCAATACCG CCATTTTCTC CAAGTCGGGC GGATCCCAGG GTATCGGCTT CGCGATTCCG GCCAAGCTTG CCCTGGAGGT GATGGAGGAA ATCATCAAGC ACGGTCAGGT AATTCGCGGC TGGCTCGGAC TCGAGGTGCA ACCACTGACC AAGGAGTTGG CCGAATCCTT CGGCCTGGAA GGCCGGCCAG GCATCGTCGT CGCCGGCATA TACCGTGACG GCCCCGCACA ACGGGCCGGT CTGCAGCCGG GCGACCTGAT CGTCAGTATC GATGGCCAGC CGGCCACCGA TGGACGCCAT GCCATGAATC AGGTCGCCCA GACTCGACCG GGAGAAACCA TCGAAATCGA GGTCCTGCGC AACGGCCAAG CCCTCACCCT TAGCGCCGAG ATCGGCCTGC GCCCGCCACC CACCGCCGTG CAGCAGCCAT GA
|
Protein sequence | MLKALRFLGW PLAVGVLLAL LIIQRYPEWV GLPRQLADEQ QLSRSILAPQ GPVSYANAVS SAAPAVANLY TTKVVKKSNQ PLFDDPLLQQ YFGNSLPSQR RLESSLGSAV IMRRDGYLLT NNHVTAGADQ IVVALRDGRE VLARVIGNDS ETDLAVLKID LDELPVMHLG RSDSIRIGDV ALAIGNPFGV GQTVTMGIIS ATGRNQLGLN TYEDFIQTDA AINPGNSGGA LIDANGYLIG INTAIFSKSG GSQGIGFAIP AKLALEVMEE IIKHGQVIRG WLGLEVQPLT KELAESFGLE GRPGIVVAGI YRDGPAQRAG LQPGDLIVSI DGQPATDGRH AMNQVAQTRP GETIEIEVLR NGQALTLSAE IGLRPPPTAV QQP
|
| |