Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_46320 |
Symbol | |
ID | 7763496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4706385 |
End bp | 4707482 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643807477 |
Product | serine peptidase |
Protein accession | YP_002801713 |
Protein GI | 226946640 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.497554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGATC CTTTCCTTGT CCGCCTGCTG GGCGTCACGC TGGCGGTCGG TGCGCTGGCC GTGTCCTGGC CCGGCGCCCG GCCGGAGGCC GCGCCGCGCG CGGTCGAGGC CCGCAGCGAA CTGGCGGCGG ACGAGAAGAG CACCATCGAT CTGTTCGAGC GCTCCCGCAA CTCGGTGGTG TTCATCACCA CCCGGGCCCA GGTGATGGAC TTCTGGACCC GCAACGTCTT CTCGGTGCCG CGTGGCACCG GCTCGGGCTT CGTCTGGGAC GATGCCGGGC ATGTCGTGAC CAACTTCCAC GTGGTGGAAG ATGCCAGCGA GGCGCTGGTC AAGCTGGCCG ACGGGCGCAC CTTCAAGGCC AGCCTGGTGG GCAGCTCCCG CGAGCACGAC ATCGCCGTGC TCAGGATCGA CATCGACGTC GGGCGGCCGT CTCCGGTGCC CCTGGGTTCC AGCCACGATC TGCGGGTGGG CCAGAAGGTC TTCGCCATCG GCAACCCCTT CGGCCTGGAC TGGACGCTGA CCACCGGGAT CGTCTCGGCC CTGGACCGCA CCCTGGCGGG CGAGGGCGGA CCGGCGATCA ACCATCTGAT CCAGACCGAC GCGGCGATCA ATCCCGGCAA TTCCGGCGGG CCGTTGCTGG ATTCGGCCGG CCGGCTGATC GGCATCAACA CCGCCATCTA CAGCCCGAGC GGCGCCTCCG CCGGGATCGG TTTCGCCGTG CCGGTGGACA CCGTCAACCG GGTCGTGCCT CAGCTCATCG ATACCGGCAA GTACGTGCAG CCGACCCTGG GCATCCAGGT AGATAGCGGG GTGAACCAGC GCCTCGGCGA ACTGAGCGGC ATCGAGGGCG TCTTCGTGCT CGGCGTGAAA CCGGGCTCGG CCGCCGAGGC GGCGGGCCTC GAGGGCGCGG CCCTGACCCG CGACGGCGGC ATAGTGCCCG GCGACATAGT CACCGCCGTC GACGGCAAGG CGGTCGACTC GGTCGAGCGC TTGCTGGCGA TCCTCGACGA CTACCGGGCC GGCGACCGGG TACGGCTTTC CGTGAAGCGC GGCGAGCGGC AGCGCGAGGT CGAGCTGGTA TTGCGCCAGG GGGAGTGA
|
Protein sequence | MRDPFLVRLL GVTLAVGALA VSWPGARPEA APRAVEARSE LAADEKSTID LFERSRNSVV FITTRAQVMD FWTRNVFSVP RGTGSGFVWD DAGHVVTNFH VVEDASEALV KLADGRTFKA SLVGSSREHD IAVLRIDIDV GRPSPVPLGS SHDLRVGQKV FAIGNPFGLD WTLTTGIVSA LDRTLAGEGG PAINHLIQTD AAINPGNSGG PLLDSAGRLI GINTAIYSPS GASAGIGFAV PVDTVNRVVP QLIDTGKYVQ PTLGIQVDSG VNQRLGELSG IEGVFVLGVK PGSAAEAAGL EGAALTRDGG IVPGDIVTAV DGKAVDSVER LLAILDDYRA GDRVRLSVKR GERQREVELV LRQGE
|
| |