Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_27550 |
Symbol | |
ID | 7761662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2832233 |
End bp | 2833261 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643805631 |
Product | putative periplasmic protease |
Protein accession | YP_002799904 |
Protein GI | 226944831 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTGG AGTTTTTTCT CGATTATGCC GGTTTTCTGG CCCAGACCGT GACAGTGGTG GTGGCCGTTC TGGTGATATT GGTCGCCATC GCGGTCTTGC GCAGCAGAAG CCGTCATCGG GGCGAAGGGC ATCTCGAGGT GCACAGGCTC AATGATTTCT ATAAGTCCCT GCGGGAAAGT CTAGAGCAGA TAGTGCTGGA CAAGGACCGC CTCAAGACCC GGCGCAAGGC GGAGGCCAAG GCCGAGAAAC GGGAGCGGAA GGAGGGCAAG ACGAAGCCCA GGCTTTTCGT GTTGGACTTC GATGGCGACA TCCGGGCTTC CGCTACCGAC AAGCTGCGTC ATGAAGTGAC GGCCGTGCTG AGCATGGCCA AGCCCGAAGA CGAAGTCGTG TTGCGCCTGG AGAGCGGGGG CGGGTTGGTG CACAGCTATG GCCTGGCCGC TTCGCAATTG GTCCGTATCC GCCAGGCTGG TGTACCCCTG ACCGTATGCG TCGACAAGGT GGCCGCCAGC GGCGGTTACA TGATGGCTTG CATCGGTGAT CGGATTCTTT CCGCGCCTTT CGCCATCCTC GGCTCCATCG GTGTGGTGGC GCAGTTGCCC AATGTGCATC GGTTGCTGAA AAGGCATGAC ATCGACTTCG AAGTGCTCAC CGCCGGCGAG TACAAGCGCA CTTTGACGGT ATTTGGTGAG AACACCGAGA AAGGCCGGGA AAAGTTCCAG GAGGACCTGG AAACCACCCA TGAGCTGTTC AAGAACTTCG TTGCCCGCTA CCGGCCGCAA TTGACCATCG ATGAGATCGC CACCGGCGAG ATCTGGCTCG GGCAGAGTGC GCTGGAGAAG CAACTGGTCG ATGAGTTGAT GACCAGCGAC GAATACCTGG CGGCGAAGGC CGGCAGTGCG GAGCTGTTCC AACTGCATTA CGCCGAAAGA AAGAGTCTGC AGGAGCGTTT CGGCTTGGGG GCTGCCCTGG CCGTGGACCG TGTGCTTCTG AACTGGTGGG AGCGCTTGAG TCGAAGTCGC TATCAGTGA
|
Protein sequence | MAVEFFLDYA GFLAQTVTVV VAVLVILVAI AVLRSRSRHR GEGHLEVHRL NDFYKSLRES LEQIVLDKDR LKTRRKAEAK AEKRERKEGK TKPRLFVLDF DGDIRASATD KLRHEVTAVL SMAKPEDEVV LRLESGGGLV HSYGLAASQL VRIRQAGVPL TVCVDKVAAS GGYMMACIGD RILSAPFAIL GSIGVVAQLP NVHRLLKRHD IDFEVLTAGE YKRTLTVFGE NTEKGREKFQ EDLETTHELF KNFVARYRPQ LTIDEIATGE IWLGQSALEK QLVDELMTSD EYLAAKAGSA ELFQLHYAER KSLQERFGLG AALAVDRVLL NWWERLSRSR YQ
|
| |