Gene Information |
Locus tag | Avin_14850 |
Symbol | |
ID | 7760421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1467351 |
End bp | 1468328 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643804383 |
Product | Peptidase S49, SppA |
Protein accession | YP_002798676 |
Protein GI | 226943603 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
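The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the Start bp and End bp rows of the table.
gene_length, protein_length = cds_lengths(1467351, 1468328)
print(gene_length, protein_length)  # 978 325
```

Under those assumptions the result agrees with the Gene Length (978 bp) and Protein Length (325 aa) rows.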
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.772119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACG ACGAGTGGAA GTCGCCGCCG GAGGGGGGCG ACGAGAAGAG CTGGAAACTG CTGGAAAAGA CCTTGCTCGC CACCGTGGAG GAGCGGCGGC GGGCGCGTCG TTGGGGGATC TTCTTCAAGT TGCTGGGGTT CGCCTATCTG GTGGCGCTCC TGGCGATGTT CTCCCCGGCG CTGAGCCTCA GGGAGGCGGC GCGTAGCGGC GAGCACACGG CGCTGGTCGA GGTGCGCGGG ATGATCGCCG ACGACGAGGC CGCCAGTGCC GACAACGTCG TCGGCAGTCT GCGGGCCGCG TTCAAGGACA AGCACACCAA GGGTGTGGTG CTGCGTATCA ACAGTCCCGG CGGCAGTCCC GTGCAGTCCG GCTACATCTA CGACGAGATC CGCCGGCTGC GTGCCGAGCA TCCGGATACC AAGCTCTATG CGGTCATTAC CGATCTCGGT GCTTCCGGCG CCTACTATGT CGCCAGCGCC GCCGACGCCA TCTATGCGGA CAAGGCCAGC CTGGTCGGCT CGATCGGGGT AACGGCGGCA AGCTTCGGTT TTGTCGGGGC GATGGAGCGG CTGGGCGTCG AGCGTCGGGT CTATACGGCC GGCGAGCACA AGGCGTTCCT CGATCCGTTC CAGTCGCAGA AGGAGGGAGA GGTGCGCTTC TGGCAAGAGG TGCTGGAGGT CACCCACCGG CAGTTCATCG ACAGTGTCAA GCAGGGGCGC GGCGAACGGC TGAAGGACAA GGAGCATCCC GAACTGTTCT CCGGTCTGGT CTGGTCCGGC GAGCAGGCCT TGCAACTGGG CCTGGTCGAT GCCCTGGGCA GCGCCAGTCA TGTGGCTCGC GAAGTGGTGG GGGCAGAGGA TCTGGTCGAT TTCACGGTGC GGGAAACCCC CTTCGATCGT TTCGCCAAGA AGCTGGGAAG CGGGGTGGTC GAGCGTCTTG GCGTGTGGAT GGGCCTGCAG GCGCCGGTCC TGCGCTGA
|
Protein sequence | MTNDEWKSPP EGGDEKSWKL LEKTLLATVE ERRRARRWGI FFKLLGFAYL VALLAMFSPA LSLREAARSG EHTALVEVRG MIADDEAASA DNVVGSLRAA FKDKHTKGVV LRINSPGGSP VQSGYIYDEI RRLRAEHPDT KLYAVITDLG ASGAYYVASA ADAIYADKAS LVGSIGVTAA SFGFVGAMER LGVERRVYTA GEHKAFLDPF QSQKEGEVRF WQEVLEVTHR QFIDSVKQGR GERLKDKEHP ELFSGLVWSG EQALQLGLVD ALGSASHVAR EVVGAEDLVD FTVRETPFDR FAKKLGSGVV ERLGVWMGLQ APVLR
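The gene and protein sequences above are related by translation table 11. As a rough sketch of how that relationship can be checked, the code below builds a codon table and translates a space-separated DNA string. It relies on the assumption that table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in permitted start codons. The names `CODON_TABLE`, `translate`, and `gc_content` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG order. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(seq):
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 18 bases of the gene sequence in the record.
print(translate("ATGACGAACG ACGAGTGGAA GTCGCCGCCG"))  # MTNDEWKSPP
print(translate("GCGCCGGTCC TGCGCTGA"))               # APVLR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence row, and `gc_content` on the same string can be compared with the GC content row.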
|
| |