Gene Information |
Locus tag | Avin_04640 |
Symbol | |
ID | 7759422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 438366 |
End bp | 439685 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803386 |
Product | carboxyl-terminal protease S41A |
Protein accession | YP_002797694 |
Protein GI | 226942621 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.900151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTACC TGCCCCGCCC CCTGTCGCTG GCCCTGGTCG CCGCCCTGGC GCTCGGCGCT CCCCTCCTGC GCGCCGAGGA GGCGCCCGCG CCAGCCGCCG AGACGCGAGG CAAGCCCCTC CTGCCCCTCG AAGAGCTGCG CACCTTCGCC GAGGTCATGG ACCGCATCAA GGCCGCCTAT GTCGAGCCGG TGGACGACAA GACCCTGCTG GAAAACGCCA TCAAGGGCAT GATCAGCAAT CTCGATCCGC ACTCGGCCTA CCTCGAGCCC GAGGAATTCC TCGACCTGCA GGAGAGCACC AGCGGCGAGT TCGGCGGTCT CGGCGTCGAG GTCGGCATGG AGGACGACCA GCTCAAGGTG GTCGCGCCGA TCGACGACAC GCCGGCGGCC AAGGCCGGCA TCGAGGCCGG CGACCTGATC GTCCGGATCG ACGGCCAGCC GACCAAGGGC ATGTCCATGC TCGAAGCCGT GGACAAGATG CGCGGCAAGC CCGGCAGCAA GATCGAGCTG ACCCTGGTGC GCGAGGGCGG CAGGCCGTTC GATGTCAGCC TGACCAGGGC GGTGATCAAG GTCAAGAGCG TGAAGAGCCA GTCGCTCGAG TCCGGCTACG GCTACCTGCG CATCACCCAG TTCCAGGTCA ACAGCGGCGA GGAAGTCGGC AAGGCGCTGG CGCGCCTGAA GCAGGAGAAC GGCGGACAGA AGCTGAAGGG CCTGGTGCTG GACCTGCGCA ACAACCCCGG CGGCGTGCTG CAGTCGGCCG TCGAGGTGAC CGACCATTTC CTCACCAAGG GCCTGATCGT CTACACCAAG GGCCGCATCG CCAACTCCGA ACTGCGCTTC TCCGCCGACC CGGCCGACGC CAGCGAAGGC GTGCCGATGG TGACGCTGAT CAACGGCGGC AGCGCCTCGG CCGCGGAGAT CGTCGCCGGC GCCCTGCAGG ATCACAAGCG GGCGGTGCTG ATGGGCACCG ACAGCTTCGG CAAGGGCTCG GTGCAGACCG TGCTGCCGCT GAACAACGAC CGCGCCCTGA AGCTGACCAC CGCTCTCTAC TTCACCCCCA ACGGCCGTTC GATCCAGGCT CAGGGCATAG TGCCGGACAT CGTGGTGGAG CGCGCCAAGC TGACCCAGGA CGCCCAGCAG GAACACCTGC GCGAGGCCGA TCTGGCCGGC CACCTGGGCA ACGGCAACGG CGGGCCGGAC AAGCCCAGCG GCAAGGCCGG GCAGGAAGGC AAGGCGCGTC CGCAGGACGA CGACTACCAG TTGAGCCAGG CGCTCAACCT GCTCAAGGGC CTCAACCTGA CCCGCGGACT GCAGCGGTAA
|
Protein sequence | MPYLPRPLSL ALVAALALGA PLLRAEEAPA PAAETRGKPL LPLEELRTFA EVMDRIKAAY VEPVDDKTLL ENAIKGMISN LDPHSAYLEP EEFLDLQEST SGEFGGLGVE VGMEDDQLKV VAPIDDTPAA KAGIEAGDLI VRIDGQPTKG MSMLEAVDKM RGKPGSKIEL TLVREGGRPF DVSLTRAVIK VKSVKSQSLE SGYGYLRITQ FQVNSGEEVG KALARLKQEN GGQKLKGLVL DLRNNPGGVL QSAVEVTDHF LTKGLIVYTK GRIANSELRF SADPADASEG VPMVTLINGG SASAAEIVAG ALQDHKRAVL MGTDSFGKGS VQTVLPLNND RALKLTTALY FTPNGRSIQA QGIVPDIVVE RAKLTQDAQQ EHLREADLAG HLGNGNGGPD KPSGKAGQEG KARPQDDDYQ LSQALNLLKG LNLTRGLQR
|
| |
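The derived fields in this record (Gene Length, Protein Length, GC content) follow from the coordinates and the sequence by simple arithmetic. The sketch below is one way to check them, not the database's own method. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a single stop codon (the gene sequence above ends in TAA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 438366
END_BP = 439685
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; spaces and letter case are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Coordinates and lengths listed in the record agree with each other.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field (with its space-separated 10-base blocks) to `gc_percent` gives a value that can be compared against the listed GC content.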