Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2309 |
Symbol | |
ID | 4445352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2597828 |
End bp | 2598808 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690118 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_831789 |
Protein GI | 116670856 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00460586 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCAG CCAGCAGCGG GATCAACGGA GCCGACACTA TTGAAGACCC GCTGGATTCC TATTCCGCAA CAGTCATCCG CGTCGCGGAA ACCGTCACCC CGCACGTAGC CGCCGTCGAA ATGGCCGGCA CGGGCCGCAA CGGCGGATAC CGGGTTGGCG CGGGGTCGGC CGTCCTGTTC ACCTCCGACG GCTACCTGGT GACCAACGCG CATGTGGTGG GCTCCGCGGG AAAGGGGCAC GCGGTGTTTG CCGACGGCAC CCGGACGGCT GTTGAAGTGG TGGGTGCCGA TCCCCTGTCC GACCTCGCTG TTGTCCACGG CAAAGCACCC ATGGTCCGGC CGGCCGAATT CGGCGACGCC GAACTCCTCA AGGTCGGGCA GCTGGTGATC GCTGTCGGTA ATCCGCTGGG ACTCTCGGGC TCCGTGACCG CAGGGGTGGT CAGCGGCCTC GGCCGTTCCA TCCCGGTGTG GTCGGGACGC AACCGGCGCG TGATCGAGGA CGTCATCCAG ACCGACGCCG CGCTTAATCC CGGCAACTCC GGAGGGGCCC TGGCCGACGC ACGGGGCAGG ATCGTGGGCA TTAACACGGC GGTCGCAGGA GCGGGACTGG GTTTGGCGAT TCCTATCAAC GCGACGTCAC GCCGGATTAT CGCCTCCCTC CTCTCCGACG GGCGGGTCCG GCGCGCTTAC CTGGGACTCG TGAACACTCC CGTTCAACTT CCGGTCAGCT CGGTGGTCCG CACCGGCCAC CGGGATGGGC TGCTGGTTGT CGAAGTGCTT CCCGGATCAC CTGCCGAACG GGCGGGCCTC CGCGCCGGGG ACGTGCTGTT GAGCGTGGGG CGGAAATCCG TTTCGAACGC GGAAAGCCTC CAGAAGCTGC TGTTCTCGGA GGCCATCGGG GCACCATTGG ACATTTCGGC GCTCCGCGAT GGAAAAGAAT TCCACGTTGT GGCCGTACCG GAGGAAATGA GCGCCCCGTA A
|
Protein sequence | MAAASSGING ADTIEDPLDS YSATVIRVAE TVTPHVAAVE MAGTGRNGGY RVGAGSAVLF TSDGYLVTNA HVVGSAGKGH AVFADGTRTA VEVVGADPLS DLAVVHGKAP MVRPAEFGDA ELLKVGQLVI AVGNPLGLSG SVTAGVVSGL GRSIPVWSGR NRRVIEDVIQ TDAALNPGNS GGALADARGR IVGINTAVAG AGLGLAIPIN ATSRRIIASL LSDGRVRRAY LGLVNTPVQL PVSSVVRTGH RDGLLVVEVL PGSPAERAGL RAGDVLLSVG RKSVSNAESL QKLLFSEAIG APLDISALRD GKEFHVVAVP EEMSAP
|
| |