Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0225 |
Symbol | |
ID | 4447316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 235433 |
End bp | 236530 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639688021 |
Product | PDZ/DHR/GLGF domain-containing protein |
Protein accession | YP_829726 |
Protein GI | 116668793 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTCAG CGCGAGCCGG GACCGGCGTC CGGGCCGGCG CAGCCCTGGC CCTCTTGGCA ATCCTGGGCT CCGCCGTGAC CGGATGCACG GGGGCGGACC AGCGCCCGGC CCCTGCGTCC TCCTCAGCTG CAGGATCCCC TGCTACAGGC TCCTCCGGGA CAGCCCCCGC TACCCGGCCG GGCACCCCGC AGGCTGCCGC GGACATCCCG TCCATCGTGG AGAACGTCCA GCCGTCCGTT GTCACGGTGC TCACCCACGG CGGACTGGGC AGCGGCGTGG TCTTCGCGGC GGAGGGACTG ATCCTCACCA ACGAGCATGT GGTGCGGGGC AACACCGAGG TGGAGATCGC CTTTGCCGAC GGCCAACGGG TTGCAGGCAC AGTCAAGGCC ACGGATGCCA TTTCGGACCT GGCCCTCGTG GAGGCGAAGC GCACAGGGCT TCCTGCCGCA AAGTTCCAGT CCGAACTCCC CCGGGTGGGC GAACTGGCAA TCGTGATCGG CTCTCCGCTG GGTTTCGAAA ACACGGCGAC GGCGGGCATC ATCTCCGGTT TGCACCGCGA GATCCCGGGG TCGGCGGCCA GCAGCCAGTC GCTGGTGGAC CTGATCCAGA CTGATGCGGC CATCAGCCCC GGGAACTCCG GCGGCGCGGT GGTGAATTCC CGGGGCGAGG TGATCGGCAT CAGCGAGGCG TACATCCCGC CGCAGTCCGG GGCGGTGGCG CTGGGCTTCG CGATCCCCGC GGCGACCGCC GTCCGGGTAG CCGGGCAGCT GCGCGAGGAC GGGACAGCGG ACCACGCCTT CATCGGACTC CGCCCGGGCG AGATCACCTC GCAGATCGCA GACACGCTGG GACTGGAAAA CACCCGCGGC GCCCTGGTGC TCTCGGTGGT GGACGGCGGC CCCGCGGACC GTGCGGGCAT CCGGCCCGGG GACGTCCTTG TCTCCTTGGA CGGCAAGGAA CTGGCCTCAC CCGAGGACCT GCTCGCCGAA CTCCGCGGCA AGAACCCGGA CCAGACCGTT AACGTCGGCT ACCGCCGGGG CACGGAAGCC AAGGAAGCCA AGGTCACCCT CGCTGCCCGC CCCGCCTCGG GGGGCTGA
|
Protein sequence | MISARAGTGV RAGAALALLA ILGSAVTGCT GADQRPAPAS SSAAGSPATG SSGTAPATRP GTPQAAADIP SIVENVQPSV VTVLTHGGLG SGVVFAAEGL ILTNEHVVRG NTEVEIAFAD GQRVAGTVKA TDAISDLALV EAKRTGLPAA KFQSELPRVG ELAIVIGSPL GFENTATAGI ISGLHREIPG SAASSQSLVD LIQTDAAISP GNSGGAVVNS RGEVIGISEA YIPPQSGAVA LGFAIPAATA VRVAGQLRED GTADHAFIGL RPGEITSQIA DTLGLENTRG ALVLSVVDGG PADRAGIRPG DVLVSLDGKE LASPEDLLAE LRGKNPDQTV NVGYRRGTEA KEAKVTLAAR PASGG
|
| |