Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1904 |
Symbol | |
ID | 4445558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2143256 |
End bp | 2144506 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639689714 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_831386 |
Protein GI | 116670453 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0620222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTA AGCGAAGTCC CACCATTTTC TGTGCGGTTG CGGCGGTTGT GAGCTTATGT CTTTGGACAA CAGGATGCAC AAATGGGTCT GGACCTGCCC CTGAGTCCTC GACTAGTTTG TCCGGGCCGC CCACCACGCA AACGCCTGTC CCAAGCGAGA AAACCGCTGC CCCCAGCCTG AGTCCATCCC CGAGCCCGAG TCCAAGCACG GAGGCCGTTC CCAGCAGCTG GCCGGACGTT GTTGCCCAGA CGCAGTCGGG TGTCGGGCAG TTCAGCGTGA CGAGGTGTGA GAACAGCTTC ACGGGCACGG GGTTCCTTGT TGGTCCTGAT CTGGTTGTCA CCGCTGCGCA CATGGTGCGG GACGCGTCCG CGATAAGCAT TTCGTTTGGT CGAACAACGG TAAATGCCAC AACACTGGGC ACAAACGAAC TCGCTGATTT GGCGCTCGTA AGAACCGAAA CCCCAGTTCA AGGTCATCAG TTTCAGTTCA GAACAACTGA ACCGCCAATT GGAACGGATG TCGCTGCCCT CGGATTTCCC TTAGGTCGGC CTTTCACTTT CACCCGGGGG ACAGTAAGCG CTCTGAACGC GGAACAGAGA ATTGGTAGCC GAGTTCTTAG CAATCTGATT CAAACCGATA CGGCTATCAA CCATGGAAAC AGCGGTGGAC CCCTTATTAC CCAGGATGGT CAGGTTTCTG GCGTTATCGT GACCATTGAA TTCGACGAAA ATGTCCGGGC CGAAGGCATC GCCTACGCGG TGACTGCGCC ACGGGCTGCT GCCGCCGTTC AGGAATGGCA GAAACGATCG GTGCCGGTAA CGCTCAAAGA CTGTGGCAAC GCCCCGGCAC CGGGTTCAGG ATCTTTCCCG CTAACCGTCT TGTCCAGCCA CGATCAAGCA CGCAACATCG GGCAGAGCCT CCTGCTGCAT GGCCAAGGCA TCAACCAAGG GGCCTACGCC GCCGCCTTCA AGCAATTCAC TCCCGAACTC CAGGCAACTT TCGGTGACTC AGTTGCATGG AGCGCTGAGC TTGGATCCTC GTACTGGCAA AAGGTCGAAA TCGTAGACGT TACAGGCAGC GGCGATGCCC TCTCCGCTGA TGTGAACCTA CAAACACGCC AAGATGCAGC GCACGGCAGA AACGGCCAAA CCTGTTCGAA CTGGAAGCTC CGCTATGCAA TGCATTGGGA CGGCAGCGCC TGGCTCATAG CCGGCACGTC ATTACCCTTC GGTGAACCCA CGGCCTGCTG A
|
Protein sequence | MSIKRSPTIF CAVAAVVSLC LWTTGCTNGS GPAPESSTSL SGPPTTQTPV PSEKTAAPSL SPSPSPSPST EAVPSSWPDV VAQTQSGVGQ FSVTRCENSF TGTGFLVGPD LVVTAAHMVR DASAISISFG RTTVNATTLG TNELADLALV RTETPVQGHQ FQFRTTEPPI GTDVAALGFP LGRPFTFTRG TVSALNAEQR IGSRVLSNLI QTDTAINHGN SGGPLITQDG QVSGVIVTIE FDENVRAEGI AYAVTAPRAA AAVQEWQKRS VPVTLKDCGN APAPGSGSFP LTVLSSHDQA RNIGQSLLLH GQGINQGAYA AAFKQFTPEL QATFGDSVAW SAELGSSYWQ KVEIVDVTGS GDALSADVNL QTRQDAAHGR NGQTCSNWKL RYAMHWDGSA WLIAGTSLPF GEPTAC
|
| |