Gene Information |
Locus tag | Arth_2892 |
Symbol | |
ID | 4444449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3258453 |
End bp | 3259523 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639690715 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_832371 |
Protein GI | 116671438 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
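The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the record above.
length = gene_length(3258453, 3259523)
print(length, protein_length(length))  # 1071 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields.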
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.266312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGAA CACAGCCCCT AGTCCTGGGC ATCGAGTCCT CCTGCGATGA GACAGGTGTG GGAATCGTGC GCGGAACTGC GCTGCTCAGC AACACTGTGT CATCCTCCAT GGAAGAGCAT GTCCGCTTCG GCGGAGTCAT CCCCGAGATC GCCTCCCGTG CACACCTGGA CGCCTTCGTG CCCACCCTCC AGGAAGCCCT CGCGGACGCG GGAGTCCAGC TTGACGACGT GGACGCGATC GCCGTCACTT CCGGTCCCGG GCTGGCCGGG GCTTTGATGG TGGGCGTGTG CGCCGCCAAG GCGCTCGCGG TGGCCACGGG CAAACCGCTA TATGCCATCA ACCACCTGGT GGCCCACGTC GGTGTCGGCC TGCTGCAGGA GGAGAACACC CTGCCTGAAC ACCTGGGCGC CCTGCTGGTT TCCGGCGGCC ACACCGAGAT CCTCCGGATC AGGAGCATCA CCGACGACGT CGAGCTGCTG GGCTCCACGA TTGACGACGC TGCCGGGGAA GCCTACGACA AAGTGGCACG GCTCCTGGGG CTCGGCTACC CGGGCGGCCC GGCCATCGAC AAACTAGCCC GGACAGGCAA CGCCAAGGCC ATCCGGTTCC CGCGCGGACT GACGCAGCCC AAGTACATGG GCACCGCGGA CGAACCCGGC CCGCACCGCT ACGACTGGTC CTTCAGCGGA TTGAAGACCG CCGTCGCCCG TTGCGTGGAG CAGTTCGAAG CCCGGGGCGA CGAAGTGCCG GTCGCGGACA TCGCGGCCGC CTTCCAGGAG GCCGTTGTGG ACGTCATCAC GTCCAAGGCG GTGCTCGCCT GCACGGAAAA CGGCATCACC GAGCTCCTGC TGGGCGGCGG GGTAGCCGCG AACTCGCGGC TGCGCCAGCT CACCGAACAG CGGTGCAGGG CGGCCGGAAT CCGGCTGACT GTTCCGCCGC TTGAGCTGTG CACAGACAAC GGTGCCATGG TGGCCGCCCT CGGTGCCCAG CTGGTCATGG CCGGCATCGA GCCCAGCGGC ATCAGCTTCG CCCCGGATTC GTCCATGCCG GTCACGACGG TTTCGGCGTA G
|
Protein sequence | MNRTQPLVLG IESSCDETGV GIVRGTALLS NTVSSSMEEH VRFGGVIPEI ASRAHLDAFV PTLQEALADA GVQLDDVDAI AVTSGPGLAG ALMVGVCAAK ALAVATGKPL YAINHLVAHV GVGLLQEENT LPEHLGALLV SGGHTEILRI RSITDDVELL GSTIDDAAGE AYDKVARLLG LGYPGGPAID KLARTGNAKA IRFPRGLTQP KYMGTADEPG PHRYDWSFSG LKTAVARCVE QFEARGDEVP VADIAAAFQE AVVDVITSKA VLACTENGIT ELLLGGGVAA NSRLRQLTEQ RCRAAGIRLT VPPLELCTDN GAMVAALGAQ LVMAGIEPSG ISFAPDSSMP VTTVSA
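The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch of that clean-up with a simple GC-fraction calculation. The helper names are hypothetical, and the exact method the database used to compute its reported GC content is not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence above.
first_blocks = "ATGAACCGAA CACAGCCCCT"
print(len(clean_sequence(first_blocks)))  # 20
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content field, and `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields.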