Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_0063 |
Symbol | |
ID | 7291489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 64477 |
End bp | 65835 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643588462 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002486155 |
Protein GI | 220910846 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGAAT GGATCATGCT GGGCATCGGC CTGGTCCTGA CCGTCGGCAC CGGCTTCTTC GTCGCCTCCG AGTTCGCCCT GGTGAACCTG GACCGGCATG ACCTCGAAGC CCGCCAGGCC CGCGGCGAGA AACGCCTCGG CCCCACCATC AAAGCCCTCA AGATCACGTC CACCCACCTT TCGGGTGCAC AGCTGGGCAT CACCCTCACC ACGCTCCTCA CCGGTTATAC GTTCGAGCCC GCCATCAGCC GGATGCTGAA CGGCCCGCTG ACCTCGGCGG GACTGCCGGA AGCCGTGGTT CCCGCGGTCG GCTCAGTGGC CGGCATCTTC CTGGCCACCA TCTTCTCCAT GGTGATCGGC GAACTGGTCC CGAAGAACTT TGCCCTGGCC CTCCCGCTGG CCACCGCCAA GGTGGTGGTG CCGTTCCAGG CGCTCTTCAC CACGGTGTTC AAGCCCGTGA TCCTGCTGTT CAACAACACC GCCAACAAGA TCATCCGCGG CTTCGGCATC GAACCCAAGG AAGAGCTGTC CGGCGCACGG AGCGCGGAGG AGCTGAGCTC CCTGGTGCGC CGCTCCGCCG TCGAAGGCGT GCTTGACCTG GACCATGCCA CCCTGCTGCA CCGGACCCTC CGCTTTACCG AGCACAGCGC GGTGGACGTC ATGACGCCGC GGGTGCGGAT GACGGCTGTG GGTACCGCTG ACACTGCCGA GGAGATCGTC GGCCTGGCGA AGTCCACCGG CTACTCCCGC TTCCCGGTCA TCGGCAGGGA CCGGGACGAT GTGGTGGGGG TGCTGCACGT GAAGCAGGCC TTCGCCGTGG CTCTCTCCGA TCGCGCGCGC ATCACCGCCG AAGAGCTGAT GATCGAACCG CTTCGCGTAC CCGAATCGAT GGGCGTTGAC ACCCTCCTGA ACCTGCTCCG CAAGCAGGGG TTGCAGGTGG CCATCGTCTC CGACGAGCAC GGCGGCACCG CCGGGATCGT CACGCTGGAG GACCTGGTGG AAGAGATTGT GGGTGAACTG GAGGACGAGC ACGACCGTGC CCGCGTGGGC GTGGTCCGCA CCGGCCGCTC CATCACCTTC GACGCCGCGC TGCGCCCGGA CGAGCTCCTT GACCGCACCG GCATCCGCGT GCCGGACGGT GAGTACGACA CCGTCGCAGG TTTCGTGACG GACCAGCTGG ACCGGCTGCC CGAGCTTGGA GACGAGGTGG AGGTCGACGG CGGCACGCTC CGTGTGGAGC GCGCGATGGA GACCCGCGTG GAGCGGCTCC GCTTCACGCC TGCCGAATCC GGCGATGCAC CGCAAAGCCC GCACGACAGG ATCGTGGACA ACATCACCCG GGAGCTGACC CATGAGTGA
|
Protein sequence | MYEWIMLGIG LVLTVGTGFF VASEFALVNL DRHDLEARQA RGEKRLGPTI KALKITSTHL SGAQLGITLT TLLTGYTFEP AISRMLNGPL TSAGLPEAVV PAVGSVAGIF LATIFSMVIG ELVPKNFALA LPLATAKVVV PFQALFTTVF KPVILLFNNT ANKIIRGFGI EPKEELSGAR SAEELSSLVR RSAVEGVLDL DHATLLHRTL RFTEHSAVDV MTPRVRMTAV GTADTAEEIV GLAKSTGYSR FPVIGRDRDD VVGVLHVKQA FAVALSDRAR ITAEELMIEP LRVPESMGVD TLLNLLRKQG LQVAIVSDEH GGTAGIVTLE DLVEEIVGEL EDEHDRARVG VVRTGRSITF DAALRPDELL DRTGIRVPDG EYDTVAGFVT DQLDRLPELG DEVEVDGGTL RVERAMETRV ERLRFTPAES GDAPQSPHDR IVDNITRELT HE
|
| |