Gene Information |
Locus tag | Achl_2378 |
Symbol | |
ID | 7293851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2668872 |
End bp | 2670203 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643590785 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002488432 |
Protein GI | 220913123 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
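The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon; the function names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2668872
END_BP = 2670203
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2670203 - 2668872 + 1 gives 1332 bp, and 1332 / 3 - 1 gives 443 aa, matching the Gene Length and Protein Length fields.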
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000393714 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGTGGC TACTCCTGGC GGCAGGGCTC CTGCTGATCG CCGGCACAGG CTTTTTCGTC GCCGTCGAAT TCTCCCTGAT TGCCCTCGAC CAGCCCACGG TCCAGCGGGC TGTCGACGCC GGTGACGCCG GCGCCGTTCC CCTCCTTACC TGCCTCAAAT CGCTCTCCAC ACAGCTCTCC AGCTGCCAGT TGGGCATCAC CCTCACCACC CTGCTCACCG GCTACGTCAT GGAACCATCG GTGGGCCGCC TCCTGGAGGG GCCGCTGACG GCCCTTGGCC TGCCGGAGGT AGCTGCAGCA TCAATTTCGC TGATCCTCGC CATGGTGCTG GCAACCCTGT TGTCGATGCT CCTTGGCGAA CTCGTTCCCA AGAACATGGC CATCGCGTTG TCCTTCCCCG TCGGCAAAGC CCTGGCCAGG CCGCAACTGA TCTTCACCGC GGTCTTCAGG CCGGCCATCG TGGTCCTCAA CGGCTTTTCC AACCGGGTGC TCCACATCTT CGGGCTCGAA GCCAAGGAAG AGCTCTCCGG CGCGCGCACG CCGTCCGAGC TGGCGTCACT GGTGCGCCGC TCGGCTGCGA TGGGAACGCT CGACGCCGGT ACGGCCAACT TCGTGTCCCG CACCTTGAAT TTTTCCTCCA GGACCGCGGC CGACGTCATG ACGCCGCGCA TCCGGGTGGA AATGATCGAC GCGGACCAGC CGGTCTCCGA CATCGTTGAC GCGGCGCGCC GTACGGGATA CTCACGGTTC CCCGTGATCG GCGACTCTGC GGACGACATC AAAGGCCTGG TCCACGTCAA GAAGGCCGTG GCCGTGCCCT CGGACAGGCG GCACAAGCTG GAAGCCGGTG CCATCATGAC CGAAGTCCTC AGGGTTCCCG AGACCATCCA CCTTGACGCC CTGCTGGCGG AACTCCGCGA AGGCAACCTC CAGCTGGCGG TGGTCCTCGA CGAATACGGC GGCACCGCCG GCATTGCCAC GCTCGAAGAC CTGGTCGAGG AAATTGTGGG CGAGGTAGCC GACGAACACG ACAAGGTGCG CCCGGGGCTG CTGCAGAGCG CCTCCGGGGA CTGGTATTTC CCGGGACTCC TTCGCCCGGA CGAGTTGTCC GAGCAGATCC CGGGCCTGAC CGTCCCGGAC GAAGCAGCCT ACGAAACCGT GGGAGGCTAC GTGATGAGCA AACTGGGCAG GATCGCGGCG GTAGGGGACA CGGTGGCCGT GGACGGCGGC ACGCTGAGCG TTACCCGGAT GGACGGGCGC CGCATCGACC GTATCTGCTT CCGGCCGGCT GCCCCTGAGC CGGACGGCAA CAACGACGGG AGCCCATCAT GA
|
Protein sequence | MEWLLLAAGL LLIAGTGFFV AVEFSLIALD QPTVQRAVDA GDAGAVPLLT CLKSLSTQLS SCQLGITLTT LLTGYVMEPS VGRLLEGPLT ALGLPEVAAA SISLILAMVL ATLLSMLLGE LVPKNMAIAL SFPVGKALAR PQLIFTAVFR PAIVVLNGFS NRVLHIFGLE AKEELSGART PSELASLVRR SAAMGTLDAG TANFVSRTLN FSSRTAADVM TPRIRVEMID ADQPVSDIVD AARRTGYSRF PVIGDSADDI KGLVHVKKAV AVPSDRRHKL EAGAIMTEVL RVPETIHLDA LLAELREGNL QLAVVLDEYG GTAGIATLED LVEEIVGEVA DEHDKVRPGL LQSASGDWYF PGLLRPDELS EQIPGLTVPD EAAYETVGGY VMSKLGRIAA VGDTVAVDGG TLSVTRMDGR RIDRICFRPA APEPDGNNDG SPS
|
| |
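The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how a reader might work with them, the helpers below strip the spacing, compute GC content, and translate the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`, `reverse_complement`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for codon-to-residue mapping; alternative start codons are not handled here. The gene sequence as listed begins with ATG, so it appears to be given in coding orientation already; `reverse_complement` would only be needed when extracting the minus-strand gene directly from the replicon.

```python
from itertools import product

# Standard codon order (T, C, A, G) with the matching amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement, for reading a minus-strand gene off the replicon."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above.
    fragment = clean("ATGGAGTGGC TACTCCTGGC")
    print(translate(fragment))
    print(round(gc_fraction(fragment), 2))
```

Pasting the full Gene sequence field into `clean` and passing the result to `translate` and `gc_fraction` would let the Protein sequence and GC content fields be compared against the record.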