Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_2377 |
Symbol | |
ID | 7293850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 2667805 |
End bp | 2668875 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643590784 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002488431 |
Protein GI | 220913122 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.000000976861 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGACT GGGCCGGCAT CGTCTGGCTG GCGTTCCTGC TGCTCGGCAA CGCGTTCTTC GTGGCGGCCG AGTTCGCCAT CATGTCCGCC CGGCGCAGCC AGATCGAACC CTTGGCGGAG GCCGGGTCGA AGCGGGCCCA AACCACCCTG AAGGCGATGG AGAACGTCTC GCTGATGCTG GCCTGCGCGC AGCTTGGCAT CACGGTGTGC TCGCTGCTGA TCCTGCTGGT GGCCGAGCCT GCGATCCACC ACCTCCTGGC CGCGCCGCTG GAGCTGGTGG GCCTGCCCGT GGAAGTGGCC GATGTTGCCG CGTTCGCTGT GGCCCTGATG TTCGTCACGT TCCTGCACGT CACCTTTGGC GAGATGGTGC CCAAGAACAT CTCGGTATCG GTCGCTGACA AGGCGGCGAT GTTCCTGGCC CCGCCGCTGG TCTTCGTGGC ACGGCTGGTC CACCCCGTCA TCTCGGTGCT GAACTGGTCG GCGAACCATA TCCTCAAGCT GTTGCGGATC GAGCCCAAGG ACGAGGTCAA CTCGTCCTTC ACGCTCGAGG AGGTCCAGTC CATCGTGCAG GAATCCACCC GGCACGGACT GGTGGACGAC GACGCCGGCC TGATCACCGG CGCCCTTGAG TTCTCCGAAT ACACGGCTGG AGACATCATG GTTCCGCTGG ACAGCCTGGT CATGCTCAAG GCTGCGACTA CTCCGGTGGA GTTTGAAAAG GCTGTCAGCC GCACGGGTTT TTCCCGGTTC CCCATGCTGG ATGAGGACGA TCTCCTGTAT GGCTACCTGC ACGTCAAGGA TGTGCTGTCC ATCCCTCCGA CGGCGTACGA GCTGCCCATT GCGGAAAGCC GCGTCCGTTC CCTGGCCAAC CTGGCCCTGG GCGATGAAAT CGAAAAGGCC ATGTCCGTCA TGCAGCGGAC CGGCTCGCAC CTTGCCCGCG TCATCGGCAA GGACGGCAAT ACCCAGGGCA TCCTGTTCCT CGAGGATGTC ATTGAACAAC TCGTCGGCGA GATCCGGGAC GCTACCCAGG CCACCGGCAT CCGACGGCTG GGGCAACCCA ACGGGGGATA G
|
Protein sequence | MSDWAGIVWL AFLLLGNAFF VAAEFAIMSA RRSQIEPLAE AGSKRAQTTL KAMENVSLML ACAQLGITVC SLLILLVAEP AIHHLLAAPL ELVGLPVEVA DVAAFAVALM FVTFLHVTFG EMVPKNISVS VADKAAMFLA PPLVFVARLV HPVISVLNWS ANHILKLLRI EPKDEVNSSF TLEEVQSIVQ ESTRHGLVDD DAGLITGALE FSEYTAGDIM VPLDSLVMLK AATTPVEFEK AVSRTGFSRF PMLDEDDLLY GYLHVKDVLS IPPTAYELPI AESRVRSLAN LALGDEIEKA MSVMQRTGSH LARVIGKDGN TQGILFLEDV IEQLVGEIRD ATQATGIRRL GQPNGG
|
| |