Gene Information |
Locus tag | Achl_3600 |
Symbol | |
ID | 7295081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 4002796 |
End bp | 4004073 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643592006 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002489645 |
Protein GI | 220914336 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
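The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the listed gene length) and that the final codon is a stop codon; the variable names are illustrative only:

```python
# Values copied from the record; variable names are hypothetical.
start_bp = 4002796
end_bp = 4004073

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue; the last codon is assumed to be the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1278 425
```

Both results agree with the Gene Length and Protein Length rows of the record.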
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGCCAC CTTTGCCGTC CGCCGGGACA CGACTTTCCC GGCGAACCTT CCTAGCCGCC GCAGGAGGCT CGCTGGCCGC TTTCAGCCTT GCGGCGTGCA GCCCGGCCGG AGCCCAGCCC ACCATCACGT TCCACCAGTC GAAGCCGGAA GCGGTTCCCT ACTTCCGCGA CCTCACAGCA AAATTCACGG CGTCGCAGGA CCGCTTCAGC GTCCTGCATG ACATGGCAAC GAACCTTTCC GCCAGCTTCG TCCGCAGCAG CCCGCCGGAC CTTGGCTGCC TCAACTACAA CCTGGAGATG GCGCGGTTCA TGGAGCGCGG CGCCCTCTCG GACCTGGCCG ACCTTCCGGA AGCGGCAGCC ATCCGCGGCG ACGTCCTCGA CCTCACCAAC TGGTACCCCA CCTACCCGGG CCGCACCAGC GTCATCCCCT ACTCGGTCAT GGCGGCGTCG GTCATCTACA ACCGGCGTAT CTTCGAGGCG AACGGCCTCT CGGTCCCCAC CACCTGGGAC GAGCTCATTG AGGTCTGCGA ACGCCTCAAG GCTGTGGGGA TCACTCCGGT CTACGGCACG TTCCGGGATC CCTGGACCAT CGCGCAGGGC CTGTTCGACT ACACCGTGGG CGGGATGGTG GATGTGCGCG GCTTCTACCA GTCCATGCAC GAAGCGGGCG AGAAGGTGGG GCCGGATTCC GAGGCCTCCT TCCAGAAAAC ACTGCTGGAA CCCGTCCGGC GCATGGTCCA GCTGAAGAAA TACGTCAACC CCGATGCCGC CAGCCGCGGC TACGGGGACG GCAACACCGC CATGGCGCAG GGGCAGGCAG CCATGTACTT CCAGGGGCCG TGGGCCTTCG GCGAAATCGA AAAGGCCGGC ACCGACGTCG ACCTCGGCAC CTTCCCCCTG CCGATGACCG ACAATCCCGC CGACCTCAAG GTCCGCGTCA ACATCGACCT TTCACTCTGG GTCCCCGAGG TCTCGAACGG ACAGCAGGGG GCACGCGCCT TCATCCAGTA CCTGATGCAG CCGGAGATCC AGGACACCTA CAACGCCAAA TTCCTGGGCT TCGGAACGGT CAAGGATGCC CCGCCGGTCA CCGACCCCAG GATCGTGGAA ATGCAGAAGT ACTACGACGA GGGCCGCTTC TACATGGGCG CGTCACAGTT CATTCCCAAC ACGATTCCCG CTGCCAACTA CATCCAGTCG ATCATCGGCG GCGCCGATGC CGAGGGCACC CTGCGCCGGA TGGACGCCGA CTGGGCGCGC CTGGCGTTCC GCGCGTGA
Protein sequence | MLPPLPSAGT RLSRRTFLAA AGGSLAAFSL AACSPAGAQP TITFHQSKPE AVPYFRDLTA KFTASQDRFS VLHDMATNLS ASFVRSSPPD LGCLNYNLEM ARFMERGALS DLADLPEAAA IRGDVLDLTN WYPTYPGRTS VIPYSVMAAS VIYNRRIFEA NGLSVPTTWD ELIEVCERLK AVGITPVYGT FRDPWTIAQG LFDYTVGGMV DVRGFYQSMH EAGEKVGPDS EASFQKTLLE PVRRMVQLKK YVNPDAASRG YGDGNTAMAQ GQAAMYFQGP WAFGEIEKAG TDVDLGTFPL PMTDNPADLK VRVNIDLSLW VPEVSNGQQG ARAFIQYLMQ PEIQDTYNAK FLGFGTVKDA PPVTDPRIVE MQKYYDEGRF YMGASQFIPN TIPAANYIQS IIGGADAEGT LRRMDADWAR LAFRA
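The gene and protein sequences above are printed in space-separated blocks of 10. A minimal sketch of how one might strip that spacing, compute GC content, and translate the coding sequence follows. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, not part of any database tooling. The codon table is the standard one, which translation table 11 shares for amino-acid assignments; table 11 differs in its permitted alternative start codons, which this sketch does not handle (this gene starts with ATG, so none is needed here).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
gene_prefix = clean_sequence("ATGTTGCCAC CTTTGCCGTC")
print(translate(gene_prefix))  # prints: MLPPLP
```

The translated prefix matches the first six residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would be one way to cross-check the GC content and Protein sequence rows.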