Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_0780 |
Symbol | |
ID | 7292212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 834130 |
End bp | 835665 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643589176 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002486864 |
Protein GI | 220911555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.142319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAG GTCCCAAGGC GGCAGCGGCC GCCCTCATGC TGAGTGCCTC ACTGGCCCTC ACTGCCTGCG GCGGCGGCGG CAACGCCGGC GCGGGGGCGA ATGCCGGCGG CACCACGGTC ACGGCACTCA CGCTGGGAAC CCTGCGTGAC TTGACCTCGT GGGATCCGGC CCAGGCCCAC GTGGGCCACG CGCTGCAGCC GTATCAGGCG GCCTACGATT CGCTGATTCT TCGCGAGCCT GACGGCAAGC TCAGCCCCAT GCTTGCCACG GCATGGAAGT ACAACGACGC CCGCACCAAG CTGACCGTGG ACCTCCGGAC GGACGTCACC TTCAGTGACG GGGCAAAGTT CGACGCCGAA GCCGCCAAGG CCAACCTGGA CCACTTCAAG AAGGCCAACG GTCCGCAGAT GGCGCAGCTC AGCACCGTTT CCGACGTCGC CGTGGTGGAC GCGGACACCG TCGACATCAA CTTGAGCGCA CCGGAACCGG CGCTGGAGTT CTTCCTCAGC CAGGCGGCCG GCCTGATGGG CAGCCCCAAG GCGCTGGGCA CAGACGCCAT CAAGACGGAG CCTGTCGGCT CAGGTCCGTA CGTCATGGAC AAGGCGGCTT CGGTCAAGGA TTCCCAGACC GTCTTCAATG CCCGCGAGGG GTACTGGAAC AAGGACCTGC AGAAGTACAA GAAGCTGACG CTCAAGATCC TCCTGGACCC CACCGCACGC ACGAACGCGC TGGTTTCCGG CCAGATCGAC GCCACCCTCC TGGATCCCAA GAACGGCAAG CAGGCCGAGG GCGCCAAGAT GAAGCTGGAG GCCAACCAGG TGGACTGGGC CGGCCTGCTC CTGCTGGACC GCGACGGTGC GAAGAACCCG GCGCTGGCCG ACGTGAAGGT CCGCCAGGCC ATCAACTACG CGTTCGACCG CAAGACCATC CTGGACCAGG TGATGCTGGG CCAGGGGACG CCGACGTCGC AGCCGTTCGG CAAGGACAGC GGGGCATGGT CCGAAGAGCT GGAGAACTAC TACAGCTACG ATCCGGAGAA GGCCAAGCAG CTGCTGAAGG AGTCGGGCTT CGAAGGCAAG GTCAGCATCG ACGTTCCCAC GCTTCCCGGC GCCGAAACCC TGATCTCCGT GCTCAAGCAG CAGCTCGCGG ACGTCGGCAT CACCCTGAAC CCCGGTGCCG CCATCACCAA CACCTTCACG GCGGACGTCG CGGCCCAAAA GTACAGCGCC ATGTACTTCA ACCTCTTCCA GGGCGAGCCC ACGGTGGCCA TCGACCAGAT CGTTTCCACC AAGGCCCTGT ACAACCCGTT CAAGACGACG ACTCCTGAAC TGGAAGACAA GATCAAGGCT GTTCGCACCG CTGGGGATGC CGCCGGCGAA GAAGCCAAGG AAGTCAATAA GTACGTAGTG GAGCAGGCCT GGTTCGCGCC GCTGTTCCGT GTGAACCAGA TGTACTACCA CAACGACAAA GTCAACGTGG TACCGCAGGC GCAGCAGGCC GTCCCCTCCA TCTACAACTA CTCGCCTGCC AAGTAG
|
Protein sequence | MKLGPKAAAA ALMLSASLAL TACGGGGNAG AGANAGGTTV TALTLGTLRD LTSWDPAQAH VGHALQPYQA AYDSLILREP DGKLSPMLAT AWKYNDARTK LTVDLRTDVT FSDGAKFDAE AAKANLDHFK KANGPQMAQL STVSDVAVVD ADTVDINLSA PEPALEFFLS QAAGLMGSPK ALGTDAIKTE PVGSGPYVMD KAASVKDSQT VFNAREGYWN KDLQKYKKLT LKILLDPTAR TNALVSGQID ATLLDPKNGK QAEGAKMKLE ANQVDWAGLL LLDRDGAKNP ALADVKVRQA INYAFDRKTI LDQVMLGQGT PTSQPFGKDS GAWSEELENY YSYDPEKAKQ LLKESGFEGK VSIDVPTLPG AETLISVLKQ QLADVGITLN PGAAITNTFT ADVAAQKYSA MYFNLFQGEP TVAIDQIVST KALYNPFKTT TPELEDKIKA VRTAGDAAGE EAKEVNKYVV EQAWFAPLFR VNQMYYHNDK VNVVPQAQQA VPSIYNYSPA K
|
| |