Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3143 |
Symbol | |
ID | 4444256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3527159 |
End bp | 3528580 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690969 |
Product | extracellular solute-binding protein |
Protein accession | YP_832621 |
Protein GI | 116671688 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.536467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGGAG CCACAAACGG GCACCTGCGG GAAATCACCC GTCGTACCGC CCTTGGTGCC CTGGGCGCGG GAATCATCGG AGCAACCGTG GCGTCCTGGC CGCGGCTCTC CGGATCGGAC ATTCCGGGCC GAGGGGACAA CAGCCTCAGT ATCGCCATCA TGGGCACCGC CGCGGACGCC GCCGCCCGCC AGCGCGCCAT CGACGCCTTC ACCCGCCTCC ACCCGGAGAT CAGGGTCAAA GTCCAGGCCA TCCAGGCCGT CGACTGGAAG GACTTCTTCA CCAAGATCCT CACCATGGTG GCCGCGGGCA CCCCGCCGGA TGTGGTCTAC GTGGCCACTG AAGGCGCCCA GCTGTTCGCT GAAAAGCTTG CCCACCCGCT GGACGAGTAC GTGCGCCGCG ACGCCGCGGA CATGGCCGAG TTCTTCGACG ACGTCCACCC CAGCCTGGTG GAGGCCTTCA TGTACAAGGG CAGCCTGTAT CAGCTCCCGA TGGACTGGAA CGCCGCCAAC ATGTACTACA ACACCACCGC GTTCGCGCAG GCAGGATTGG AGCGCCCGGC GGATGACTGG ACCCACATGG ACTTCCGCAA CAGCCTCGCC GCCATGCGGA AAGCCCGGAC CTCGGACTTC ACGCCCTACT ACTGGACCAA CCGGCTCTTC GGCGGAGTGG TGCCGTGGCT CTACGCGAAC GACACCAGCT TCCTGAAGGA GACCAGGTCC GCCGGTGGAG AGTGGCTTTG GGACGGCTTC TACGCCAACG ATCCCTCCCG CGGCCTCCGC TCCGGCGGCT ACCAGTGGCT GGAACCCAAC GCCAATGACG ACCGCGTGTT CGAGTCCTTC GACTACCTCC GCGGACTGGT CAAGGACGGG CTGGGCGTCC GCCCCGAGGA AGGCGGCGGC AGCTCACTGG TGGGACTGTT CGCATCCAAC CGCATCGGGA CCACCCCCGC CGGCGGCTAC TGGGTGCAGG GCCTGCACGA AGCCGGGATG GGCGAAAGCG ATTTCGACGT GCAGTTCTTC CCGCGCTGGA AGAGCCAGCG CCACCAGTTC GGCACCGCGG GCTACGCGAT CATGAAGACC GCGAAGGACA AGGACGCCGC CTGGGAATGG ATCAAGTTCA GTTCCAGCCG CGAGGCCATG GAACTGATTT TCCCCAACCC GATTACGACG CCGGCGCGCC GCTCCATGGT GAACGAGCAG CTTTACGCGG GCAAGGGGCC CGCCCATTGG AAGGTCTTCT ACGACACCCT GGACCGTTTC CCCACCACCG GCCCCATTCC GGCACCACCC CAGCAGGCGG CCGTCGAAAC GGCCCTGATG AAGAACGTAT CGCTCGCAGT CAGCGGCGAC GAGCGCCAGC TCAAACAGGC CCTCGCCTCC ATGCAGCGCG ACCTTGAACT GGCCCTGAGG AGGCAGTCAT GA
|
Protein sequence | MTGATNGHLR EITRRTALGA LGAGIIGATV ASWPRLSGSD IPGRGDNSLS IAIMGTAADA AARQRAIDAF TRLHPEIRVK VQAIQAVDWK DFFTKILTMV AAGTPPDVVY VATEGAQLFA EKLAHPLDEY VRRDAADMAE FFDDVHPSLV EAFMYKGSLY QLPMDWNAAN MYYNTTAFAQ AGLERPADDW THMDFRNSLA AMRKARTSDF TPYYWTNRLF GGVVPWLYAN DTSFLKETRS AGGEWLWDGF YANDPSRGLR SGGYQWLEPN ANDDRVFESF DYLRGLVKDG LGVRPEEGGG SSLVGLFASN RIGTTPAGGY WVQGLHEAGM GESDFDVQFF PRWKSQRHQF GTAGYAIMKT AKDKDAAWEW IKFSSSREAM ELIFPNPITT PARRSMVNEQ LYAGKGPAHW KVFYDTLDRF PTTGPIPAPP QQAAVETALM KNVSLAVSGD ERQLKQALAS MQRDLELALR RQS
|
| |