Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3146 |
Symbol | |
ID | 4444259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3530278 |
End bp | 3531552 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639690972 |
Product | extracellular solute-binding protein |
Protein accession | YP_832624 |
Protein GI | 116671691 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0629231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGTC CGAACGCAAC GAAGGCAGCC GCCATAGGCC TCGCCGCAGC GCTGCTGATG ACCGGCTGCG GCAGGGACAC CGCGGGTTCG TCCCCAGCGT CATCGGCCAA GCCCATTGCC TCAGGCCAGG CATCCGGCAC CATTACCCTG TGGGCCCAGG GCAGCGAAGG CGAAGCCCTT CCGGCACTGC TCAAGGAGTT CGAGGCCGAG AATCCCGGCG TCAAGGTCAA CGTCACAGCC ATCCCCTGGG ACGCGGCCCT CAGTAAGTAC CAGACCGCCA TCGCCGGCGG GACGACGCCG GACGTCGCCC AGATGGGCAC CACCTGGATG GGCGATTTCG CCAACTCGTT CGATGCCACG CCCAAAGAGA TCGACGCAAG CGACTTCTTC CCCGGCTCGG TGAAGTCAAC CGAAGTCGAA GGAACCACCT ACGGGGTGCC GTGGTACGTC GACACCCGCG TGGTCTACTA CCGCAGCGAC CTCGCGGAGA AGGCCGGCAT CACCAAGGCG CCCGAAACCT GGGATGACTT CAAGGCCCTT GCCAAGGGCC TTCAAGAGAA GGCCGGGGCA AAATACGGGG TTCAACTGCC TGCCGGGGTC GCCGGCTCCT ACCTCGACAC CCTCCCGTTC CAGTGGTCCA ACGGAGCGAA GTTGATGAAC GACGACGGCA CCAAGTGGAC CCTCGACACT CCGGAAGCGG CAGAGGCCCT GAAGTATTAC TCCAGCTTCT TCGCTGATGG GCTCGCGTCC AAGGCTGTCT CCACGGGAAC CACTGCCGAG GCGTCCTTCG TGGACGGTTC CGCCCCCATG ATGATCAGCG GTCCCTGGCA CGTCGGCCTG CTCAACAAGG CCGGCGGGGC AGGATTCGAG GACAAGTACA AGGTTGCCCC GATGCCCAAG GCGAAGACCT CAACGTCCTT CGTCGGCGGC TCCAACATGG TGGTGTTCAA GAAGTCAGAG AACCGCGATT CTTCCTGGAA GCTCCTGCAG TGGCTGTCCA AGCCCGAGGT CCAGCTCAAG TGGTACAAGG CCACCGGCGA CCTCCCTTCG CAGCAGGGTG CCTGGAAGGA CCAGTCCCTG GCAGGAGACA GCAAGCTCTC GGTCTTCGGC GACCAGCTCA AGACCACCAA CAACCCGCCG GCCGTTTCCA CCTGGACCCA GGTTGCCGCC GCCGCCGACA GCGAAATCGA ACAGATCGTC AAGGCCGGCA AGGACCCCGC GGAGGCACTG AAGTCCCTGC AGCAGGCCGC AGATTCGATC GGCACCGGGA AGTAA
|
Protein sequence | MIRPNATKAA AIGLAAALLM TGCGRDTAGS SPASSAKPIA SGQASGTITL WAQGSEGEAL PALLKEFEAE NPGVKVNVTA IPWDAALSKY QTAIAGGTTP DVAQMGTTWM GDFANSFDAT PKEIDASDFF PGSVKSTEVE GTTYGVPWYV DTRVVYYRSD LAEKAGITKA PETWDDFKAL AKGLQEKAGA KYGVQLPAGV AGSYLDTLPF QWSNGAKLMN DDGTKWTLDT PEAAEALKYY SSFFADGLAS KAVSTGTTAE ASFVDGSAPM MISGPWHVGL LNKAGGAGFE DKYKVAPMPK AKTSTSFVGG SNMVVFKKSE NRDSSWKLLQ WLSKPEVQLK WYKATGDLPS QQGAWKDQSL AGDSKLSVFG DQLKTTNNPP AVSTWTQVAA AADSEIEQIV KAGKDPAEAL KSLQQAADSI GTGK
|
| |