Gene Information |
Locus tag | Arth_0090 |
Symbol | |
ID | 4447458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 92050 |
End bp | 93375 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639687885 |
Product | extracellular solute-binding protein |
Protein accession | YP_829591 |
Protein GI | 116668658 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.214251 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAC CCCAGTCCTC CCGGCGCTCC TTTCTTGCCC TTGCCGCCCT TGCACCTTTC GCCGCCATGG TGACCAGTGC CTGCGGGACA TCGGGGCCGG GCGCCTCCAG CGGCGGGGGT GCCAGCATGT GGTACCTCTC CGGTGAGCCG AACCAGACCA CCATGCAGAA GGCGGTGGAT GCGTTCGGTT CAGCGAACCC GGACAACAAG GTCACAGTGA CCTACTTCCA GAACGACGCA TACAAAACCA AGATCAAAAC GGCGATCGGC GCCGGCCAGG CGCCGACCAT CATCTATGGC TGGGGCGGCG GGACCCTGAA GACCTATGCC GAGGCTAAGC AGGTTGAAGA CCTGACCAGC TGGTTGGCGG AGAACCCGGA CCTGAAGGAC AAATTCTTCC CCGCATCCTT CGGCGCCGCC ACCGTGAACG GTAAGGTCTA TGCGCTGCCC AACCAGTACG TGGCCCCGAT CGTGCTGTTC TACAACAAGG AACTGTTCGA AAAGGCCGGC GCCCAGCCGC CAAAGACCTG GGACGACATC ATGTCCCTCG TCAAGACCTT CAACAACATG GGTGTGGCGC CCTTCTCCCT TGGTGGACAG TCCCGGTGGA CCTCCATGAT GTGGCTGGAG TACCTGCTCG ACCGCATCGG CGGAGCCGAA GTCTTCACAG CCATTTTCGA AGGCAAGCCC GATGCATGGA AAGATCCAGC CGTGATCGAG ACGGGCACCA AGATCCAGGA GCTCGTCTCG GCGGATGGCT TCATCAAGGG CTTCTCATCC ATCGCTGCTG ACTCCAATGC TGACCAGGCC CTTCTGTTCA CGGGCAAAGC AGCCATGATG CTCCACGGTT CCTGGACCTA CGGCGCTATG AAGAAAGGCG GCCAGAACTT CGTCCAGGAC GGCAAGCTCG GCTTCGTCCA ATTCCCGGTC GTTGCCGGCG GCAAGGGCGA TCCAAAGAAC GGCGTGGGAA ACCCCGCCCA GTACATGTCC ATTTCCTCAA AGGCCTCTGA AAAGGAAAAA GAGACGGCGA AGAAGTTCTT CAAGGACGGC ATCCTGACCG ACACGGTCAT AGACACCTAC ATCAATTCCG GGTCCGTTCC CATTGTCAAC GGCATCGAGG ACAAGCTCAA CACGTCTCCG GACAAGGACT TCCTGAACTT CGTCTACGAC CTGGCCAAGA ACGCACCGAA CTTCCAGCAG TCGTGGGACC AGGCACTCAG CCCGACCGCC GCCGAAGCCC TGCTGAACAA CATCGACCAG TTGTTCCTCA AGTCGATCAC GCCTCAGCAG TTCGCCGAGA ACATGAATGC CACCCTCGGA AAATGA
|
Protein sequence | MKQPQSSRRS FLALAALAPF AAMVTSACGT SGPGASSGGG ASMWYLSGEP NQTTMQKAVD AFGSANPDNK VTVTYFQNDA YKTKIKTAIG AGQAPTIIYG WGGGTLKTYA EAKQVEDLTS WLAENPDLKD KFFPASFGAA TVNGKVYALP NQYVAPIVLF YNKELFEKAG AQPPKTWDDI MSLVKTFNNM GVAPFSLGGQ SRWTSMMWLE YLLDRIGGAE VFTAIFEGKP DAWKDPAVIE TGTKIQELVS ADGFIKGFSS IAADSNADQA LLFTGKAAMM LHGSWTYGAM KKGGQNFVQD GKLGFVQFPV VAGGKGDPKN GVGNPAQYMS ISSKASEKEK ETAKKFFKDG ILTDTVIDTY INSGSVPIVN GIEDKLNTSP DKDFLNFVYD LAKNAPNFQQ SWDQALSPTA AEALLNNIDQ LFLKSITPQQ FAENMNATLG K
|
| |
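The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (the record does not state this explicitly) and relies on the gene sequence ending in a TGA stop codon, which is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record above.
start_bp = 92050
end_bp = 93375
gene_length_bp = 1326
protein_length_aa = 441


def span_length(start, end):
    """Length of a closed interval [start, end], assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Compare the derived values with the fields reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

If either check came out false for another record, the likely explanations would be a different coordinate convention, a partial CDS, or a spliced or pseudo gene, none of which apply here according to the fields above.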