Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2378 |
Symbol | |
ID | 4444992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2665099 |
End bp | 2666454 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639690186 |
Product | extracellular solute-binding protein |
Protein accession | YP_831857 |
Protein GI | 116670924 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.612639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCGCC CTGCCAAAAT ACTTGCGGCG CTGATGTCGG CGGCAGCGCT GCTGGCCACA TCAGCCTGCT CCGCGGAGAA ACCGGCCGCC GAAGACCGGA CCCTGAAGAT CGTCTACCAG AAGACTGACT CGTTTTCTGC CCTGGACACC TTGTTCAAGG ACGCCAAGAA GGACTTCGAA GCTGCCAACC AGGGCACCAA AGTGGAGCTG CAGCCCATCG AGGCCAACGA TGACGACTAC GGCACCAAGC TGGCGCTGGC CCTCCGGTCG TCCGAGACCG CCCCGGACGT CTTTTACGAG GACACCTTCA AAGTGAGGTC CGATGTCGAC GCCGGATACC TCCTGAAACT GGACGGGTAT CTCGAAAAAT GGGACGACTG GAAGTTGTAC AACGAGGCCG CCAAAGCTGC CGGCACAGGA GACGACGGCG GGATCTACGC TGTGCCCCTG GGAACCGACA CCCGCGCCAT CTGGTACAAC AAGAAAGTCC TGCAAAAAGC AGGCATATCG GTTCCCTGGC AGCCCCGGAG CTGGGACGAA ATACTCGAAG CCGCCCGCAA GATGAAAGCG GCGGACCCGT CCCTGGTCCC CTTCAACATG TATGCCGGCA AGGCCACCGG TGAAGGAACC GTCATGCAGA GTTTCTACGA ACTGCTGTAC GGCACGGACA GCGAACTCTA TGACCAGCAG GAGAAAAAAT GGGTGATCGG TTCCCGGGGG TTCACCGATT CCCTGGCTTT CCTGAAGACA CTCTACGACG AAGGACTTGC CGTCACGCCT GCCGAGGCGC TTGACGCCAA CGTCTGGAAG AAGGTCTTCG GCGAATGGCT GCCCAAGGGC AAGATGGGCG CCACCGTGGA AGGTTCCTAC ACGCCGTCGT TCTGGCAGAA GGGCGGCAAC TACGAATGGG CAGGCTATGC CGAGGACATG GGGGTGGCGA AATTCCCCAC CCAGCGTGGC CAGGAACCCG GCGGCGTCAG CATGTCCGGT GGCTGGACGC TGGCCGTCGG AGCCGACTCC AAAAACCCGG ACCTGGCGTT CAAGTTCCTT TCCGAGGCCG TGAGCAAGAA GAACTCGCTG GCGTTCACCG TGTCCGGATC CCAGATCGCG GTCCGGACGG ACGTCGCCGC CGAAGCCGAG TACCTGGCGG CAAATCCGTT CGTCAAAGAC GTCTCCGAAC TCGTATCCGT CACCCACTAC CGGCCCGCCA CGGCGGACTA TCCGCGGATC TCCGCCGCGG TCCAGGAGGC AACCGAAGCC GTGATCACCG GTGCCCTCTC ACCGCAGGAG GCAGCCGCGC AGTACGACAA GACAGTCAGG GACCAGGTGG GTGACGCCAA GGTCCTGCAG AAATAG
|
Protein sequence | MHRPAKILAA LMSAAALLAT SACSAEKPAA EDRTLKIVYQ KTDSFSALDT LFKDAKKDFE AANQGTKVEL QPIEANDDDY GTKLALALRS SETAPDVFYE DTFKVRSDVD AGYLLKLDGY LEKWDDWKLY NEAAKAAGTG DDGGIYAVPL GTDTRAIWYN KKVLQKAGIS VPWQPRSWDE ILEAARKMKA ADPSLVPFNM YAGKATGEGT VMQSFYELLY GTDSELYDQQ EKKWVIGSRG FTDSLAFLKT LYDEGLAVTP AEALDANVWK KVFGEWLPKG KMGATVEGSY TPSFWQKGGN YEWAGYAEDM GVAKFPTQRG QEPGGVSMSG GWTLAVGADS KNPDLAFKFL SEAVSKKNSL AFTVSGSQIA VRTDVAAEAE YLAANPFVKD VSELVSVTHY RPATADYPRI SAAVQEATEA VITGALSPQE AAAQYDKTVR DQVGDAKVLQ K
|
| |