Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3387 |
Symbol | |
ID | 4444116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3808714 |
End bp | 3810057 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639691210 |
Product | extracellular solute-binding protein |
Protein accession | YP_832862 |
Protein GI | 116671929 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.278202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAT CCCTCGGCAC CGCCGCCGTC GCCGCAGCCA TCGCGCTCTC CCTCTCCGCC TGTGGGGGCG GCTCAGGCTC CTCTGCAGAA TCGGCCAAGG GCGAGCTCAG CTACTGGCTC TGGGACGCCA ACCAGCTTCC CGCCTACCAG CAGTGCGCTG ATGACTTCCA GAAGGCCAAC CCGGACATCA AGGTCAAGAT CACCCAGCGC GGCTGGGACG ATTACTGGAG CACGCTCACG AACGGGTTCG TTGGCGGCAC GGCTCCCGAC GTCTTCACCA ACCACCTGGG CCGCTACGGC GAGCTCGCCG CGAACAAGCA GCTGCTGCCC ATTGACGACG CCGTCAAGAA GGACAACGTG GACCTGTCCG CCTACAACGA GGGACTCGCG GACCTCTGGG TGGGCCAGGA CGGCAAGCGC TACGGCCTGC CCAAGGACTG GGACACCATC GGGTTGTTCT ACAACAAGGC CATGCTTTCC AAGGCCGGCG TCTCCGAAGA AGAGATGAAG AACCTCACCT GGAACCCGCA GGACGGCGGA ACGTACGAGA AGATCATTGC CCACCTGACC GTTGACAAGA ACGGCAAGCG CGGGGACGAA GCGGGCTTCG ACAAGAACAA TGTGGATGTC TACGGCCTTG GACTCAACGG CGGCGGCGAC TCCTCAGGCC AGACTGAGTG GAGCTACCTC ACCAACACCA CCGGCTGGTC ACACACGGAC AAGAACCCGT GGGGAACTCA CTACAACTAT GACGACCCCA AATTCCAGTC CTCTATCGAC TGGTTCGCAG GGCTGGTTGA CAAGGGCTAC ATGCCCAAGC TTGAAACCAC TGTTGGCGCA GCCATGGCCG ACACCTTCGC CGCGGGCAAG TCTGCCATCA ACGCCCACGG CTCATGGATG ATCGGCCAGT ACACCGGGTA CAAGGGTGTT GAGGTGGGCA TCGCTCCCAC CCCCGTGGGT CCTGAAGGCA AGCGGGCGTC GATGTTCAAC GGCCTGGCCG ACTCGATCTG GGCCGGCACC AAGAAGAAGG ACGCCGCCAT CAAGTGGGTG GAGTACCTTG CCTCAGCACC TTGCCAGGAC GTCGTCGCAT CCAAGGCTGT GGTGTTCCCG GCCCTGAAAG CCTCTTCCGA AAAAGCGGCG GAAGCATTCA AGGCCAAGGG TGTGGATGTC ACCGCCTTCA CCGAGCACGT CAAGAACGGA ACCACATTCC TGTACCCCAT CACTGACAAT ACTGCCAAGG TCAAGGGCAT CATGGAGCCT GCCATGGACG CTGTAGTATC CGGCAAAAAG CCTGCCAGCT CCTTGACCGA AGCCAACAAC CAGGTGAACG ATCTCTTCAA GTAG
|
Protein sequence | MKKSLGTAAV AAAIALSLSA CGGGSGSSAE SAKGELSYWL WDANQLPAYQ QCADDFQKAN PDIKVKITQR GWDDYWSTLT NGFVGGTAPD VFTNHLGRYG ELAANKQLLP IDDAVKKDNV DLSAYNEGLA DLWVGQDGKR YGLPKDWDTI GLFYNKAMLS KAGVSEEEMK NLTWNPQDGG TYEKIIAHLT VDKNGKRGDE AGFDKNNVDV YGLGLNGGGD SSGQTEWSYL TNTTGWSHTD KNPWGTHYNY DDPKFQSSID WFAGLVDKGY MPKLETTVGA AMADTFAAGK SAINAHGSWM IGQYTGYKGV EVGIAPTPVG PEGKRASMFN GLADSIWAGT KKKDAAIKWV EYLASAPCQD VVASKAVVFP ALKASSEKAA EAFKAKGVDV TAFTEHVKNG TTFLYPITDN TAKVKGIMEP AMDAVVSGKK PASSLTEANN QVNDLFK
|
| |