Gene Information |
Locus tag | Arth_0780 |
Symbol | |
ID | 4446702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 844374 |
End bp | 845684 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639688586 |
Product | extracellular solute-binding protein |
Protein accession | YP_830278 |
Protein GI | 116669345 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
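The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the reported gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(844374, 845684)
print(length)                  # 1311
print(protein_length(length))  # 436
```

Under these assumptions the values match the Gene Length (1311 bp) and Protein Length (436 aa) rows.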
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCT CGTCCGGAAG CAGGTTTCGA CTCAGGATTG GAGCCACCGC CGCTGCCCTC GCGATGCTGG TCACAGGCTG CGGCGCGTCA ACAGGTTCCG GCGGCGACAA GGAAGTCACA CTGCGCTTTG CCTGGTGGGG CAACGAGTAC CTCAACGCCC AGACCCAAAA GGTGATCGAC GCGTTCGAGG CCGAACACCC CAACATCAAG ATCAAAGCGG CCCCTGGGGA ATGGAGCGGA TACTGGGACA AGCTGGCCAC CACCACCGCC GCCAATGATG CACCGGACGT GATCCAGATG GATCAGAAAT ACATTGCCGA ATACGGTGGC CGCGGAGCCC TCCTTGACCT GGCCAAGCAA GACGGCATCG ACCTGTCGAA AATGGACAAG GAACAGCTCG CCTCCGGCCA GTACGACAAT GCCCAGTACG GACTCAGCAC AGGCAAGAAC GCCTACGTGG TCATGGCGAA CACCAAGGTA TTCGAGGCGG CCAACGTTCC GCTTCCCGAC GACGCCACCT GGACCTGGGA TGACTTCAAC GAAATCGCCA CGAAGCTCAC CAAGGCGGGA GGGGGTACAA ACTTCGGGGC TGCATACGGC AGCAACGAAG CCGACCTCAT CATCTGGCTT CGTCAGCACG GCGAAAACCT GTACTCACCG GACGGCAAGC TGGCCTTCAA CGCCGGGACT GCGGCTTCGT TTTGGGAACG CCTGAAAAAG CAACGGGATT CCCAGGCAAG CCCGCCGGCC ACCGTCGCTA CGGAGGACGC AGGCGCCGGT TTGGAAGAAA GCCTGTTCGG GACCAATAGG ATCGGCATGG CGTGGTGGTG GACCAATCAG CTCGGATCCC TGGAGACCAC CACGGGAAGC AGCATCAAGA TGCTCCGCGC ACCGAGCACG GACGGAAAAG CGGCTGACAA CGGCATGTAC TACAAGCCCT CAATGTTCTG GTCCGCCTCT TCGAGGTCCA AGAACCCGGA GGCAGCGGCA ACATTCATCA ACTACCTGGC AAACAGCCCG GAGGCTGGAT CGATCCTCAT GACCGACCGC GGGGTCCCTG CCAACTCTGA AGTTCTGGCG GCCATTACGC CGAAGCTGAA GACGGCGGAC TCGACGGTGG TCGGCTTCCT CCAGGACATC AAACCTGAGA TGGCTGAGGC GCCCCCTGTA CCGCCGGTCG GGTCGGGCAG CGTGCAGAAT GTCATCAAGC GGTACACCGA CGAAGTCCTC TATGACCGTA AGTCACCTTC AGCGGCCGCT GAGGAGTTCA AGAAGGAAGT CGAGGGGATG CTGGCCTCAG CCCGCAAATG A
Protein sequence | MKISSGSRFR LRIGATAAAL AMLVTGCGAS TGSGGDKEVT LRFAWWGNEY LNAQTQKVID AFEAEHPNIK IKAAPGEWSG YWDKLATTTA ANDAPDVIQM DQKYIAEYGG RGALLDLAKQ DGIDLSKMDK EQLASGQYDN AQYGLSTGKN AYVVMANTKV FEAANVPLPD DATWTWDDFN EIATKLTKAG GGTNFGAAYG SNEADLIIWL RQHGENLYSP DGKLAFNAGT AASFWERLKK QRDSQASPPA TVATEDAGAG LEESLFGTNR IGMAWWWTNQ LGSLETTTGS SIKMLRAPST DGKAADNGMY YKPSMFWSAS SRSKNPEAAA TFINYLANSP EAGSILMTDR GVPANSEVLA AITPKLKTAD STVVGFLQDI KPEMAEAPPV PPVGSGSVQN VIKRYTDEVL YDRKSPSAAA EEFKKEVEGM LASARK
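The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene is on the minus strand) and uses the codon assignments of translation table 11, which are the same as those of the standard code. Alternative start codons, which table 11 also allows, are not handled here. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace,
    and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("ATGAAGATCT CGTCCGGAAG CAGGTTTCGA"))  # MKISSGSRFR
```

The output corresponds to the first ten residues of the protein sequence; passing the full gene sequence to `translate` applies the same procedure to the whole CDS.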