Gene Information |
Locus tag | Arth_1947 |
Symbol | |
ID | 4445531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2196989 |
End bp | 2198275 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689757 |
Product | extracellular solute-binding protein |
Protein accession | YP_831429 |
Protein GI | 116670496 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
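The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the CDS is taken to end in a stop codon that is not counted in the protein length. Both readings are assumptions checked only against this record. The function names below are hypothetical, chosen for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2196989, 2198275)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

The strand does not affect this arithmetic: for a gene on the minus strand the same two coordinates bound the feature, only the reading direction differs.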
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.709657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTCAGA TTTCACGCAG GCAGGCCATC GCGATTCTCG GAGCCCTGGG CTTCGGAGCA ACTGCGGCAG CATGCGCCGG ACCCGGCGGC ACCACGGCGC CAGGCGGCGC CACCGGACCC GCCGCACCCG CCACCGGTGC CGTCACCGGC AAGGTGTCCT TCGCACATTG GCGCGGCGAG GACAAGGTGG TGTTCGACGA ACTCATCAAG CGCTTCGCGG CCTTGCACGA CGGCGTGGAC GTGGTCCAGG ACATCTCCAC CTCCAACGAC TACAACGCCC AGGGGCTGCA GAAGGTCCGC GGTGCAGCCA TCGGCGACGC GTTTGCCACC TTCCGCGGCG CCCAGTTCAA GAACTTCACC GAGGCCGGGA TCTATGCCGA ACTCAAGGGC AGTAAGGCCG CCGCCAACTA CCAGCCGGGC CTGCTGACCG CCGGGCAGTC CGGCGACAGC CAGCTGGGCC TGCCGTACCA GGTTGTCTTC CCCATGCCCA TGGCCAACGC CGACCTGTTC GATAAGGCCG GCGCGGAGAT CGCCCCCAAG GACTGGGACG GCTTCCTGGC CATGTGCGAG AAACTCGCAG CCTCCGGGGT CATTCCCATC TCCTGGCCGG GCGGCGACGT CGGAAACGGC GGCCAGCTGT TCAACTGCAT GATCGCCAAC AACGCCCCCG TGGACGACAT GTGCGCCCAG ATCGAGCAGG GCAAGCTCAA GTGCACCGAC GACTGGTTCC TCAAGATGCT CGGCCAATAC AAGGAACTGG TCCCCTACCT GCAGCCGAAC GCCACCGGCA CCGCCGTGGA GCCCGCGCAG AACCTGTTCT CCCAGGGGAA GGCTGCCATG CTGGCCACCG GCTCGTACCA CATCGCTGCC GTCCGCGGCC TGGGCGCGAC GTTCCCTATC GAGCTGGTGT TCCCCAACAC CTCAGACGGT GGCGGCAAGT TCGAAGGCGC CTACAACGCC ACGTTCATCC TGGGCGTCAA CTCAGCGAGC AAGAACCAGG CCGCCTCCGC GGCGTGGATC GACTTCCTCT CCGAGCCCGA GAACGCCGGC TACTACGCCA ACAAGACCGC CCAGCACGTG GCCGTGAGCA ACGTGGAGTA CACCAACCCG GACCTAAAGC GACTCAGCCC CTGGCTGGAC AAAAAGACGG CGCTGGCCGC CCGCTTCCAG TTCCAGAACC TAGATGTCCG CAACGCGGTG GAGGCCAGCG CCACGGCTGT CATCTCGGGC ACCAGCCCCG AGCAGGCCGC CGAAGCCGCC CAGAAGATTG TTGACGAACG GCTATGA
|
Protein sequence | MSQISRRQAI AILGALGFGA TAAACAGPGG TTAPGGATGP AAPATGAVTG KVSFAHWRGE DKVVFDELIK RFAALHDGVD VVQDISTSND YNAQGLQKVR GAAIGDAFAT FRGAQFKNFT EAGIYAELKG SKAAANYQPG LLTAGQSGDS QLGLPYQVVF PMPMANADLF DKAGAEIAPK DWDGFLAMCE KLAASGVIPI SWPGGDVGNG GQLFNCMIAN NAPVDDMCAQ IEQGKLKCTD DWFLKMLGQY KELVPYLQPN ATGTAVEPAQ NLFSQGKAAM LATGSYHIAA VRGLGATFPI ELVFPNTSDG GGKFEGAYNA TFILGVNSAS KNQAASAAWI DFLSEPENAG YYANKTAQHV AVSNVEYTNP DLKRLSPWLD KKTALAARFQ FQNLDVRNAV EASATAVISG TSPEQAAEAA QKIVDERL
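The relationship between the gene sequence and the protein sequence above can be sketched as follows. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is already listed in coding orientation (it begins with GTG and ends with TGA, even though the gene lies on the minus strand), and it uses the codon assignments of translation table 11, in which GTG is one of the permitted start codons and is read as methionine at the first position. The helper names are hypothetical.

```python
# Standard codon assignments (shared by translation table 11),
# with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Alternative start codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            amino_acid = "M"  # a start codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_30_bases = "GTGAGTCAGA TTTCACGCAG GCAGGCCATC"
print(translate(first_30_bases))  # MSQISRRQAI
```

Passing the full gene sequence to `translate` should give the listed protein sequence under these assumptions, and `gc_percent` on the same string can be compared with the GC content field.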