Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1916 |
Symbol | |
ID | 4445570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2156584 |
End bp | 2157744 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689726 |
Product | ABC sugar transporter, periplasmic ligand binding protein |
Protein accession | YP_831398 |
Protein GI | 116670465 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0176531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGATCC GCAGGCTTCC CATTGTGGCT TTGGCCGCAG CCCTGTCCCT TTCCGTCGCT TCCTGCAGCA GTTCGGCAAC GGGTACCTCA AACGCTCCCG ATGCAGGGGT GTCCGAGAAG GCCCAGCAGG CCCTTGACAA GATCAAGGGA CAGGTCCTGA GCAAGGGCCC CAACGGCGAG ACGCCCTCCC CGTCATCTGT TGCGGATCTG ACTCCCGCCG AGATCGAGAA GGTCAAGGCA CTCAACGCCA AGGCCGCAAT CGTGATGCAC TACGGCGGCA ACGACTGGGC GACCGCCCAA ACTAACGGGC TGAAGAGCGA ATTCGAAAAG CTCGGTATCA AGGTAATCGC CACAACCGAT GCGAACTTCA AGCCGGACAA GCAGGTTTCG GACATCGAGA CTGTCATGAC GCAGGACCCG GATGTCATCG TCTCCATCCC CACGGACCCC GTGGCAACGG CCTCCGCCTA CAAGAAAGTC GCGGCCGCCG GAACCAAGCT CGTCTTCATG GACAACATTC CCCAAGGGCT GACTGCCGGC AAGGACTACG TCTCTGTCGT CTCCGCCGAC AATTACGGCA ACGGCGTCGT TTCCGCCCAC CAAATGGCCA AGGCCATCGG TGGCAAGGGA AAGATCGGAC TCGTCTTCCA CCAGGCCGAC TTCTTCGTGA CCAAGCAGCG CTATCAGGGA TTCAAGGAAA CGATCACCAA GGAGTACCCC GACATCAAGA TCGTCGAGGA AAAGGGCATC GCCGGGCCGG ACTTCGCAGG AGACGCCCAG GCCGCTGCGA ACGCCATGCT GAGCAAATAC GCCGACCTGT CGGGCATCTG GGCGGTCTGG GACGTCCCCG CAGAAGGTGT CATGGCAGCA GCCCGTGCTG CGGGCCGACA GGATTTGAAG ATCGCCACTG AGGACCTTGG CAAGAACGTC GCCATCGCCT TGGCCAAAGA CCAACTCATC GTCGGCCTCG GGGCCCAGGT TCCGTTCGAC CAGGGCGTTA CGGAGGCCCG GCTGGCCGCA GGAGCGCTCA TCGGGAAAGA AGCGCCGGCA TATGTGGCGC TGAGTGCACT CCCCGTTGAC CATTCCAACG TCCTCGAAGC CTGGAAGCAG GTTTACCACG AGGACGCTCC CAAGGACATT CAAGAGTCCT ACAAAAAGTA G
|
Protein sequence | MMIRRLPIVA LAAALSLSVA SCSSSATGTS NAPDAGVSEK AQQALDKIKG QVLSKGPNGE TPSPSSVADL TPAEIEKVKA LNAKAAIVMH YGGNDWATAQ TNGLKSEFEK LGIKVIATTD ANFKPDKQVS DIETVMTQDP DVIVSIPTDP VATASAYKKV AAAGTKLVFM DNIPQGLTAG KDYVSVVSAD NYGNGVVSAH QMAKAIGGKG KIGLVFHQAD FFVTKQRYQG FKETITKEYP DIKIVEEKGI AGPDFAGDAQ AAANAMLSKY ADLSGIWAVW DVPAEGVMAA ARAAGRQDLK IATEDLGKNV AIALAKDQLI VGLGAQVPFD QGVTEARLAA GALIGKEAPA YVALSALPVD HSNVLEAWKQ VYHEDAPKDI QESYKK
|
| |