Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3326 |
Symbol | |
ID | 4443968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3735402 |
End bp | 3736763 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639691149 |
Product | extracellular solute-binding protein |
Protein accession | YP_832801 |
Protein GI | 116671868 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000398294 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGCTAT TTTCCCGCCC TGCCTCAGCC GCCAGTCGTG GCGTAGCAGA GGCCACATCC GCCCGGGCCG GGCGGAGGCT TCGCCGAACC GGCGCAGTCG CCGCAGCTGC CGCCGTCGTA CTTGCCCTCA GTGCCTGCGG CGGGGGAGCC GCCCCGCAAA GTGCCGATGG CAAGGTTGAA CTCCGCTTCT CCTGGTGGGG AGGAGACAAG CGGGCGCAAC TGACGCAGGC CGCGATCGCG GCATTCGAGG CTGAGAACCC GAACATCAAG ATCAAGCCGG AGTTCGGCGA CTGGAGCGGT TACTGGGACA AGCTCGCCAC GCAGGTTGCT GCCAACGACG CCCCGGACAT CATCCAGATG GACGAAAAAT ACATCACGGA GTACTCCAGC CGCGGCGCCC TGCTGGACCT TTCCAAGTAC GACATTGACA CGTCAAAGTT TGACGAAGCC GCCCTCAACG CCGGGAAGAG CGAGGACGGC CTGACGGGGA TTGCCGCCGG CATCAACGCT GCAACCATCC TGGCCAACCC GGCAGTCTTC AAGGCCGCAG GCGTTGCGCT GCCGGACGAC AAGACCTGGA CCTGGGAGGA CTTCGAGCGC ATCGCTGCCG AGGTCACTGC GAAGTCGCCA AAGGGCACCT ACGGCGCTGC CGCCTACGGC ACCGATGAAG CCTCGCTCGG CGTATGGCTG CGGCAGAACG GCAAGTCGCT GTACACCAGC GACGGCAAGC TGGGCTTCGA GCCGGGCGAC ATCGCCGAAT GGTGGGCGTT CCTGAAGGAA CTCAGCGAGA AGAAGGCCGT GCCCTCAGCC TCGGAGGTGG TTGAGGCCGA GGCGGCACCG CTGGACCAGA GCGGCCTGGC GACAGGCAAG AACGGGCTCG CGTTCTGGTG GTCCAACCAG CTGCCGGCGC TGGAGAAGGC TGCCGGCGGA GAACTTCAGA TCCTGCGGTT CCCGTCCAAG ACCGGCAGCT CCGCGGACGC CAAGCTTTGG TACAAGGCCT CGCAGTTCTG GTCAGCTTCT TCACGCACCA AGCATCCGGA AGAAACCGCG AAATTCATCA ACTTCCTGGC CAACAACACC AAGGCCGGCG AAACCCTCCT GGCCGACCGC GGCGTTTATC CCAACTCCGA TGTCCGGGCG GCAATCGCAC CCAAGCTGAC CCCCGCCGAC ATCAAGGTGG TCAAGTTCAT TGACCAGATC AAGGGCGAAC TTGGCGAGGC TCCGGCACCG CCGCCGAAGG GCGCGGGTGC CATCCAGGAA ATCGTCAAGC GCTACACCTC GGAGGTTCTC TTCAACCGGC TGTCCACGGA GGAAGCCGGC AAGAAGGCAG TCGATGAAAT GAAATCAGCC ATCAGCAGCT AG
|
Protein sequence | MPLFSRPASA ASRGVAEATS ARAGRRLRRT GAVAAAAAVV LALSACGGGA APQSADGKVE LRFSWWGGDK RAQLTQAAIA AFEAENPNIK IKPEFGDWSG YWDKLATQVA ANDAPDIIQM DEKYITEYSS RGALLDLSKY DIDTSKFDEA ALNAGKSEDG LTGIAAGINA ATILANPAVF KAAGVALPDD KTWTWEDFER IAAEVTAKSP KGTYGAAAYG TDEASLGVWL RQNGKSLYTS DGKLGFEPGD IAEWWAFLKE LSEKKAVPSA SEVVEAEAAP LDQSGLATGK NGLAFWWSNQ LPALEKAAGG ELQILRFPSK TGSSADAKLW YKASQFWSAS SRTKHPEETA KFINFLANNT KAGETLLADR GVYPNSDVRA AIAPKLTPAD IKVVKFIDQI KGELGEAPAP PPKGAGAIQE IVKRYTSEVL FNRLSTEEAG KKAVDEMKSA ISS
|
| |