Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4000 |
Symbol | |
ID | 4447263 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4516279 |
End bp | 4517505 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691831 |
Product | major facilitator transporter |
Protein accession | YP_833475 |
Protein GI | 116672542 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGCC AGCTCGCCGC CACGCGGCCA GCAGTAGAAG AAACCCTGGA AGCCGAGAAG GCCTCCTTGC TCAAGCAGCC CAAGGCGGTA TGGGCCACGG CCCTCGCTGC GGTATTCGCA TTCATGGGGA TCGGGCTGGT GGATCCGATC CTCCCGGCGA TCGCCAAGAA CCTGGACGCC ACCCCAAGCC AGGTGTCCCT GCTGTTCACG AGCTATTTCC TGGTCACTGC GGTGGCCATG CTGATCACCG GTTTTGTATC TTCCCGGATC GGCGGCAAGA AGACCCTGCT GATCGGCCTG TCGGTGATCG TGGTCTTCGC TTCGCTGTCC GGAATGTCAG GCAGCGTCGG TGAACTGATC GGCTTCCGTG CCGGGTGGGG CCTGGGCAAC GCGCTCTTCG TGGCCACCGC CCTCGCCGTC ATTGTGGGAG TGGCCAGCGG GGGCGCCGGC ACGGCGATCA TCCTCTATGA GGCGGCCCTG GGCCTGGGCA TTTCCCTCGG CCCGCTCCTG GGTGCCCTGC TGGGCGGCTG GCAGTGGCGG GCACCGTTCT TCGGCACCGC CGTGCTCATG GCAGCGGCCT TTATTGCCCT CATCGGGCTG CTCCCCACAA CCCCGCTGCC GGAGCGGAAG GTCAGGCTCC GCGACCCCCT GCTCGCCCTG GGCCACAAGG GACTGCGCAC GACGGCGGCC AGCGGCCTGT TCTACAACTA CGGCTTCTTC ACCATCCTGG CTTTCACGCC GTTCATCCTC GGCATGGACG CTTACGGCAT CGGCGGGGTG TTTTTCGGCT GGGGCGTGGC CGTCGCGGTC TTCTCGGTCT TTGTGGCGCC TGTGCTGCAG AACCGGTTCG GCGCCACCAA AGTGCTCACC GGCACGCTGG CCGTCCTGAT GCTGGACCTG GCAGGGCTGG GGCTGGCCGC CGGGCACTCG GTTCCCGCCG TCGTCGTGCT GGTGGTGGTT TCAGGTGCGC TGCTGGGCAT CAACAACACG GTGTACACGG AACTGGCCAT GGGCGTTTCC GACTCCCCGC GTCCAGTGGC GTCCGCCGGC TACAACTTCG TGCGCTGGAT GGGAGGCGCA CTGGCACCCT TCGCTGCCGC CCAGCTGGGG GAGCACTTCG GCCCGCAGGT TCCGTTCTTC GCCGGCGCCC TGGCCATGGT GGTTGCCATC GCGATTGCCT TTGGCGGACG CCGCTTCCTG GCGGCCCACG AACCCCACGT GGTCTAG
|
Protein sequence | MEGQLAATRP AVEETLEAEK ASLLKQPKAV WATALAAVFA FMGIGLVDPI LPAIAKNLDA TPSQVSLLFT SYFLVTAVAM LITGFVSSRI GGKKTLLIGL SVIVVFASLS GMSGSVGELI GFRAGWGLGN ALFVATALAV IVGVASGGAG TAIILYEAAL GLGISLGPLL GALLGGWQWR APFFGTAVLM AAAFIALIGL LPTTPLPERK VRLRDPLLAL GHKGLRTTAA SGLFYNYGFF TILAFTPFIL GMDAYGIGGV FFGWGVAVAV FSVFVAPVLQ NRFGATKVLT GTLAVLMLDL AGLGLAAGHS VPAVVVLVVV SGALLGINNT VYTELAMGVS DSPRPVASAG YNFVRWMGGA LAPFAAAQLG EHFGPQVPFF AGALAMVVAI AIAFGGRRFL AAHEPHVV
|
| |