Gene Information |
Locus tag | Arth_1817 |
Symbol | |
ID | 4445646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2035498 |
End bp | 2036718 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639689635 |
Product | major facilitator transporter |
Protein accession | YP_831307 |
Protein GI | 116670374 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
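The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (which is consistent with the values in this record) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2035498, 2036718)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both results agree with the Gene Length and Protein Length fields listed above.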
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000443507 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTATTG GCCTGTTAGC CCTAGCCCTC GGCGGGTTCG GTATCGGACT CACCGAGTTC GTGATCATGG GCCTGCTGCC CGAGGTCGCC GCAGACTTCA GCGTCAGCGA GGCCACGGCC GGCTGGCTGA TCTCCGGCTA TGCGCTCGCC GTCGTCGTCG GCGCCCTGCT GCTCACGGCG GCCGTGACAC GCTTCGAACG CAAGCCGGTC CTGGCCGTCC TGCTGGTGCT GTTCATTGCC GGCAACCTGG TCTCCGCCAT CGCCCCCGAC TACTCCATGA TGATGATCGG CCGGATTGTG GCCGCCCTGG CCCACGGCGC GTTCTTCGGC ATTGGGGCAG TCGTGGCCGC GGACATGGTG GCCCCCACTA AGAAAGCCGG CGCCATCGCC ATCATGTTCA CCGGACTCAC CGCCGCCAAC GTCCTGGGCG TGCCGTTCGG CACCATGCTC GGCCAGGCCG CCGGCTGGCG CTCCACCTTC TGGGCCATCA CGGGCATCGG CGTCCTGGCC CTCGTCGGCA TCCTGACCCT GGTCCCTAAG ACCGGCCGCG GCGACACCGC CCCCGGGAGC CTCCGCAGCG AACTGCGGGC CTTCCGCTCC GGCCAGGTCT GGCTGTCCAT CCTCGTCACC ATCCTCGGCT ACGGCGGCAT GTTCGGCGCC TTCACCTACA TCGCCTACAC CCTCACCGAG GTCACCGGCT TCGCCGCCTC CACCGTGCCC TGGCTCCTGA TCCTCTTCGG CATCGGACTG TTCATCGGCA ACACCGTGGG CGGCAAGGCG GCGGACCGGA ACGTGGACCG CACCCTTCTG GTGGTCCTGG CTGTGCTCGT GGCGGTCCTC GTGGGGTTCG CGCTGACCGC CGGCAACCAG CCCCTCACCA TCGCCTCCAT AGTCCTGCTC GGCGGCTTCG GCTTCGCGAC GGTCCCCGGA CTGCAGATGC GGGTCATGAA ATACGCCCAC AGCGCCCCCA CTTTGGCCTC CGGCGCCAAC ATCGGCGCGT TCAACGTCGG CAACGCCCTC GGCGCCTGGC TCGGCGGCGT GACCATTACC GCCGGCCTCG GCTACACCTC ACCCATCTGG GCCGGAGCCG GCATCACCCT CCTGGGCCTC GGCGTGATGG CCATCGCCGC AGCCGGCGCC AAACGCTCTA AAACGGCGGC CATTATTGGC GACAACACCT CTCAAACCGT GACTGACGCC GTCGTAGAAG CATCAATCTA G
|
Protein sequence | MPIGLLALAL GGFGIGLTEF VIMGLLPEVA ADFSVSEATA GWLISGYALA VVVGALLLTA AVTRFERKPV LAVLLVLFIA GNLVSAIAPD YSMMMIGRIV AALAHGAFFG IGAVVAADMV APTKKAGAIA IMFTGLTAAN VLGVPFGTML GQAAGWRSTF WAITGIGVLA LVGILTLVPK TGRGDTAPGS LRSELRAFRS GQVWLSILVT ILGYGGMFGA FTYIAYTLTE VTGFAASTVP WLLILFGIGL FIGNTVGGKA ADRNVDRTLL VVLAVLVAVL VGFALTAGNQ PLTIASIVLL GGFGFATVPG LQMRVMKYAH SAPTLASGAN IGAFNVGNAL GAWLGGVTIT AGLGYTSPIW AGAGITLLGL GVMAIAAAGA KRSKTAAIIG DNTSQTVTDA VVEASI
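The sequences above are printed in space-separated groups of ten letters, so they need light cleaning before any computation. The following is a minimal sketch of that cleaning plus two simple checks (GC fraction and terminal stop codon), assuming plain Python with no external libraries. The helper names are hypothetical, and the inputs are short fragments copied from the gene sequence above rather than the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between letter groups and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases are TAA, TAG, or TGA (stop codons in table 11)."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# First two groups of the gene sequence.
head = clean_sequence("ATGCCTATTG GCCTGTTAGC")
print(len(head))          # 20
print(gc_fraction(head))  # 0.5

# Last 12 bases of the gene sequence (codons GCA TCA ATC TAG).
tail = clean_sequence("G CATCAATCTA G")
print(ends_with_stop(tail))  # True
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field of this record.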