Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3936 |
Symbol | |
ID | 4444811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4446251 |
End bp | 4448203 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691767 |
Product | major facilitator transporter |
Protein accession | YP_833411 |
Protein GI | 116672478 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCCAT CCCGCAATAG ATCCATGTCC CCCGGTGCCA TCACGGCAGT CCTTGCGCTG AGCGGCACGG TGGTGGCGCT GATGCAGACC CTCGTGGTGC CGCTCCTTCC GGATTTCCCC GGAATCCTCG GCGTTACGGC CGACGACGCG TCCTGGCTGG TCACCGCCAC GCTGCTCTCC AGCGCCGTGG CCACCCCGAT CGTGTCCCGC AGCGCGGACA TGTACGGCAA ACGTAAAATG ATGGTGATCT GCCTCGCCAT CATGGTTGCC GGCTCCATCG TGGCCGCGGT GGGGGGAAGC TTCCTCTGGC TTATCGTCGG CCGGGCACTG CAGGGGTTCT CCTCGGCCTT GATTCCCGTA GGCATCAGCA TCATGCGCGA CGAACTGCCC AAGGAAAAGA TGGGATCCGC CGTAGCCCTG ATGAGCGCCA CGCTGGGCAT CGGCAGCGCA CTGGGCCTCC CGCTGGCCGG GCTGCTTTAC GAAAGCCTGG GCTGGGAGTC CATCTTCTGG GTTTCCGGTG GCGCCGGCAC GCTGCTGCTC GCTGCCGTCG TCCTGGTGGT TCCCGAATCC AAGGTGCGCA CACCGGGCCG CTTCGACTAC CTCGGCGCAG TGATTCTTTC CGCGGCACTG GCAGCGCTGC TCCTGGGCAT TTCCAAGGGC GGATCGTGGG GCTGGAGTTC CGAACCGGTG CTGCTTTTGT TCCTCGCCGC CGCCATCCTC CTGGCCGCCT GGCTGCCCTA CGAGCTGAAG GTCAGCCAGC CGATGGTGGA CCTCCGCACC TCCGGCCGGC GCCCTGTCCT CCTGACCAAC CTGGCATCCC TGCTGGTGGG TTTCGCCATG TTCGCCAACA TGCTGCTGAC CACCCAGCAG CTCCAACTGC CCACCTCCAC CGGCTACGGT TTCCAGCTCA GCGTGATCAC CGCCGGCCTC TGCATGGTTC CATCCGGCCT GGCCATGGTG GTCTTCGCTC CCGTTTCCGG CGGCATCATC CGGCGGTTCG GCGGGAAGAC TGCGCTGATC TCGGGCGCGG CGGTCATGGT GGTGGGGTAC GTTGGCCGCG TCTTCTTCTG GGATTCCATC GCCTCGGTGA TCATCGGCTC CACGGTGGTC AGCATCGGCA CCGCCATCGC CTACGCCGCG ATGCCCACCC TGATCATGGG GGCCGTGCCC ATCACCGAAA CAGCCTCGGC CAACGGCCTC AACAGCCTGG TGCGGTCTAT TGGAACCTCG ACGTCGAGTG CAGCCGTCGC CGCCGTCCTC ACCTCAGTGA CCATCACCGT GGGATCCGCC CGGCTGCCGT CCTTCGAGGC ATTCAAGGAC GTCTTCTGGA TGGCCGCCCT GGCGTCCGCG GCCTCCATGG TGGCTGCCGT GTTCATCCCG CGGGCCGCGG CCGCAGCCAA GGCGGCCCTC CCTGCGCCGG CCGCCACCGA ACTGGTGGTG CAGGGGCGCG TCCTGACGGC CGACCGCCGC CCGGTCACTC CCGCCGTCGT CACCGTCCTG CAGACAAGCG GCGAACCGGT GGACTGGAGC CGGGTGGACA GCGATGGCAA CTATTCCGTG GCACTGCCCG GGGCGGGAAC ATATCTGATG GTGGCCAACG CCGCCGGCTG GGCGCCGATG GCAGAGGTGT TCGACTTCGA CGGCCGCACG CTCCAGCAGA ACTTCCACCT GGAAAACCGC CTGGAACTGG CCGGAACCGC CACGGTGGGA GGCACGGCCC TCACGGACGC GGTGGTCACC CTGTTGCAGG CCTCCGGCGA ACACGTGGCG ACAGTCCGCA CCGATTCGGA GGGACGCTAT TCACTGCCGC TGCCCTTGGC CGGGCGCTAC ATCGTGACCC TGCTGAACCC GGCGACCCAC CAGGCCATCG CCCGGAAGCT GGCCGTGGAC AACCGGTCGG TGACCGCGGA CCTGGCGATG GACGCCCCGG CCGGACAGCT GGTGGACGCG TGA
|
Protein sequence | MPPSRNRSMS PGAITAVLAL SGTVVALMQT LVVPLLPDFP GILGVTADDA SWLVTATLLS SAVATPIVSR SADMYGKRKM MVICLAIMVA GSIVAAVGGS FLWLIVGRAL QGFSSALIPV GISIMRDELP KEKMGSAVAL MSATLGIGSA LGLPLAGLLY ESLGWESIFW VSGGAGTLLL AAVVLVVPES KVRTPGRFDY LGAVILSAAL AALLLGISKG GSWGWSSEPV LLLFLAAAIL LAAWLPYELK VSQPMVDLRT SGRRPVLLTN LASLLVGFAM FANMLLTTQQ LQLPTSTGYG FQLSVITAGL CMVPSGLAMV VFAPVSGGII RRFGGKTALI SGAAVMVVGY VGRVFFWDSI ASVIIGSTVV SIGTAIAYAA MPTLIMGAVP ITETASANGL NSLVRSIGTS TSSAAVAAVL TSVTITVGSA RLPSFEAFKD VFWMAALASA ASMVAAVFIP RAAAAAKAAL PAPAATELVV QGRVLTADRR PVTPAVVTVL QTSGEPVDWS RVDSDGNYSV ALPGAGTYLM VANAAGWAPM AEVFDFDGRT LQQNFHLENR LELAGTATVG GTALTDAVVT LLQASGEHVA TVRTDSEGRY SLPLPLAGRY IVTLLNPATH QAIARKLAVD NRSVTADLAM DAPAGQLVDA
|
| |