Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3099 |
Symbol | |
ID | 4597884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3302305 |
End bp | 3303606 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639777705 |
Product | major facilitator transporter |
Protein accession | YP_924288 |
Protein GI | 119717323 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATAT CGGGTTGCCG ACCACGCGGC CCGAGGCAGG AGGATGTGGG GGTGCTCGAC CGCATCGGCG CCGCGCTGGG GTTCCCCTCC GTGGGCAACC ACCGCAGGTT CGTCACCGCC ATCGCCATCG ACGCGATCGG CAGCGGTGTG TTCATGCCGG TCTCGATCCT CTACTTCCTC GTCACGACCG ACCTGTCGCT CGTGCAGGTC GGGGCCGCGA TCTCACTGGC GTCGGCGGTC GCGCTGCCCG CAGGCCCGCT GCTCGGTGGC CTGGTCGACC GCTACGGCGC CAAGCACGTG CTGCTCGCCG GCAACCTGCT CCAGACGGCC GGCTTCGTCG CCTACCTCGC GACCGACTCC TTCGTCGGCC TGACCCTGTG GACGGTCGTC GTGACGGTGG GCCGGACCGC CTTCTGGGGC TCCTACGGCA ACATCGTCGC CGCGATCTCG GCGCCGGGAG AGCGGGAGAA GTGGTTCGGC TTCCTCGGGG CGCTGCGCAA CGTCGGCTTC GCGGTCGGCG GGCTCGCGTC CGGCCTCGCG ATCACGATCG GCACCACCAC GGCGTACTCG GCGGTCGTGC TGGCCAACGC CGTGTCGTAC GTCGTCGCCT TCCTCCTGCT GCTGGCCGTG CCGCCGACGG CGAGTGTCGC GCACCGCGCG GTCGAGGGCG CGTGGGGCAC CGTGCTGCGC GACCGGCCGT ACCGGCTGCT GTGGGTCACC CAGATGGCCT ACTCGACGGC GATGATGGTG CTGAACTTCG CGGTGCCCGT GTATGCGACC ACCGTGCTCG GGCTCTCCGG CTGGGTCACC GGCGCGGTGT TCACGCTGAA CACCGTGATG GTGGGCTTCG GCCAGGGGCT GACGGTCCGG GCGATGACCG GCGCGCTGCG CTGGCGGGTG CTGCTGCTCA GCAACCTGGT GTTCGCGGCC TCGTTCGTCG TGCTGCTCGG CGCGAGTCGG CTCTCGGTCG GGCTCGCCGC GGCGGTGGTG GTGCTCGGCT CGGGCATCTA CACGCTGGGG GAGCTGCTCG GCGGGCCGGT GCTCGGCGCG CTGTCGGCGG AGGCCGCGCC GGAGCACCTG CGCGGGCGCT ACCTCGCGCT GATCCAGCTG GCCTGGAACC TGGCCAGCAC CGTGGCGCCG GTCGCCTTCG CCTGGCTCCT GGACCGCGGC CCGTCCCAGA TCTGGGTCGC GCTGGTCGTG ATGTCGCTGG TCGGTGCCGG TCTGACCGTG CTGCTCGGCC GGGTGCTTCC GCACGCGGCT CAGGCGGTCA CGAACCGGGC CGAGGAGCCG TTGCCGGCCT GA
|
Protein sequence | MAISGCRPRG PRQEDVGVLD RIGAALGFPS VGNHRRFVTA IAIDAIGSGV FMPVSILYFL VTTDLSLVQV GAAISLASAV ALPAGPLLGG LVDRYGAKHV LLAGNLLQTA GFVAYLATDS FVGLTLWTVV VTVGRTAFWG SYGNIVAAIS APGEREKWFG FLGALRNVGF AVGGLASGLA ITIGTTTAYS AVVLANAVSY VVAFLLLLAV PPTASVAHRA VEGAWGTVLR DRPYRLLWVT QMAYSTAMMV LNFAVPVYAT TVLGLSGWVT GAVFTLNTVM VGFGQGLTVR AMTGALRWRV LLLSNLVFAA SFVVLLGASR LSVGLAAAVV VLGSGIYTLG ELLGGPVLGA LSAEAAPEHL RGRYLALIQL AWNLASTVAP VAFAWLLDRG PSQIWVALVV MSLVGAGLTV LLGRVLPHAA QAVTNRAEEP LPA
|
| |