Gene Information |
Locus tag | Namu_2493 |
Symbol | |
ID | 8448104 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2748994 |
End bp | 2750547 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645041605 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003201849 |
Protein GI | 258652693 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
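
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2748994
end_bp = 2750547


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values match the Gene Length (1554 bp) and Protein Length (517 aa) rows.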
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000001804 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00137453 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACAC CCCCACCGGT GGGCGGCTCG CCGGCCACCA CCGCCCCGGT CTCCCCCGTC CCCCTGCCGC CCGCCGACGG CGCGCCGGCC GGGCCGCCGG CCGGCCGGTT CTACCCATCG CTGCGCGCGG CCTGGATTCC CTTGGCCGCA CTGTGTCTGG CGTTCTTCGT GGAGATGGTC GACAACACCC TGCTGACCGT CGCCCTGCCG ACGATCGGCC GGGACCTGCA GGGCAGCACC ACCTCGCTGC AGTGGATCAC CGGCGCCTAC TCGCTCACCT TCGGCGGGCT GCTGCTGACG GCCGGCTCGA TCGCCGACCG GTTCGGTCGC CGGCGGGTCC TGCTCCTGGG CCTGTCGGCG TTCGGCCTGA TGAGCCTGGC CGTCGTCGCG GTGACCACCA CCGGCGAACT CATCGCCCTG CGGGCCGGGC TGGGCATCGC CGCCGCCGCC ATGGCGCCGA TCACCAACTC GCTGGTGTTC CGGCTGTTCG ACGACGACGC GCTCCGGCGG CGGGCGATGA CGGTGATGAT CGTCGTCGGG ATGAGCGGTT TCATCCTCGG GCCCCTCATC GGGGGCTCCG CCCTGGCTCA CGTCGGCTGG CAGTGGCTGC TGCTGGTCAA CGCGCCGATC GCCCTGATCG CCGCCGTCGG CGTGCGGCTG GGCGTGGCCA AGGACCACCC CGACGATCTC ATCGCCGACC CCCTGGACCT GCCGGGCGCG GCGCTGACCA TCCTGGCCAT CGGGCTGGGC TGCTACACGC TGACCAGCGG TATCGAGCAC GGTTGGGTGT CCCTGCCGAC CCTGCTCTCG GCCGCCGGGG CGATCGCTTC CGTGATCGGC TTCGTGGTGC GCGAGCGTCG CACCGCCTTC CCCATGCTGG ATCTGCGGTT GCTGCGCCAC CCCGTCGTGC GTGGTGCGAC CGTGGCCCAG CTGGGCACGG CCATCGCCAT GGCCGGCGTG ATGTTCAGCC TGGTCCTGCA TTTCCAGTTC GCCTACGGCT GGAGCCCGAT GGTCGCCGGC CTGGCCAACC TGCCGTTCAT CGTCACCATG CTGGCCGCCA CCCCGCTCAC CGAGTACCTG GTCACCCGGT TCGGCCGCCG GATGGCCTGC CTGGTCGGCG CGGGTGCGCT GACCGTGGGT CTGGCCTGGC TGGCCTGGGC GGTCGACCAC GGGTACCTGG CGATCGCCGC CGGCATGGTG GTGATGACCT TCGGGCTGCG CACCGTCATG ACGATCTGCG CGGTCGGCCT GGTCGACGCC ATGCCGGAGA ACCGCACGTC GTTGGGGGCC GCCCTCAACG ACACCGCTCA GGAGGTCGGC TCCAGCATCG GCACCGCCCT GGTCGGCACG TTGATCGCCG CGCTCGTGGT CACCGTCCTG CCGCTCGGCG CGTGGAGCCC GGAGCTGGTC GACTCGTACT TCCACGGCGA GCGGATCGCC TACCTGGTGC TCACGGTGCT GGTCGGTACG GTGGCCTTCC TCGGGGCCGC GACCCTGGAC GACTCGCACC GTCCCGAGCA GCTGGCCGAC GAGCAGCTCG AGTCGCCGGC CTGA
Protein sequence | MTTPPPVGGS PATTAPVSPV PLPPADGAPA GPPAGRFYPS LRAAWIPLAA LCLAFFVEMV DNTLLTVALP TIGRDLQGST TSLQWITGAY SLTFGGLLLT AGSIADRFGR RRVLLLGLSA FGLMSLAVVA VTTTGELIAL RAGLGIAAAA MAPITNSLVF RLFDDDALRR RAMTVMIVVG MSGFILGPLI GGSALAHVGW QWLLLVNAPI ALIAAVGVRL GVAKDHPDDL IADPLDLPGA ALTILAIGLG CYTLTSGIEH GWVSLPTLLS AAGAIASVIG FVVRERRTAF PMLDLRLLRH PVVRGATVAQ LGTAIAMAGV MFSLVLHFQF AYGWSPMVAG LANLPFIVTM LAATPLTEYL VTRFGRRMAC LVGAGALTVG LAWLAWAVDH GYLAIAAGMV VMTFGLRTVM TICAVGLVDA MPENRTSLGA ALNDTAQEVG SSIGTALVGT LIAALVVTVL PLGAWSPELV DSYFHGERIA YLVLTVLVGT VAFLGAATLD DSHRPEQLAD EQLESPA
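
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field might be joined into a contiguous string and its GC fraction computed is shown below; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def join_blocks(blocked_sequence):
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked_sequence.split()).upper()


def gc_fraction(sequence):
    """Fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two blocks of the gene sequence field, used only as a short sample.
sample = join_blocks("ATGACCACAC CCCCACCGGT")
sample_gc = gc_fraction(sample)
print(len(sample), "bases; GC fraction", sample_gc)
```

Applied to the full gene sequence field, the same two helpers would give the figure to compare against the GC content row.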