Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3403 |
Symbol | |
ID | 8449018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3740711 |
End bp | 3742252 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645042480 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003202720 |
Protein GI | 258653564 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00000797372 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000013818 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCTGA ACCCGCGGGC CGATCCGGCC GGGCCGCGGG CCGGAGCCGG CGGCGCCGGG ATCGCGCTGC TGGGGGTGAT GGCGGCGGTC CAGGGCTCGG ATCCCAACAT CGCCAGCACC GCCCTGGTCG GCGCCAGCCG CGGCCTGCAG ATGACCGGTG GCCTGCTGGC CCTGGCCGCC AGCGTGTCCA CCCTGGCGCT GGCCGCCACG GTCATCTCCA CCGGGTTGCT GGCCGACCGG TTCGGCCGGC GCGGCGTGCT GATGGCCGCG TTGGCGCTCT CGGCCGCGGG CGACCTGATC GCCGCCCTGG CCCCGACCGC AGGGCTGTTC CTGATCGGCC GGGCGGTGGC CGGGATCGGC CTGGGCGCGG TGTACGGGGC GGCCTTCGCC TACATCCGGG CGATCACCCC GCCCGATCGG ATCGCCGGTG CCATCGGCAT TTTCGGCGCC GTCTCGGGCG GCGTCACGGT GCTGATGACG TTCCTGGGCG GGGCCCTGGC CTCGGTGCAC TGGCGGCTGG CCTTCCTGGT GGTGCCGGTG GCCGCCCTGC TGTGCGTGCC GGCGGTGCGG GCGGTGCTGC CGGCCCAGCC GCGGGTCGCC GACGGCCCGC GGGACTATCC GGGCCAGGTG CTACTGGCCC TGGGCGTGGT CGGCGTGCTC TACGGGTTCA GCCACGCCGC CGACGGGTTG ACCTCGCCGC TGACCTTCGG CCCGCTGCTG GGCGGGCTGG TGCTGCTGGC CTTGTTCGTG GTCCGCGAGC GGCACACGTC GGCCCGCTTC TTCCCGGTGG AGCTGCTGAC CAAGCCGCTG TTCCTGGCCG CCATCTGCGC CGGCTTCGTC TACAACTTCG GCACCGCCGT CGGCTTCCTG CAGCTCACCG ACCTGTGGCA GTACGTGGTC GGCCTGTCCA CGCTGCGGGT CTCGCTGTGG CAGATGCCGT TCCTGCTGGC CGGCATCGCC GCCGCGGTGC TGTTCGGCCG GCTGATGACC AGGGGGCTGA CCGCGGCCAG CACGGTGGCC ATCGGGTCGC TGGCCGCGGC GGCCGGGTTC GTCCTGCTCG CCGTGCTGCA TTCGTCGACC TCGCTGTGGG GCTTTCTGCC CGGCTCGATC CTGCTCGGCG CGGGCGTCAT CATCGCCTCG CTGCCGTACG GGACGCTGAT CATCGCGCAG GCCCCGGCCC GCTACTTCGG GCCGGTCACC TCGTCGCGGA CGACCATCGG CCAGTTCTTC TACGCGGCCG GGCTGGCCCT GTCCACGGTG CTGGTGAACA CGATGACCAC CGGCGGGGTG GTCCGCCGAC TGGAGCAGGC CGGCGTGCCG CCGACCGACA CCGGGCAGGG GCTGGACGCG GTCACCGCCT TCGCCGCCGA CGGCACCCGC CCCAGCACCG CCCTGGGCCA GCAGGCGCTG GCCGAGGCGG CCCAGTCCTA CGGCCAGGCG TTCGCGCTGA CGACCCTGCT GGCTGCCCTG GTCACCCTGA TCGTCGGCGG CCTGGGCTGG TGGCTGCTGC GCCGGCACGA GGCGCGACCG GCCACCGGCT GA
|
Protein sequence | MLLNPRADPA GPRAGAGGAG IALLGVMAAV QGSDPNIAST ALVGASRGLQ MTGGLLALAA SVSTLALAAT VISTGLLADR FGRRGVLMAA LALSAAGDLI AALAPTAGLF LIGRAVAGIG LGAVYGAAFA YIRAITPPDR IAGAIGIFGA VSGGVTVLMT FLGGALASVH WRLAFLVVPV AALLCVPAVR AVLPAQPRVA DGPRDYPGQV LLALGVVGVL YGFSHAADGL TSPLTFGPLL GGLVLLALFV VRERHTSARF FPVELLTKPL FLAAICAGFV YNFGTAVGFL QLTDLWQYVV GLSTLRVSLW QMPFLLAGIA AAVLFGRLMT RGLTAASTVA IGSLAAAAGF VLLAVLHSST SLWGFLPGSI LLGAGVIIAS LPYGTLIIAQ APARYFGPVT SSRTTIGQFF YAAGLALSTV LVNTMTTGGV VRRLEQAGVP PTDTGQGLDA VTAFAADGTR PSTALGQQAL AEAAQSYGQA FALTTLLAAL VTLIVGGLGW WLLRRHEARP ATG
|
| |