Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1767 |
Symbol | |
ID | 6314288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1833354 |
End bp | 1834553 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 642644141 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001917927 |
Protein GI | 188586382 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.18145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAACTA TAAGTGCAAT TAAAGTAGCA ATTTTGTCCG CTTCAAATTA TATAACAGTC CTAGTAAATA CAGTACTATT TCCTATTTTC CCTGTCATGG CACAGGCTTT AAATTTAACT TTAAGAGATT TAGCAATCTT AGTCGGTATA GTTTCTTTTC CATCAGCCCT AATTAATTTA GGAGGTGGGA TATTAGCAGA TAGATTCGGA AAAAAAATAA TTATTGTATT ATCCTTAACA TTATATGGGC TTGGGGGGTT ATTAGCTGGA CTAAGTATTA TATTAATGGA AGAACCTTAT CCGGTAATTT TAGTTGGTAG GCTATTTCAA GGAATAGGTG CGGCAACACC CATGTTTTTA TCAGTAGCAC TGGTTGGCGA TATTTTTCAA AGCTTAGAAA GAAGTAAAGC TTTGGGTTTT TTAGAAACGG CTAACGGATT AGGGAAAGTT ACTAGTCCAA TTTTAGGTGC TTTAATTGGT CTTATTACAT GGTATTCCAT ATTTTTTATT TATCCAATTG TGGCTTTACC GGTTGCTATT GCAACCTGGA AAGTTATAGA GGAACCTAAT GAGAATAAAG GGGTAGATTG GGAGAAACAG AAAAAGGCTT TTCGTCAATT TAAAGATGAA TCCAGAATTA TCACTTTGCT CGTGGCTTTT TTGGTAATAT TTATTTTAAT TGGTACCATG TTTTGGTTGA GTGATTTTTT AGAAGCTAGA CTAGAACTAA ATCAGATTTT AAGGGGAGTT GTTATATCCT TACCAGCTCT AGCTATGTTA CTTACCACAT TATTTGCTGA AAGAATACAT AATAAGTTGA ATCCTCGCTT TATTATGGGG GGTGGTCTTA TATTAACATC AGCTTGTTTA ATTGGTATTT ACCATACCTT AGAAACGATT TTATTTTGGC CATTAATAGT AGCTTTAGGT GTGGGGGCAG GAATTGTATT GCCTTCTGTT GACATGGTGA GTACTTCAGT AGAAATTAAA GAAATTAGAG GAGTTATGAG TACAATATAC GGATCTGCTC GATCTTTGGG AGGAGCAACT ACAACTATTA CATTCTCTTA CTTGTTGGAA TACGGATTAC AATTAACATT TTATAGTATA GCAGTGGGAG GGATAATAGT TGGTTTAATT GTATTATTTA GAATGAATGA AAAAAAATTA TTGCCCAAAG AACTGTTACC GGATAAGTGA
|
Protein sequence | MKTISAIKVA ILSASNYITV LVNTVLFPIF PVMAQALNLT LRDLAILVGI VSFPSALINL GGGILADRFG KKIIIVLSLT LYGLGGLLAG LSIILMEEPY PVILVGRLFQ GIGAATPMFL SVALVGDIFQ SLERSKALGF LETANGLGKV TSPILGALIG LITWYSIFFI YPIVALPVAI ATWKVIEEPN ENKGVDWEKQ KKAFRQFKDE SRIITLLVAF LVIFILIGTM FWLSDFLEAR LELNQILRGV VISLPALAML LTTLFAERIH NKLNPRFIMG GGLILTSACL IGIYHTLETI LFWPLIVALG VGAGIVLPSV DMVSTSVEIK EIRGVMSTIY GSARSLGGAT TTITFSYLLE YGLQLTFYSI AVGGIIVGLI VLFRMNEKKL LPKELLPDK
|
| |