Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1837 |
Symbol | |
ID | 6315664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1912486 |
End bp | 1913733 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642644215 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001917997 |
Protein GI | 188586452 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00768254 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.336306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGTG TTGAGCCTGT GAAAGGTTTA GGTGGCTATA GCTGGGTACT TGAGAACAGG TTGTTTTACT GGTTTTGGTT GGCTAAGGCC ATAAGTTCAT TGGGAGCAGT ATTTTTTAGA CTGACAATTT TAATTACTAT TACAGAAATG ACCGATTCGG CCATGGCATT AGGTTTGGTT TTAGGTGTTC AGGCACTGCC CTCCTTATTT ATGTCTCCTC TAGCCGGAGT ATTGGTAGAT CGCTTAAACA AAAAAATAGT TTTAATAGTT CTCGATATCT TAAGAGCATC TTTAATGCTT TTGGCAATTT TCATAGTAAA TGATCTGATA CCTGTGATAA TAATAGTTGC CATAATGGGT GTATGTACAA CAGTTAGACA GGCGACAGAT ATGGCAATAC TTCCGGCTTT GGTTGAACAA AAAGATTATA TGGCAGCTAC AGGTCTGTTA CGTGGAACAT TACAAATTAT GCAGTTAGTT GGCCCAGGGA TAGCCGGTAT ACTTATAGAT ATATTTGGAC TGCAAACTGT TTTTGGTGTT AACTCTCTGG CATTTCTTTT TTCTGGTTTG TTTTTATTGT TTTTGCCTAT ATTTTTAGAG CAGGATTCTC GGGAAACATT TAATTTGAAA AAAGAAATAA CAGAAGGTCT AGTAGCTATT AAGGGTTCTC GAGTTTTGAT TAGTATGTTG GCTTTATACT GCGCTGTCAG TATCTTTGCC GGAGGAACAG GAGTACTAAT GGTAGATTAT ATTCAAAATA TCCTACAGGC AAGTCCCTAT CAACTAGGAG TCGTACAAAG CGTCTTGGCT TTAGGAGCTA TTTTAGCTAA TCTGGTAGCT GGTTACTTCG GTAATCAGGC TCCGCGGTTT CATTTGCTGT TGGGAGCAAC TTTTGGAATT GGTATTGTTA ATTTAATTTT CTTTACTGAT CCAGGTATGA TTATTTTAGG AATCTGGGCT TTTATTATAG GAGCTTGTGA TGGTATGAAT GAAGCGCCGT TCTATAGTCT CATTATTGAT TATTCGCCAG ATGAAGTAAG AGGTCGTATC ATGAGTTTTG TCAATGCTTT AATACGACTA ACAGCTATTA TCAGTTTAGG ATTAGCAGGA ATATTTGCAG GATGGTTTGG ATCAGCCAAT GTTATCGGGG CAAGTGGTAT AATTCTATTA TTACTTGGAA TGGTTATTTT AATGGGTGAT GGACGAAAAG TACTATCTAG GAAGGATGAA CAGTTAGATT CTAGATGA
|
Protein sequence | MKGVEPVKGL GGYSWVLENR LFYWFWLAKA ISSLGAVFFR LTILITITEM TDSAMALGLV LGVQALPSLF MSPLAGVLVD RLNKKIVLIV LDILRASLML LAIFIVNDLI PVIIIVAIMG VCTTVRQATD MAILPALVEQ KDYMAATGLL RGTLQIMQLV GPGIAGILID IFGLQTVFGV NSLAFLFSGL FLLFLPIFLE QDSRETFNLK KEITEGLVAI KGSRVLISML ALYCAVSIFA GGTGVLMVDY IQNILQASPY QLGVVQSVLA LGAILANLVA GYFGNQAPRF HLLLGATFGI GIVNLIFFTD PGMIILGIWA FIIGACDGMN EAPFYSLIID YSPDEVRGRI MSFVNALIRL TAIISLGLAG IFAGWFGSAN VIGASGIILL LLGMVILMGD GRKVLSRKDE QLDSR
|
| |