Gene Nther_1837 details


Gene Information

Locus tag: Nther_1837
Symbol:
ID: 6315664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1912486
End bp: 1913733
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 37%
IMG OID: 642644215
Product: major facilitator superfamily MFS_1
Protein accession: YP_001917997
Protein GI: 188586452
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
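
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the gene ends in a single stop codon. That reading is an assumption, not something the record states. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1912486, 1913733)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both values agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields of this record.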


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00768254
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.336306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGTG TTGAGCCTGT GAAAGGTTTA GGTGGCTATA GCTGGGTACT TGAGAACAGG 
TTGTTTTACT GGTTTTGGTT GGCTAAGGCC ATAAGTTCAT TGGGAGCAGT ATTTTTTAGA
CTGACAATTT TAATTACTAT TACAGAAATG ACCGATTCGG CCATGGCATT AGGTTTGGTT
TTAGGTGTTC AGGCACTGCC CTCCTTATTT ATGTCTCCTC TAGCCGGAGT ATTGGTAGAT
CGCTTAAACA AAAAAATAGT TTTAATAGTT CTCGATATCT TAAGAGCATC TTTAATGCTT
TTGGCAATTT TCATAGTAAA TGATCTGATA CCTGTGATAA TAATAGTTGC CATAATGGGT
GTATGTACAA CAGTTAGACA GGCGACAGAT ATGGCAATAC TTCCGGCTTT GGTTGAACAA
AAAGATTATA TGGCAGCTAC AGGTCTGTTA CGTGGAACAT TACAAATTAT GCAGTTAGTT
GGCCCAGGGA TAGCCGGTAT ACTTATAGAT ATATTTGGAC TGCAAACTGT TTTTGGTGTT
AACTCTCTGG CATTTCTTTT TTCTGGTTTG TTTTTATTGT TTTTGCCTAT ATTTTTAGAG
CAGGATTCTC GGGAAACATT TAATTTGAAA AAAGAAATAA CAGAAGGTCT AGTAGCTATT
AAGGGTTCTC GAGTTTTGAT TAGTATGTTG GCTTTATACT GCGCTGTCAG TATCTTTGCC
GGAGGAACAG GAGTACTAAT GGTAGATTAT ATTCAAAATA TCCTACAGGC AAGTCCCTAT
CAACTAGGAG TCGTACAAAG CGTCTTGGCT TTAGGAGCTA TTTTAGCTAA TCTGGTAGCT
GGTTACTTCG GTAATCAGGC TCCGCGGTTT CATTTGCTGT TGGGAGCAAC TTTTGGAATT
GGTATTGTTA ATTTAATTTT CTTTACTGAT CCAGGTATGA TTATTTTAGG AATCTGGGCT
TTTATTATAG GAGCTTGTGA TGGTATGAAT GAAGCGCCGT TCTATAGTCT CATTATTGAT
TATTCGCCAG ATGAAGTAAG AGGTCGTATC ATGAGTTTTG TCAATGCTTT AATACGACTA
ACAGCTATTA TCAGTTTAGG ATTAGCAGGA ATATTTGCAG GATGGTTTGG ATCAGCCAAT
GTTATCGGGG CAAGTGGTAT AATTCTATTA TTACTTGGAA TGGTTATTTT AATGGGTGAT
GGACGAAAAG TACTATCTAG GAAGGATGAA CAGTTAGATT CTAGATGA
 
Protein sequence
MKGVEPVKGL GGYSWVLENR LFYWFWLAKA ISSLGAVFFR LTILITITEM TDSAMALGLV 
LGVQALPSLF MSPLAGVLVD RLNKKIVLIV LDILRASLML LAIFIVNDLI PVIIIVAIMG
VCTTVRQATD MAILPALVEQ KDYMAATGLL RGTLQIMQLV GPGIAGILID IFGLQTVFGV
NSLAFLFSGL FLLFLPIFLE QDSRETFNLK KEITEGLVAI KGSRVLISML ALYCAVSIFA
GGTGVLMVDY IQNILQASPY QLGVVQSVLA LGAILANLVA GYFGNQAPRF HLLLGATFGI
GIVNLIFFTD PGMIILGIWA FIIGACDGMN EAPFYSLIID YSPDEVRGRI MSFVNALIRL
TAIISLGLAG IFAGWFGSAN VIGASGIILL LLGMVILMGD GRKVLSRKDE QLDSR
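
The gene and protein sequences above are printed in blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the nucleotide sequence for comparison with the protein sequence. All function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Codon table in the usual TCAG ordering; amino-acid assignments are
# identical for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGAAAGGTG TTGAGCCTGT GAAAGGTTTA GGTGGCTATA GCTGGGTACT TGAGAACAGG"
seq = clean_sequence(first_line)
print(translate(seq))  # MKGVEPVKGLGGYSWVLENR
print(f"{gc_content(seq):.0%}")
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` should reproduce the protein sequence listed above, and `gc_content` can be compared against the GC content field.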