Gene Nther_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2073 
Symbol 
ID6316057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2193043 
End bp2194335 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content40% 
IMG OID642644461 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001918228 
Protein GI188586683 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.521626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAA ATGTTAAAAA ATCTTACGGA CATCGGGTAG TTATGTCTGC TTGGTTGGCA 
GTTTTCGTCT TATTTGGATA TCGTGCGACC TTTTCTGTAT TACAAGGCCC TATGGCCGAA
AGCACAGGAT GGACTTCTGG AGAACTGTCT CTGGGGTATT CTTTGATGAT GAGTATTTAT
GCTATTACAG CCTTCCTCAG CGGATACATC ATTGACAGAT GGGGCACTCG ACCAGCGTAT
ATTATTGGAG CCATTTTTGC ATGTTTAGGA TTTTTGGTAA CTAGTACTGT AGATTCTTAT
ATACAGTATC TAGCCAGCTA CTCAATTTTT GCCGGAATCG GTACTGGTAT GCTATGGGTG
TCTTCAACAA TTTCTGTCAG AAAATGGTAT GTAGGTAAAT CTTATGCTAC TATGTGGGGA
ATTGCTTTCA CAGGGGCTCC AGCTGCCCAA GTACTTTTAA GCTTGGGAAT AGATGGTGTC
ATAGAGGATA TGGGATGGAG GTTGGCAATG CAGCTTTTAG CCATAATAGT TCTAATTGCA
TTACTCGTTG CAGGGATATT AGCCAAAAAA AATCCCGAAG ACTACAATAT GGTACCCTTC
GGTTCTAATG AAAAAAATAC CTCTTCAAAG GATCATCACA AAAATGCCGA TACATCTAGA
ATTTGGAGTG TTAAGGAAGC TTTTGTAACT CCAGCCATTT GGGTAGTTAT AATTGCCTTT
TTATCTGCCA TGATAGGTGA ATTCTTAATT TGGACTCAGG TAGTGAATTA TTTTATCATT
GACGCAAATC TTTCCCAAAC TACCGCTACT AATTTATATG TAGTTATTGG GTTAGCTGGG
TTAGTAACCA TGCCCCTCAT GGGAATAATT GCAGATAAAG TGGTTTCAAT GGTAGGTGAT
GAAACAAAGG GAAGGAAATA TATGTTAGTT TTTGCTCCTG CAGTAGGTAT AGTAGCCTGT
TTGTTATTAT TACTTACCGA TCAAGCCATT GTATTGGGAG GCACAGCATC AGTTTTATTT
GCTATCTATT GGGCGATTGA GCCAGGTGGG GCAGCAGGAT ATGCAGGAGC AGTTTACGGT
CAAATATCGT TAGGGAAAAT TTGGGGATTA TCCACCTTAA TAGTAATGGG AATCGGGCCA
GCTTTGGGAA GCTTCATGGG AGGTTTTCTA TATGACTTAA CAGGAAGTTA TAATAATTCC
ATTTTATTTG CAATGGGAGC CTTCACATTG TCTACAATTG CAGCTTGCTT GCTACCACTG
AAAATATCAT CGAATTCAGA TCATCCTAAA TAA
 
Protein sequence
MEQNVKKSYG HRVVMSAWLA VFVLFGYRAT FSVLQGPMAE STGWTSGELS LGYSLMMSIY 
AITAFLSGYI IDRWGTRPAY IIGAIFACLG FLVTSTVDSY IQYLASYSIF AGIGTGMLWV
SSTISVRKWY VGKSYATMWG IAFTGAPAAQ VLLSLGIDGV IEDMGWRLAM QLLAIIVLIA
LLVAGILAKK NPEDYNMVPF GSNEKNTSSK DHHKNADTSR IWSVKEAFVT PAIWVVIIAF
LSAMIGEFLI WTQVVNYFII DANLSQTTAT NLYVVIGLAG LVTMPLMGII ADKVVSMVGD
ETKGRKYMLV FAPAVGIVAC LLLLLTDQAI VLGGTASVLF AIYWAIEPGG AAGYAGAVYG
QISLGKIWGL STLIVMGIGP ALGSFMGGFL YDLTGSYNNS ILFAMGAFTL STIAACLLPL
KISSNSDHPK