Gene Nther_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1756 
Symbol 
ID6314309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1821896 
End bp1823437 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content39% 
IMG OID642644130 
Productsodium/proline symporter 
Protein accessionYP_001917916 
Protein GI188586371 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.15784 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCCA TACCTATAAT GTCGACTTTT ATAGCATATT TAATCTTTAT GGTAATAGTC 
GGTATAATAA CCTATAAAAT GACTTATACC TTGGATGATT ATGTGTTGGC TGGGCGCGGT
TTAAATAAAT GGGTTGCAGC TATGTCAGCT CAAGCTAGCG ATATGAGTGG TTGGCTACTC
TTAGGATTAC CAGGTGCCGC TTATGCCTCT GGCATGGGCC AGTGGAGTAT CTGGATGTGT
ATAGGTCTTG CTACAGGAAC GATGCTTAAT TGGCAGTTTA TTGCTAAAAA ACTCAGGGCT
TATACAGAAT TAGCTGGTGA CAGCATTACA TTATCCGAAT TCTTCTCAAA TAGATTTCGA
GATAAGAGTG AAGTACTAAG AATTGTATCT GCTCTATTTA TTTTAGTATT TTTCTTGTTC
TATACTGCGT CTGGCTTGGT TGCCGGGGGG AAATTATTTG AGTCGTTTCT CGGTGTTGAT
TATTATCTTG CTTTAACTAT TGGAACTATT GTTATCTTGA TTTACACCTT TGTAGGGGGA
TTTATTGCAG TTACCTGGTT GGACTTTGTT AATGGTATAT TAATGTTTTT TGCTCTGATC
ATTGCCCCTA TAGGAGTTGT CAGTCATCTT GGAGGATTGA GTGATGTTTT TGCAGAAATT
GGTTCCATAA ACCCTGATCT ATTAGATATA ACTCGAGGTG TAAATTATAA TTACACTGAC
GATATTTTTT GGGAATCTAC AGGCCCTATT GGAGCCATTG GTATTATATC GGCTTTGGCC
TGGGGATTGG GATACTTTGG TCAACCTCAT ATCTTAGCGA GATTTATGGC CATTAAATCT
TTGAGAGCTA TTAAAACTTC TCGGTTAATT GCAGTGATAT GGGTTGTTTT AACATTGATG
GGAGCTTCCA TGGTGGGCTT TATGGGAATT TCCATTTTTG GAGAAGAAGC ACCTCTATAT
GATTCTGAGT GGGTCTTTTT AGAATTGGTT CCATTAATTT TTAACCCGTG GATTGCGGGG
ATTCTATTGG CAGCAGTGTT AGCAGCTATC ATGAGTACAG TGGACTCACA ATTACTTGTT
TCTTCTAGTG CCTTAACCCA AGACTTTTAC AGGCGATTCT TGAGAAAAGA CGCCACTGAA
AAAGAGTTGG TTTGGGGTGG AAGAATTTCT GTCTTAATTA TTACTTTAAT AGCATATTAT
TTAGCATGGG AAGAAGGCCA TGTATTAGAA CTAGTTGAAT ATGCTTGGGC TGGTTTTGGA
GCGACCTTTG GCCCTGCTGT CATAGCTTCT TTGTTTTGGA AAAGAACTAC TCGTAACGGA
GCTCTGGCAG GAATAGTTGT TGGTGGATTG ACGGTGATTT TATGGGAAAT ACTGGGTACT
CCCTTTGGGC TATATGAGAT CGTACCTGGT TTTATTTTGT CACTGGTAGC AATCATTATT
TTTAGTTTGA TTGATGATGA GCCTAGTCAA GAAATTTTGG AAGAATTTGA AAGATCCCAA
GAGCTTTCAC AACCTGGAGC TGAACTGCCA GAAAAGGAAT AA
 
Protein sequence
MVPIPIMSTF IAYLIFMVIV GIITYKMTYT LDDYVLAGRG LNKWVAAMSA QASDMSGWLL 
LGLPGAAYAS GMGQWSIWMC IGLATGTMLN WQFIAKKLRA YTELAGDSIT LSEFFSNRFR
DKSEVLRIVS ALFILVFFLF YTASGLVAGG KLFESFLGVD YYLALTIGTI VILIYTFVGG
FIAVTWLDFV NGILMFFALI IAPIGVVSHL GGLSDVFAEI GSINPDLLDI TRGVNYNYTD
DIFWESTGPI GAIGIISALA WGLGYFGQPH ILARFMAIKS LRAIKTSRLI AVIWVVLTLM
GASMVGFMGI SIFGEEAPLY DSEWVFLELV PLIFNPWIAG ILLAAVLAAI MSTVDSQLLV
SSSALTQDFY RRFLRKDATE KELVWGGRIS VLIITLIAYY LAWEEGHVLE LVEYAWAGFG
ATFGPAVIAS LFWKRTTRNG ALAGIVVGGL TVILWEILGT PFGLYEIVPG FILSLVAIII
FSLIDDEPSQ EILEEFERSQ ELSQPGAELP EKE