Gene Information |
Locus tag | Nther_1756 |
Symbol | |
ID | 6314309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1821896 |
End bp | 1823437 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642644130 |
Product | sodium/proline symporter |
Protein accession | YP_001917916 |
Protein GI | 188586371 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family; [TIGR02121] sodium/proline symporter |
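The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon while Protein Length does not. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1821896, 1823437)
print(length, protein_length(length))  # 1542 513
```

Under those assumptions the computed values agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields.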
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.15784 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTCCCCA TACCTATAAT GTCGACTTTT ATAGCATATT TAATCTTTAT GGTAATAGTC GGTATAATAA CCTATAAAAT GACTTATACC TTGGATGATT ATGTGTTGGC TGGGCGCGGT TTAAATAAAT GGGTTGCAGC TATGTCAGCT CAAGCTAGCG ATATGAGTGG TTGGCTACTC TTAGGATTAC CAGGTGCCGC TTATGCCTCT GGCATGGGCC AGTGGAGTAT CTGGATGTGT ATAGGTCTTG CTACAGGAAC GATGCTTAAT TGGCAGTTTA TTGCTAAAAA ACTCAGGGCT TATACAGAAT TAGCTGGTGA CAGCATTACA TTATCCGAAT TCTTCTCAAA TAGATTTCGA GATAAGAGTG AAGTACTAAG AATTGTATCT GCTCTATTTA TTTTAGTATT TTTCTTGTTC TATACTGCGT CTGGCTTGGT TGCCGGGGGG AAATTATTTG AGTCGTTTCT CGGTGTTGAT TATTATCTTG CTTTAACTAT TGGAACTATT GTTATCTTGA TTTACACCTT TGTAGGGGGA TTTATTGCAG TTACCTGGTT GGACTTTGTT AATGGTATAT TAATGTTTTT TGCTCTGATC ATTGCCCCTA TAGGAGTTGT CAGTCATCTT GGAGGATTGA GTGATGTTTT TGCAGAAATT GGTTCCATAA ACCCTGATCT ATTAGATATA ACTCGAGGTG TAAATTATAA TTACACTGAC GATATTTTTT GGGAATCTAC AGGCCCTATT GGAGCCATTG GTATTATATC GGCTTTGGCC TGGGGATTGG GATACTTTGG TCAACCTCAT ATCTTAGCGA GATTTATGGC CATTAAATCT TTGAGAGCTA TTAAAACTTC TCGGTTAATT GCAGTGATAT GGGTTGTTTT AACATTGATG GGAGCTTCCA TGGTGGGCTT TATGGGAATT TCCATTTTTG GAGAAGAAGC ACCTCTATAT GATTCTGAGT GGGTCTTTTT AGAATTGGTT CCATTAATTT TTAACCCGTG GATTGCGGGG ATTCTATTGG CAGCAGTGTT AGCAGCTATC ATGAGTACAG TGGACTCACA ATTACTTGTT TCTTCTAGTG CCTTAACCCA AGACTTTTAC AGGCGATTCT TGAGAAAAGA CGCCACTGAA AAAGAGTTGG TTTGGGGTGG AAGAATTTCT GTCTTAATTA TTACTTTAAT AGCATATTAT TTAGCATGGG AAGAAGGCCA TGTATTAGAA CTAGTTGAAT ATGCTTGGGC TGGTTTTGGA GCGACCTTTG GCCCTGCTGT CATAGCTTCT TTGTTTTGGA AAAGAACTAC TCGTAACGGA GCTCTGGCAG GAATAGTTGT TGGTGGATTG ACGGTGATTT TATGGGAAAT ACTGGGTACT CCCTTTGGGC TATATGAGAT CGTACCTGGT TTTATTTTGT CACTGGTAGC AATCATTATT TTTAGTTTGA TTGATGATGA GCCTAGTCAA GAAATTTTGG AAGAATTTGA AAGATCCCAA GAGCTTTCAC AACCTGGAGC TGAACTGCCA GAAAAGGAAT AA
Protein sequence | MVPIPIMSTF IAYLIFMVIV GIITYKMTYT LDDYVLAGRG LNKWVAAMSA QASDMSGWLL LGLPGAAYAS GMGQWSIWMC IGLATGTMLN WQFIAKKLRA YTELAGDSIT LSEFFSNRFR DKSEVLRIVS ALFILVFFLF YTASGLVAGG KLFESFLGVD YYLALTIGTI VILIYTFVGG FIAVTWLDFV NGILMFFALI IAPIGVVSHL GGLSDVFAEI GSINPDLLDI TRGVNYNYTD DIFWESTGPI GAIGIISALA WGLGYFGQPH ILARFMAIKS LRAIKTSRLI AVIWVVLTLM GASMVGFMGI SIFGEEAPLY DSEWVFLELV PLIFNPWIAG ILLAAVLAAI MSTVDSQLLV SSSALTQDFY RRFLRKDATE KELVWGGRIS VLIITLIAYY LAWEEGHVLE LVEYAWAGFG ATFGPAVIAS LFWKRTTRNG ALAGIVVGGL TVILWEILGT PFGLYEIVPG FILSLVAIII FSLIDDEPSQ EILEEFERSQ ELSQPGAELP EKE
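The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is displayed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the Gene sequence field.
prefix = "ATGGTCCCCA TACCTATAAT GTCGACTTTT"
print(translate(prefix))  # MVPIPIMSTF
```

The output matches the first ten residues of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.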