Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_2868 |
Symbol | |
ID | 6316326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 3099318 |
End bp | 3100928 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642645239 |
Product | Na/Pi-cotransporter II-related protein |
Protein accession | YP_001919003 |
Protein GI | 188587458 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1283] Na+/phosphate symporter |
TIGRFAM ID | [TIGR00704] Na/Pi-cotransporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000013552 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000000000000784363 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGGGAACAA TTGGAGAGGT ATTGGTGGGC CTTTTGGGTG GTTTGGGTCT CTTTATTTAT GGCATGCAAA TCATGTCTGA AGGGATGCAA AAGGCTGCGG GAGACAAGCT CAGGAATATA CTTGAAGTTC TAACAAAAAA CCCTGTCATA GCAATGTTTA CAGGCGTCAT TCTAACTGTA CTAGTTCAGA GTAGTAGTAC AAGTACAGTC ATGATTGTCA GCTTTGTAAA TGCAGGCTTA ATGAGCCTGG GACAGGCTGT TGGAACTATA TTTGGAGCAA ATATTGGAAC GACAATTACC GCCCAGGTAG TTTCTTTTGA CCTGGGTATG TTTGCTTTAC CTGCCATTGC TGTAGGGGTG GCTCTTAATT CTTTTGCAAA GCGACGATTG AAAAAATACG TGGGAAGATC TATCCTGGGC TTTGGTATCC TTTTTCTAGG ATTAACTATG ATGAGTGATG CCATGGTGCC CCTAAGAGAA CAGGAAATGT TTATTAATAT GCTGAAACAG TTTGGAGCCT TTCCGGCCTT AGGGGTTCTG GCAGGAGCTA TGTTCACTAT TGCTGTTCAA AGCAGTAGTG CCGCCACAGG TGTAATTATT GCTTTAACAT TACAGGATTT GCTCACTTTT GAATCAGGTG TGGCTTTAAT TTTAGGAACC AATATCGGTA CTTGCGCTAC CACTTTAGTT GCAAGTGTGG GTTCAAATTT AGCTGCCAGG CGTACAGCTG CTGCGCATAT AATTTTTAAT ACCATTGGAA CCTTGTTGAT ACTAATTATT TTATCACCTT TTTCAGAAAT TGTACGTATG ACAGCAGATA CAGTCCCTAG ACAGGTGGCA AATGCTCACA CAATTTTTAA CGTGGGAATG GCCGTTGTTT TTTTACCTTT TACTAATAAA TTTGTCAATT TAGTAATAAA ACTTATCCCT GGTGAAGAAA CAGGTATCCA GCAGGGAAGT AAGTATTTGG ATAAACGTGT TTTGGCCACC CCAAGTGTGG CCATTTCAAA TGCTAGAAAA GAAGTTATTC GTATGGGTAA GCTAGCTCAC GAGATGGTTG ATGAAGCTTA TGAAAGTTTT TTAGAAAAGG ATTTTCGAAA AATGAAGTTA GTAGAGCAAA AAGAAGATGT GGTAGACCAA CTAGAGAAAG AGATTTCAAC TTATTTATCG GCTATTTCTT ATAGTTCGTT GACTACCAGT CAAAGTAAAC AAGTAACTTC TCTCATGAAC GCTATTAACG ATATTGAAAG GGTAGGAGAC CATAGCGAAA ATTTAGTCAA TCTGACAAAA GCAATTATAG AAGATAATCT TCCGTTTAGT GATACAGCTA TCAAAGAACT TTCGGACTTC CACGAAAAAG TGTCTGGAAT GTACCAAAAA GCTATTAATG CCTTTGAAGA CGAAGATTAC GAAAAAGCAA GAGAAGTTGT TGAATATGAT GATGTGATTG ATGAAATGGA AAAAATATTG AGAAAGCATC ACATGATGAG ATTGAATGAA AAGCGCTGTC ATCCTTCTTC AGGTGTGGTG TATCTTGATA TCTTAAGTAA TTTCGAACGA ATTGGAGACC ATTCAACTAA CTTAAGTGAA GCTATCTTTG GTGATGACTA A
|
Protein sequence | MGTIGEVLVG LLGGLGLFIY GMQIMSEGMQ KAAGDKLRNI LEVLTKNPVI AMFTGVILTV LVQSSSTSTV MIVSFVNAGL MSLGQAVGTI FGANIGTTIT AQVVSFDLGM FALPAIAVGV ALNSFAKRRL KKYVGRSILG FGILFLGLTM MSDAMVPLRE QEMFINMLKQ FGAFPALGVL AGAMFTIAVQ SSSAATGVII ALTLQDLLTF ESGVALILGT NIGTCATTLV ASVGSNLAAR RTAAAHIIFN TIGTLLILII LSPFSEIVRM TADTVPRQVA NAHTIFNVGM AVVFLPFTNK FVNLVIKLIP GEETGIQQGS KYLDKRVLAT PSVAISNARK EVIRMGKLAH EMVDEAYESF LEKDFRKMKL VEQKEDVVDQ LEKEISTYLS AISYSSLTTS QSKQVTSLMN AINDIERVGD HSENLVNLTK AIIEDNLPFS DTAIKELSDF HEKVSGMYQK AINAFEDEDY EKAREVVEYD DVIDEMEKIL RKHHMMRLNE KRCHPSSGVV YLDILSNFER IGDHSTNLSE AIFGDD
|
| |