Gene Nther_1596 details


Gene Information

Locus tag: Nther_1596
Symbol:
ID: 6315775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1671840
End bp: 1673558
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 38%
IMG OID: 642643967
Product: extracellular solute-binding protein family 5
Protein accession: YP_001917758
Protein GI: 188586213
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
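
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1671840, 1673558)
print(length, protein_length_aa(length))  # prints "1719 572"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.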


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0368881
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 7.93881e-23
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGTCAAGTC GAAAAATATT TGTTTTGTTT ATTGCTTTTC TCCTTACAAT GGGAGTGGCT 
TTGAGTGCTT GTGCACCTCC AGAAGAGGCT AAAGATGAAG AAGTGGAAGA AAAAGAAGAC
GAAGAACCCG TAGTTGAAAA CCCTGCAGTT GAACGTCCTA ATGAATTAAT TATCGGAGGA
ACTGATTTAG ATGGAATTTT TAATCCAGTT CTTTACTCTA CGGCATATGA TGCTTGGGTT
ATAGGAATGA TGTATGATAC ATTGCTTACA GTAGACGAAA ATGGTGAATT AACGACGGAT
CAGAGATCCA TAGCTAAAGA CTATGAAATT TCCGACGATG GTTTAGAATA CACTTTTTAT
CTTAGAGAAG GATGGAAATT TCACGATGGA GTAGAAGTAA CTGCGGAAGA TGTTGCATTT
ACCCTTGAAG TAACTGCTCA TCCAGATTAC GATGGGCCAC GTGCCAGCTG GTCAGATAAT
ATTGTAGGTG TTGATGAATA CAGAGCAGGA GAAACTGATG AATTAGAAGG TGTTATTGTT
GAAGATGATT ATACTCTCAC TGTAAAATCC CAAGAACCCG ATGCCGGTGA TATATTTGAT
TACTCAACTT ATGCTCTTCC AAAGCATTAT TATGAATTTG ATGACTACGA AGAAATTCAC
GATTTAACAA ATGATCCTAT GGGAAGTGGA CCTTTTCAAT TAGTAGAGTA TAGCCCGGAT
GAACACGCTA TTTTAGAACC TTTCGAGGAT TATTATCATG GTGAACCCGA GTTAGATCGT
ATAATTTACG AAGAAATTGA AACGGAACAG CAGATCCCTA TGGTTGAAAC GGCAGAAGCA
GATATTGTGG CAGTATCATC AACTCCCGAA AATTATGAGA TGCTACAAGA AGAAGATCAT
CAAGAAACAA TTACTTTCTT AGATAACTCT TATTCCTATA TCGGACTTAA CCATCAAAAT
GAACATTTAC AACATCAAGA AGTCAGGCAA GCCTTGGCTT ATGCCATAGA TATTGAATCA
TTCATTGAAG GAATGTACGG GGAAGAACTG GCAAGACCTA TGGCAACGCC ATTTTCACCT
GTATCTTGGG CTTATCCTAA TGAAGACCAA TTAAATTTTT ATGAATATGA TCCGGATAAA
GCCAATGAAT TGCTTGAGGA AGCAGGATAT GAATGGGATG AAAATGAAGA ATATCGCTAC
AATGAAGATG GTGAACGCTT AAGTATTGTG TGGGAAACCT TAGCAGATAA TGAATGGTCT
GAGCATTTAA CAACCCTTGC CTTAGAACAG TGGCCACAAA TAGGTGTGGA TTTGGAACTA
GAGTCGTATG AATTTAATAC TTTAGTGGAT AGAGTAAATG TAGAAAAACG TGGAGAAGTA
GATATGTGGA ATATGGCATG GAGTTTAGCT ACTGACCCAG ATCCTAGCAA TATATTCAGT
GTTCAATATG CAGATGACGG ATGGAATATG GGCTACTATC ACAATGAAGA GGCAGAAGAG
TTAATGGAGG AAGGGATTAG AACCTTTGAT CAGGATGATC GAGCAGAAGT ATATAATGAA
TTAGCTTTAT TATTAAATAA AGATTTACCA TATATCTTCG TTTACAGCTC CAAAGATTTA
TGGTCTGTTA ACAATAGAGT AGAAAACTTC GAACCTTCAG CTTGGCAAGC ATTTTCGTGG
AACATTCATG AATGGGAGAT TACCGAATAT CAAGAATAG
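
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the record. The following is a minimal sketch; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example only processes the first displayed line rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGTCAAGTC GAAAAATATT TGTTTTGTTT ATTGCTTTTC TCCTTACAAT GGGAGTGGCT"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_percent` gives a value that can be compared with the GC content field of the record.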
 
Protein sequence
MSSRKIFVLF IAFLLTMGVA LSACAPPEEA KDEEVEEKED EEPVVENPAV ERPNELIIGG 
TDLDGIFNPV LYSTAYDAWV IGMMYDTLLT VDENGELTTD QRSIAKDYEI SDDGLEYTFY
LREGWKFHDG VEVTAEDVAF TLEVTAHPDY DGPRASWSDN IVGVDEYRAG ETDELEGVIV
EDDYTLTVKS QEPDAGDIFD YSTYALPKHY YEFDDYEEIH DLTNDPMGSG PFQLVEYSPD
EHAILEPFED YYHGEPELDR IIYEEIETEQ QIPMVETAEA DIVAVSSTPE NYEMLQEEDH
QETITFLDNS YSYIGLNHQN EHLQHQEVRQ ALAYAIDIES FIEGMYGEEL ARPMATPFSP
VSWAYPNEDQ LNFYEYDPDK ANELLEEAGY EWDENEEYRY NEDGERLSIV WETLADNEWS
EHLTTLALEQ WPQIGVDLEL ESYEFNTLVD RVNVEKRGEV DMWNMAWSLA TDPDPSNIFS
VQYADDGWNM GYYHNEEAEE LMEEGIRTFD QDDRAEVYNE LALLLNKDLP YIFVYSSKDL
WSVNNRVENF EPSAWQAFSW NIHEWEITEY QE
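
The protein sequence can be re-derived from the gene sequence by codon translation. The sketch below builds the standard codon assignments, which translation table 11 shares with the standard code; table 11 differs in its set of permitted start codons, which this sketch does not handle (this gene starts with ATG, so that does not matter here). The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTCAAGTC GAAAAATATT TGTTTTGTTT"))  # prints "MSSRKIFVLF"
```

The printed residues match the first block of the protein sequence, and the final nine bases of the gene (`CAAGAATAG`) translate to `QE` followed by a stop codon, consistent with the end of the protein sequence.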