Gene Nther_2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2201 
Symbol 
ID6314873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2337209 
End bp2338912 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content40% 
IMG OID642644589 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001918355 
Protein GI188586810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATG CAGGACAATT TTTTAAGAAA TTTTCAGTAC TCATGGCAGT TTTAATAGTT 
ACTACTGCGT TAGTAATCGG ATGTGGAGAC GGTGGCGATC CAGACGCATC TGATGATTTG
GACCCAGATA GAAAGGTTCC AGAAGTAAGA TTTGTTTCAT CCACTGCTGA TGATAACCAG
ATAAGAAATG AAGCAGTACA GCTAGTTGCT GACTGGTGGG AAGAAATCGG TTTAGAAGTA
GATATTCAAA CCAGAGAGTT TAACTCTCTA GTCAACAGAG TATTGGCTGC CCCCGAAGAT
AAGGATTTCG AAGCATATAT TCTAGGTTGG AGTGGTAGAG TATCAAGATC AGACCCTGAT
ATGTTCCTTT ATTCCTTATA TCATTCCAAT CAAGCTGTAG ATGGCGGTAA CAACAGTAGT
GTTTTCAAAA ATGAAAAGTA TGATGAACTT GTTTCGAAAC AAAGAGCGGA AATGGATCTA
GAAAAGCGAC AAGAACTCGT CTTTGAGGCC CAGGAAGTCC TGGCAGAAGA AGTTCCTGAT
ATCACCCTAT ATTACAGAGA CGAAATTCAA GGTTATAACA ATGAGCGATG GGAAGATCTA
CCAAGTATGG CGGGAGAAGG TATCTTTAAC GAACAGTTCC CCTATGAGGC TACTCCCAAA
ACAGATGATG CAGAGTTTGT AATAGCTAAC TCCGCAAACT TGGATACATT TAACCCCTTT
GCGGCAGAGA CAGTTTACGA ATGGAAATTT TTACGACTTG TTTACGACAA ATTAGTTAGA
TTAGATGAGA ATTTTGAGCC CCAACCCTGG GCAGCAGAAG AAATCAACGT AGTAGAAGGC
GAAGACGGTG AAGTAATAGA TGTAACCTTG AGAGATGATT TACAGTTCCA CGATGGAGAA
CCACTTGGGC CAGAAGATGT TGTCTTTACT TTCGACTACA TGTTTGAAGA AGGCATACCT
TATTTCCAAG CCTTTTTAGA CCCAATTGAA ACAGTAGACC TCATGGACGA TGATGAAACT
ATCAGATTTA CTTTAGAAGA AGCTTATGCA CCCTTTATAA CCAATACTTT AGGTCAAATT
CCTATCTTAC CAGAACACAT CTGGGCTGAT GTTATGGAAG AAGAAAATTT AGATCATCCT
TCTCAGTTCG ACAATGCCGA AGCTATCGGA AGCGGTCCTT TCAAATTTGA CAATTGGGAA
AGAGGCGAGT ACATCAGAAT TGTTAAAAAC GATGATTACT TCAAAGCTGA TGATATCGAT
GTGGAAGCTA TCAGATACGA CAAGTACAGC CATTCTGAAG GAGTTTTCGG TGCCCTGGAG
AACCAACAGG CCGATGTAAA CGAAAACACT TTTGACCCTG AATACGTTCA ACAAGCAGAA
GATCTTGATC ATTTAACAGT AGTTAGAGAA CCCGATATCG GCTTTGACTT CATTGGCCTC
AATAACTACA AAGAGCCTTT TAACGATAAA GCCGTAAGAC AAGCCGCTGC TCATGCCATC
GATTTAGATG AGCTTGTAGA CGTTCTCTTG TACGGTTATG GAGATCCAGC AGGTGCAGGT
CAGACTATTT CAACAGGTAA TGAAATGTGG AAAAATGACG ATGTAAAAGA ATATCCTTTT
GATATCGATA AAGCAAGAGA AATCTTAAAA GATGCCGGCT ATGAATGGGA TAGTGACGGA
AGATTATATT TCCCTGAAGA GTAA
 
Protein sequence
MENAGQFFKK FSVLMAVLIV TTALVIGCGD GGDPDASDDL DPDRKVPEVR FVSSTADDNQ 
IRNEAVQLVA DWWEEIGLEV DIQTREFNSL VNRVLAAPED KDFEAYILGW SGRVSRSDPD
MFLYSLYHSN QAVDGGNNSS VFKNEKYDEL VSKQRAEMDL EKRQELVFEA QEVLAEEVPD
ITLYYRDEIQ GYNNERWEDL PSMAGEGIFN EQFPYEATPK TDDAEFVIAN SANLDTFNPF
AAETVYEWKF LRLVYDKLVR LDENFEPQPW AAEEINVVEG EDGEVIDVTL RDDLQFHDGE
PLGPEDVVFT FDYMFEEGIP YFQAFLDPIE TVDLMDDDET IRFTLEEAYA PFITNTLGQI
PILPEHIWAD VMEEENLDHP SQFDNAEAIG SGPFKFDNWE RGEYIRIVKN DDYFKADDID
VEAIRYDKYS HSEGVFGALE NQQADVNENT FDPEYVQQAE DLDHLTVVRE PDIGFDFIGL
NNYKEPFNDK AVRQAAAHAI DLDELVDVLL YGYGDPAGAG QTISTGNEMW KNDDVKEYPF
DIDKAREILK DAGYEWDSDG RLYFPEE