Gene Nther_0477 details

Gene Information

Locus tag: Nther_0477
Symbol:
ID: 6315540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 507157
End bp: 508878
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 40%
IMG OID: 642642861
Product: extracellular solute-binding protein family 5
Protein accession: YP_001916661
Protein GI: 188585116
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
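
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(507157, 508878)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the record: 1722 bp and 573 aa.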


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGCA GAAAGTTATT TGTATTATTT CTGGCATTGA TGTTTGCTAC TGCTGTAGCC 
ATCGGTGGCT GTGCACCTCC AGAAGAAGCT AAAGAGGAGG ATCCAGAAGC AGAGGAAGAA
GAAGAGGCTC CAGCAGAAGT AGATAATCCT GCTGTAGAAA GACCTAATGA AATTGTAATT
GGAGGTCCAG ATCTAGAAGG AATTTTCAAT CCAGTTTTAT ACAGTGGTGT TTATGACTCC
TGGGTTTTAG GTATGATTTT TGATTCACTA CTTACAGTAG ATGAAAATGG ACAGTTAACA
ACTGATCAGA GATCAGTTGC CAAAGATTAT GAAATCTCAG AAGATGGCAC TGTATACACT
TTCCACTTAC GTGAAGGATG GGAATTTCAC GATGGAGAAG AGATAACTGC CGAAGATGTG
GCCTTTTCTT TAGAAGTTAC TGCTCACCCT GATTACGACG GACCACGCTC CAGCTTTTCC
GACGATATAG TGGGTGTAGA TGAATTTCGA GAAGGAGAAA CTGACGAGCT AGAAGGAATT
ACAGTAGAAG ATGATTATAC CTTGGTAGTT GAAGCTCAAG AACCTAGTGC TGATAACATA
TTTGATTTTG CAGTAATGAT AATGCCAAAA CATTATTATG ATTTTGAAAA CTATGAAGAC
TTCCAGGATT TAACAGACGA CCCAATGGGA AGTGGTCCCT TCGAACTTGT AGAATACAGT
CCGGATCAAC ATGCTATCTT AGAATCCTTT GAAGACTATT ACCATGGTGC ACCAAATGTG
GATCGAATAA TCTACGAAGA AACTGAAACT GAACAACAAA TTCCAATGGT TGAAACAGGA
GAAGCTGATA TTGTCCAGGT AAGTTCAACT CCAGAAAATA TGGAAATGTT AGAAGATATA
GACCATCAGG AAGCATTAAC TTTCTTTGAT AATGCCTATA CTTATATTGG TTTGAACCAC
GAAGATGAAC ACTTGCAGCA TCAAGAGGTT AGGCAGGCAT TAGCTTATGC CCTTGATATC
GAAGCTTTCC TTGATGGAAT GTTTGGAGAT GAATTGGCTC AACCGATATA CACTCCGTTT
TCACCCGTAT CATGGGCATA TCCCGATGAA GACGATATGA ACGATTATGC TTATGACCCT
GATAAGGCCA ATGAACTACT TGATGAAGCG GGTTATGAGT GGGATGACAA TGAAGAGTAT
CGAGTAAATG AAGACGGAGA GAGACTATCC TTTACATGGG AAACCATTGC TGACAACGAA
TGGTCCGAAC ATTTAACTAC TCTAGCCTTA GAACAATGGC CACAAATAGG AGTAGAACTA
GAAATCGAAA ACTACGATTT CAATACCTTG ACCGACAGAA TTGATCAGGA CCGCGGAGAT
GTAGATATGT GGAATATGGG TTGGTCACTA TCAGCCGAAC CCGATCCATC TAATATCTTT
AGTGTAGAAT ATGCCGATGC TGGCTTAAAC TACGGAATGT ATCACAACGA AGAAGCAGAA
GAATTGATGC AAGAAGGTCT TGAAACCTTT GATCAAGATG AAAGAGCAGA AGTTTACAAT
GAACTAGGTG TACTATTTAA TGAAGATCTA CCCTATATCT TTGTCTACAG TAACAAAGAG
ATTTGGTCCA CAAATGACCG AGTTGAAAAC TATGAACCAA CTGCATTCCA ACACTTGACA
TGGAATATCC ACGAGTGGGA AGTAACAGAC TACGAAGAAT AA
 
Protein sequence
MSSRKLFVLF LALMFATAVA IGGCAPPEEA KEEDPEAEEE EEAPAEVDNP AVERPNEIVI 
GGPDLEGIFN PVLYSGVYDS WVLGMIFDSL LTVDENGQLT TDQRSVAKDY EISEDGTVYT
FHLREGWEFH DGEEITAEDV AFSLEVTAHP DYDGPRSSFS DDIVGVDEFR EGETDELEGI
TVEDDYTLVV EAQEPSADNI FDFAVMIMPK HYYDFENYED FQDLTDDPMG SGPFELVEYS
PDQHAILESF EDYYHGAPNV DRIIYEETET EQQIPMVETG EADIVQVSST PENMEMLEDI
DHQEALTFFD NAYTYIGLNH EDEHLQHQEV RQALAYALDI EAFLDGMFGD ELAQPIYTPF
SPVSWAYPDE DDMNDYAYDP DKANELLDEA GYEWDDNEEY RVNEDGERLS FTWETIADNE
WSEHLTTLAL EQWPQIGVEL EIENYDFNTL TDRIDQDRGD VDMWNMGWSL SAEPDPSNIF
SVEYADAGLN YGMYHNEEAE ELMQEGLETF DQDERAEVYN ELGVLFNEDL PYIFVYSNKE
IWSTNDRVEN YEPTAFQHLT WNIHEWEVTD YEE
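
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, not in these assignments). The helper name `translate` is hypothetical, and only the first 60 nucleotides and the final codon of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGAGTAGCA GAAAGTTATT TGTATTATTT CTGGCATTGA TGTTTGCTAC TGCTGTAGCC"
)
print(translate(first_line))  # MSSRKLFVLFLALMFATAVA
print(translate("TAA"))       # *
```

The 20 residues produced from the first line match the start of the protein sequence listed in the record, and the final codon TAA translates to a stop.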