Gene Nther_2790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2790 
Symbol 
ID6314457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp3013544 
End bp3015127 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content39% 
IMG OID642645162 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001918926 
Protein GI188587381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCAAA AACTAAAGAA ACTCGCAATA GTAACTACCT GCTCGGCTTT AATATTGACA 
GCTTGTGGTG GTGAAGTTGA TGAAGAAGCT GGTGGAAAAA ACGATGAGCA AGGCAGTAAG
GAAGAAGCAC TCGAGGTTGA ATATGAGGAT AGATTGACAA TTGGTATGGG TACCGACATG
GTTACTTTTG ATATCCATGA CCATAACAAC ACCTCGACAG AAGCTGTTCA TATCAATATT
TTTGATTATT TGTTTCGACA AGAAGATGGT GAAATACAGT CTGAATTAGT TTCAAAGCAT
GAGATTATAG ATGATGAAAC GTGGAAATTT AAATTAAAGG AAGGCGTAAA ATTTCACAAT
GGAGACGAGT TGACTTCAAA AGATGTCCAA TTCACCTTTC ACAGGGTGAT AGAAGACAGT
TCTTTAACGG AACACTCGAA TTACAATCAA ATTTCAGAAG TGGAAGTTAT TAATGATTAC
GAATTTTATA TTCATACAGA AGATCCTGAA CCGGCCCTAC TTAACCGGAT TTCTCGGATG
GGTTCAGGAA TTTTACCAAA GGACTATATT GAAGAAGAGG GCTGGGATCA CTTTTTAGAC
AGTCCTATTG GCTCAGGTCC CTATGAATTT GTTGAATGGG AACGGGATAA TCGTGTTGTA
CTAAATCCCT TTGAAGACTA TTATGGTGAC TATGATTCTC CTTGGGAAGA AGTGGTTTAC
CGAGTAATAC CTGAAGATTC TACCAGGGTT TCCGAGTTGT TAACAGGGGG AGTAGATTTA
GCGGTGAATG TTCCTCCTAC TGACTGGGAT AGGGTTGATG ATAATCAGGG GACTCTAATA
TCGACAGGTG ACTCCAATAG AGTTATGCTG TTAATCTTAA ATCACGAAGA AGGACGACCT
ACTGAAGAGC AAAAGGTGAG GGAAGCTATT GACTATGCCA TTGATAATGA AGCACTCACT
GAATCTATTC TAGACGGTCA GGGGACTCCA GTCAGAACAA GGGTTACTCC AGGTAACACC
GGTTATAATG AAGAGTTATA CGACGATTAT CGTTATGATC CCGACTATTC TAGAGAACTG
CTTGAAGAAG CTGGATACAG CGATGGTGTT GAACTAAAGT TTCATGCGCC TCAAGGGCGA
TATTTGATGG ATAGCGATGT TTCAGAAATG ATTACAGGTA TGTTAGCAGA AGTGGGAATA
AATGCTGATT TAAACCTGCT TGAATTCAGT CAATTTGCCG ACAAGTATCT GGGAAATGAA
AATGAGGATT TAATGTTCTT AGGATTGTCT AATTCCATGT TTGATGCTGC CCATGCTCTT
AGAAATTTCC ATTCAGAACA AAATACTGAG AGGACCTATT ATGAAAATGA GCGAGTAGAC
GAGCTGTTAG AAAAAGCCGA ATCTAACATG GATTTTGAAG AACGAAAAGA ACAATATCAA
GAAGTTCAAG AAATTGTTGC AGAAGAATTA CCTTATGTTT ACTTGTATCA ACAAAAGGAT
AGTTACGGTA TAAACAATCG AATTGACTTC GAACCCCGTT TGGATGAAAT GATTTACATT
CCGGAAATTG GTAAAACAGA CTAG
 
Protein sequence
MIQKLKKLAI VTTCSALILT ACGGEVDEEA GGKNDEQGSK EEALEVEYED RLTIGMGTDM 
VTFDIHDHNN TSTEAVHINI FDYLFRQEDG EIQSELVSKH EIIDDETWKF KLKEGVKFHN
GDELTSKDVQ FTFHRVIEDS SLTEHSNYNQ ISEVEVINDY EFYIHTEDPE PALLNRISRM
GSGILPKDYI EEEGWDHFLD SPIGSGPYEF VEWERDNRVV LNPFEDYYGD YDSPWEEVVY
RVIPEDSTRV SELLTGGVDL AVNVPPTDWD RVDDNQGTLI STGDSNRVML LILNHEEGRP
TEEQKVREAI DYAIDNEALT ESILDGQGTP VRTRVTPGNT GYNEELYDDY RYDPDYSREL
LEEAGYSDGV ELKFHAPQGR YLMDSDVSEM ITGMLAEVGI NADLNLLEFS QFADKYLGNE
NEDLMFLGLS NSMFDAAHAL RNFHSEQNTE RTYYENERVD ELLEKAESNM DFEERKEQYQ
EVQEIVAEEL PYVYLYQQKD SYGINNRIDF EPRLDEMIYI PEIGKTD