Gene Nther_2798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2798 
Symbol 
ID6314465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp3022756 
End bp3024429 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content40% 
IMG OID642645170 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001918934 
Protein GI188587389 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGAAAGC TGTTAATTTT ACTGCTCGTA GTTTTTGTGG CGTCTTTTGG ATTTATGGGT 
TGTGATCCGG CTGATCCTGA AGAAATTGCT GAAGAAGAAG AAAAAGAGCC TGAAGAAGAT
GTTGACGAAG AAGAGGAAGA GCTAGAACAG GCTATTACTC ATGCTATATG GAGTGAACCA
GACGGACTAT TTAATTATTG TGAATATGAA AGTACTTATG ATCTTGATGC TTTTAATCAA
GTATTTGATG GCATGATGGA AGCAGATCCC CATGCAGATT TGGAACTAAA TCCAAATTTA
GCAGAAGAAC ATGAAATAAG TGATGATGGA AAAACTTTTT ATTACAAACT AAGAGATGAT
ATTGAATTCC ATGATGGTGA GCCACTAACT GCAGAAGATG TTAAATTCAC TTTCGAATGG
ATGTGTCATG AAGACTATAT AGGTCCCAGA GCTAGTTACT GGCAGCACCT GGAAGGGTTT
GACGAATATC GAGCTGGAGA AGCTGACGAA GTAGAGGGAA TTGAAATTAT TAGCGACCAT
GAAATAGAAT TCCACTTTGA ACAAGTTGAT GCTTCTGCCG TATATTATGT TAGTACCTGG
GGCATTTCAC CTAAACATGT ATGGGAAGAT ATCCCTGTAG GTGAAAGAAG AGAAGCCCCT
GAAATGACAG AACCAATTGG AACAGGACCT TTTGAATTTG AGGAATACGT TGAAGGTCAA
TACGTAGAAT TAGTTGCTAA TGAAGATTAT CACAGGGGAG AACCAGAATT AGAGAGAATT
ACTGTAGAAG TAAAAAGTCC CGATGTAGTC AGGGCAGACT TGGAGACTGG TGAAGTTGAT
ATCGCTGAAG TTCCACCTGA TGAGAGCGAG TGGGATGATT ATGAAGCTCA CGATGAGTTA
GCCCTTGAAT ATTATCCCAC TAATGGTTAT CAATACATGG GTATGAACTT AAGGGATGAA
AGTATCTTTA GTGATCATGC TGTAAGAGAG GCAGCTACTT ACGCTATCAA TAGGGAAGGT
ATGGTTGATG GTATCCTAGA TGGTTTAGGT AAAGTCCAAA ATTCCCACTT TTCACCTAAC
CAATGGGCTT ATGACGAAGA TTTAGATACT TATCCCCATG ATCCTGATAA AGCCGAAGAA
ATATTAGAAG ATGCTGGCTA TACTAAAAAC GATGATGGAA TCTGGGAACA AAATGGTGAG
CCCTTAGAAT TCACTCTATT ATATCCTACC GGCGACGAAC CCAGAGAACA AGCAGCTTTA
ATCATTGAAC AGGACTTACA GGGAATTGGT ATTGACATTA CTTTAGAATC TTTAGAATTT
GCCACTTTGA GTGACAGGGT TTTTGACGAA ATTGATTTCG ATGCTTATCT TATGGGTTGG
TCTTTAGGTG CAGACCCAGA TCCAACGGGT ATCTGGGGAC CTGATGAAAG ATTCAATGCT
GTTGGATTTG TACATCCGGA AAGCGACGAA CTAATGGAGC AGGGTCTTCG TACTACAGAT
AGGGATGAAC GAAGAGAACA TTATGTAGAA TGGCAGCGCT TACTTCAAGA AGAGATGCCT
TATGTATTCT TGTACGCTGA CTATGAAGGT TATGCCTACA ACAATGATAT CCAGGTATTT
GAACCAAATC CTTGGAATAT TTGGTATGAC GTACATGAAT GGTACCTTGA ATAA
 
Protein sequence
MRKLLILLLV VFVASFGFMG CDPADPEEIA EEEEKEPEED VDEEEEELEQ AITHAIWSEP 
DGLFNYCEYE STYDLDAFNQ VFDGMMEADP HADLELNPNL AEEHEISDDG KTFYYKLRDD
IEFHDGEPLT AEDVKFTFEW MCHEDYIGPR ASYWQHLEGF DEYRAGEADE VEGIEIISDH
EIEFHFEQVD ASAVYYVSTW GISPKHVWED IPVGERREAP EMTEPIGTGP FEFEEYVEGQ
YVELVANEDY HRGEPELERI TVEVKSPDVV RADLETGEVD IAEVPPDESE WDDYEAHDEL
ALEYYPTNGY QYMGMNLRDE SIFSDHAVRE AATYAINREG MVDGILDGLG KVQNSHFSPN
QWAYDEDLDT YPHDPDKAEE ILEDAGYTKN DDGIWEQNGE PLEFTLLYPT GDEPREQAAL
IIEQDLQGIG IDITLESLEF ATLSDRVFDE IDFDAYLMGW SLGADPDPTG IWGPDERFNA
VGFVHPESDE LMEQGLRTTD RDERREHYVE WQRLLQEEMP YVFLYADYEG YAYNNDIQVF
EPNPWNIWYD VHEWYLE