Gene Nther_2405 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2405 
Symbol 
ID6314596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2570043 
End bp2571641 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content41% 
IMG OID642644793 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001918558 
Protein GI188587013 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000184044 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAAAT TTAACAAAAA GCCATTTTGG CTAGCAATTG TGCTATCTGG TCTTTTGGTG 
TTCACCTTGA CAGGCTGTGG AGAGCCTGAT GTTGAGGACG AAGCCGAAGC ACCAGAAGGA
GAAGAAGAGG AAGAAATGGA AGAACGACTT GAAGAAGATC ACTTGGTAGT TGCCCAGGGG
GCAGATGCAC CAACACTAGA TCCTATTGGT GAAAACGATC AACCTTCTGC AAGAATTACA
GAGCAAATTT TTGACACTCT TGTGGAGCAG GACGAGAATA TGGAGGTTCA GCCAGGACTG
GCTGAAGACT GGGAACAGAT CGATGATACT ACTTACGAAT TCTATCTTCG CGAAGGCGTT
AAATTTCACA ATGGTGAAGA GCTAACAGCT GAAGATGTTA AATATTCATA TCAAAGATTG
TTAGATGAAG ACGAAGCTTC CCCGGGAGCT TTTATTTTAG AAATGGTTGA TGTTGATAAT
ATTGAAATTG TAGATGATTA TACCGTGCAA ATTCCTCTAG AAGAACCTTT TGCACCAATT
CTTTATCATC TTGGTCACTC AGTAACTGCC ATTGTCAATG AAGATGCAGT TGAAGAACAT
GGCGATGACT TTGGACAGAA TCCTGTAGGA ACTGGGCCAT TTAAATTTGA CGATTGGGAT
ATTGGAAACA GAATTGACCT AGTTAGTTTT GATGACCACT GGAGAGGAGA AGCTGGTGTA
GAGCAGCTTT CTTTCAGAAA CATTGAAGAG GACACAAACA GAACTATTGA ATTAGAAACA
GGTGGTGCTG ATATTATTTA CGATGTAGCA CCAACAGATC TTGAAAGAGT AGAAGATCAT
GAAGAATTAA CTCTTTTAAG AGAACCTAAC TTGTCAACTG AGTATATTGG CTTTAATATT
GACAAAGAAC CCTTTGATGA TGAAAGGGTA CGACAAGCAA TTAACTATGC ATTAGATATG
GAACCAATTG TCGAAGGAGT ATATTATGGA CTGGGAGAAC CTGCTAGAAG TCCCCTTGCT
CCAGCAGTAG TGCACAATAA TCAAGATGTA AAATCTTATG AGCAGGATAT GGAGAGAGCT
GAAGAACTAC TGGCTGAAGC TGGCTATGAA GACGGATTTG AAGCAGAAAT CTGGACTAAT
GACCAGCAGC AGCGTCAGGA CATCGCTGAA ATGGTACAAG GCCAGTTGTC TCAACTGGGT
ATTGACCTAA ACATCAGCAT TCGCGAGTGG GGAACTTATC TTGAAGAAAC TGCCCAAGGA
GAGCATGATA TGTTCATACT TGGCTGGGTT TCAGTAACCG GTGATGCCGA TTACGGATTA
TACTCCCTAT TCCATGGTGA TGAGCATGGA GCAGCCGGAA ACAGAACTTT CTATGATAAT
GACAGAGTAG ACGAACTTCT AGACGAAGGT CGTAGAACTT TCGATGAAGA TGAGCGTGCT
GAGATCTATG CGGAAATTCA AGAAATAGTT ACTGAAGAAG CTCCTTGGAT CTTTACTCAA
GTGGGTGAAG AAGCAGTTGG TACTAGAGAT TTTGTAGAGA ACTTTACTAT TAATCCTGCA
GGCCACCATG ACCTGTTTGA GGTAACTATT GCAGATTAA
 
Protein sequence
MKKFNKKPFW LAIVLSGLLV FTLTGCGEPD VEDEAEAPEG EEEEEMEERL EEDHLVVAQG 
ADAPTLDPIG ENDQPSARIT EQIFDTLVEQ DENMEVQPGL AEDWEQIDDT TYEFYLREGV
KFHNGEELTA EDVKYSYQRL LDEDEASPGA FILEMVDVDN IEIVDDYTVQ IPLEEPFAPI
LYHLGHSVTA IVNEDAVEEH GDDFGQNPVG TGPFKFDDWD IGNRIDLVSF DDHWRGEAGV
EQLSFRNIEE DTNRTIELET GGADIIYDVA PTDLERVEDH EELTLLREPN LSTEYIGFNI
DKEPFDDERV RQAINYALDM EPIVEGVYYG LGEPARSPLA PAVVHNNQDV KSYEQDMERA
EELLAEAGYE DGFEAEIWTN DQQQRQDIAE MVQGQLSQLG IDLNISIREW GTYLEETAQG
EHDMFILGWV SVTGDADYGL YSLFHGDEHG AAGNRTFYDN DRVDELLDEG RRTFDEDERA
EIYAEIQEIV TEEAPWIFTQ VGEEAVGTRD FVENFTINPA GHHDLFEVTI AD