Gene Nther_1045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1045 
Symbol 
ID6314226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1110022 
End bp1111092 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content34% 
IMG OID642643417 
Producthypothetical protein 
Protein accessionYP_001917217 
Protein GI188585672 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0229946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCAGTAA GAAGGTCATT TCAATTATTA TTGATCACAT GCTTGTCTGT TTCAGTAACT 
TTTATGACAG GTTGCACACC AGAGTACGAA GTGGAGATAC AAGCTGATCC TGAAGAAGCA
GGAGAAATTG AAGGTGAAGG CACTTACGAA GAAGGAGAAG AGGTTACAGT TGAAGCTGAA
CCTAATGAAG GTTATGAATT TAAAAAATGG GAAAAAGAAG GTGAAGAAGT AAGTCAAGAT
CAAGAATATA AATTTGAAAT TGAAGAAAGT ATAGAATTAG TTGCTGAATT TGCAGAAACT
GTAAACATTC CTGATGAAAA CTTAGAAGCA GCTATTAAGG AAGAGTTAGG TGTAGATCAA
GTGACAAAAG AAAATATTAA ACAGTTAACA TCTTTGGAGG CGAGAAGAGA AGGGATTAGC
GATCTGATTA ACTTGGGAAA GGCAGAAAAT CTTAAAAACT TAAACCTTTC GGGCAATAAA
ATTCAAGATA TAACCGCTTT AACTGAACTT ACGGGACTAG AGAAGTTAAA CTTAAACAAT
AATGAGATAA CAGATATTAA AGCACTACAT GAATTGACTA ATCTTAAAGA AGTCAACCTT
ATAGGAAATG AAATCGATGA AATAAACTTT TTAGGAGAAT TAAATGATCT CAAAAAACTT
TCTGTAAGGG ATAATGAGAT GAATTTGACA TTGGTTGATT TTGACCAATC TCCTGGAGAT
GACTATATAG CTTATAGGGT AAAAAGCCCA TTACCCTTAA CTGTCGATGT TCAGGTAGTA
AACATAAATG AAGACTTAAG CGTTGAACAG GTATTTGAAC CAAATAGTGT TACTGATCGG
GGCGAGCTTT ATAGTTTTAC TGAAGCGTCT TCCTATGAAG ACCCATTTGT TCAGCATGTT
CCCAAAAAAC ATCCATTTGA CGTTTATAAA GACCATGAAT GGAAAAATAC ACAAGTTTTA
AAGGTTGTTG TTCGTGAAGA ACATGAAGAG GGTAAGGATG GAGAGCACAA GTTACCTGGA
GAGTATAAAA TCGATTTTAA AAACAGAGAG ATTATTGAAA TTAATAATTA A
 
Protein sequence
MSVRRSFQLL LITCLSVSVT FMTGCTPEYE VEIQADPEEA GEIEGEGTYE EGEEVTVEAE 
PNEGYEFKKW EKEGEEVSQD QEYKFEIEES IELVAEFAET VNIPDENLEA AIKEELGVDQ
VTKENIKQLT SLEARREGIS DLINLGKAEN LKNLNLSGNK IQDITALTEL TGLEKLNLNN
NEITDIKALH ELTNLKEVNL IGNEIDEINF LGELNDLKKL SVRDNEMNLT LVDFDQSPGD
DYIAYRVKSP LPLTVDVQVV NINEDLSVEQ VFEPNSVTDR GELYSFTEAS SYEDPFVQHV
PKKHPFDVYK DHEWKNTQVL KVVVREEHEE GKDGEHKLPG EYKIDFKNRE IIEINN