Gene Nther_0469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0469 
Symbol 
ID6315532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp497222 
End bp498343 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content37% 
IMG OID642642853 
Productprotein of unknown function DUF362 
Protein accessionYP_001916653 
Protein GI188585108 
COG category[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA TTAACTCTCA AGCAATATAC ATAAGCTATG GAGACGATCC CAAATCCATG 
GTCAAGGAAC TATTTGATAA GTATCCTCCT TTTCCCGAAC TAAATTATAA TGCAAAAATT
GGTCTAAAGC CAAATCTGGT GCTTCCCAAT TCTTCTGAAT CGGGAGCTAC TACTTCTCCA
GAGCTAGTTA GTGGAATAGT AGAACATTTT TTATCCAGGG GGTTTAAAAA GCTAATCATA
ATGGAGAGCT CATGGGTAGG AGCTAGTACA GAAGCTTCTT TTGAGAAATG TGGTTACACT
GCAATATCTA ATAGGTATAA TCTTCCCTTA GTTGATCTTA AAACAGTTCC AACTAAAGCC
ATTACATTCA AGGATCACAC TATAAAAATC GCAACTCCTC CTTTACAAGT TGACAGATTA
GTAAATGTAC CAGTATTAAA GGCACACTGT CAAACAGATG TAACATGCGC TCTCAAAAAC
CTAAAAGGAT GTCTTCCTGA TAGTGAGAAA CGCAATTTTC ATATACAAGG TCTCGATCAT
TATATAGCTA CTTTAAATCA AATTCTTTTC CCGGATTTAA CTATTGTTGA TGCTATTTTA
GGAGACTTGA CTTTTGAAGA AGGCGGAAAT CCTGTTAATA TGGGTAGGAT ATTAATGGGA
ACTGATCCTG TCTTAATTGA TTCTTACGGA GCTCATTTAC TTGGATATTC TAGTGATAAA
CTGCCTTCAC ATATTGAATT AGCAGGCCAG TCCGGGGTAG GTTCCGTATA TCATCCAGAA
AAACATGAGA TAGTAGAACT AAATCGAAAT AGACGGCAAA TGGACTTCTT TAGGACAAGG
GGACCTTTCT TGAATGAATT GAAAAGCTAT TTGGAAGAGG ATATGGCTTG TTCACCTTGT
ACAGGTTCAG TATACCAAGC ATTAATGCGT TTAAAAGATG AAGGAAGTTT AAAAAAACTT
AAAGTAAAAA TTGCCCTTGG TCAAGGAGTG AATAAATGGA CACGAGATTA TATCGAAGGT
CATTTGGGTG TAGGAGAGTG TACAAGGCAC TGCTCTGACT ATGTTTCTGG ATGTCCTCCT
AGAGCTGATA AGATTTTAAG CATGTTGAAA AATTACCTAT AG
 
Protein sequence
MNKINSQAIY ISYGDDPKSM VKELFDKYPP FPELNYNAKI GLKPNLVLPN SSESGATTSP 
ELVSGIVEHF LSRGFKKLII MESSWVGAST EASFEKCGYT AISNRYNLPL VDLKTVPTKA
ITFKDHTIKI ATPPLQVDRL VNVPVLKAHC QTDVTCALKN LKGCLPDSEK RNFHIQGLDH
YIATLNQILF PDLTIVDAIL GDLTFEEGGN PVNMGRILMG TDPVLIDSYG AHLLGYSSDK
LPSHIELAGQ SGVGSVYHPE KHEIVELNRN RRQMDFFRTR GPFLNELKSY LEEDMACSPC
TGSVYQALMR LKDEGSLKKL KVKIALGQGV NKWTRDYIEG HLGVGECTRH CSDYVSGCPP
RADKILSMLK NYL