Gene Nther_0169 details


Gene Information

Locus tag: Nther_0169
Symbol:
ID: 6316551
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 193231
End bp: 194430
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 38%
IMG OID: 642642547
Product: transposase IS111A/IS1328/IS1533
Protein accession: YP_001916356
Protein GI: 188584811
COG category:
COG ID:
TIGRFAM ID:
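
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon that is not counted in the protein. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(193231, 194430)
print(length)                  # 1200
print(protein_length(length))  # 399
```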


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00419214
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACTTTG TTGGGATTGA TTGGGCTGAT ACAAAACATG ATATCCTGGT CATGAGTGGC 
GATGGTAGAG AACTAGATAA CTTCACTATT CAACATTCTA AAGATGGTTT TGAAACTCTA
AAAAACAAGC TATTAAAACA TGATGACGAT CCTGAAAACT TCTATTGCTT AATTGAAACT
AAACATGGAC TTTTAACCCA ATATCTTTTA GAAAATAACT TCACTGTTTA TTCTGTTAAC
CCCAAACTAG TTGATGCTAG AAGAAAAGCT TCTGGGGCTA AAACTGACTT TATTGATGCT
AAAATACTAG CTAATATGGG TAGATCAGAG CTCCATGACT TACATAAGCT AGAGCCTGAT
TCTGAACACA TCCAAGAACT TAAAGTACTC ACCAGAGATC AAGAAGCTCT TATACAAGAA
AGTGCTAGGT TAACAAATAG ACTGATTTCA ACCCTGAAAG AATATTACCC TGTTGCTCTT
GAATTGTTTT CTAAAATAAC TCTACCTGTT TCTCTAGCTT TTTTAAGGAA ATATCCTACT
CCAAAACAGG CTCGAAAAGC TAGTAGAGAT GAGATCTTTA AGTTTTTAAA AAAGCAAAAA
CATCCTAACC CTACGTCTAA AGCTAATGAG ATCTTCACAA AGCTTCAAAA ACCTAATTTA
GAAGGAAACA GAGCCATTTG TTCTGCCAAG TCTAAGTTTT TATTTACTAT CCTAGATCAG
CTAGAGCCTT TATTAGAACA TATTGATGAG TACGACAAGG AAATCGAGAA ACTTTTTAAG
TCCCACTCTG ACAGTAAAAT TTTCGACAGC ATACCAGGTG CCGGTAAGCG AATTGCACCG
AGGCTGCTGG CAGAGTGGGG AGACGATCGC AGCCGTTATG CTGACGCCTC GGTGGTACAG
GCCCTTGCGG GAACTTCACC AGTACTCCAT CAAAGTGGCA AAATGCGTAT TGTAAAAAGG
CGACACTCTT GTATTAAGCC TTTTCGAAAC GCTTTACATC AGTTCGCTCT ACAAACTACA
AGGTGGATCC CCTGGGCCAA AGACTATTAC TACAAAAAGC GCAAAGAAGG TAAACAGCAT
CATGAAGCTG TAAGGACTCT AGCTAATATT TGGGTTAGGA TACTCTTTGC TATGTGGGTA
AACAAAGAGC CCTACAACGA AAGCAAGTTC ATAAAAGCTA GAGAAAAACA CGCTGCTTAA
 
Protein sequence
MYFVGIDWAD TKHDILVMSG DGRELDNFTI QHSKDGFETL KNKLLKHDDD PENFYCLIET 
KHGLLTQYLL ENNFTVYSVN PKLVDARRKA SGAKTDFIDA KILANMGRSE LHDLHKLEPD
SEHIQELKVL TRDQEALIQE SARLTNRLIS TLKEYYPVAL ELFSKITLPV SLAFLRKYPT
PKQARKASRD EIFKFLKKQK HPNPTSKANE IFTKLQKPNL EGNRAICSAK SKFLFTILDQ
LEPLLEHIDE YDKEIEKLFK SHSDSKIFDS IPGAGKRIAP RLLAEWGDDR SRYADASVVQ
ALAGTSPVLH QSGKMRIVKR RHSCIKPFRN ALHQFALQTT RWIPWAKDYY YKKRKEGKQH
HEAVRTLANI WVRILFAMWV NKEPYNESKF IKAREKHAA
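
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its codons translated. The helper names are hypothetical. The codon table is the standard amino-acid assignment that, to the best of our knowledge, NCBI translation tables 1 and 11 share; alternative start codons allowed by table 11 are not handled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# The first 15 bases of the gene sequence above, as displayed.
first_bases = clean_sequence("ATGTACTTTG TTGGG")
print(translate(first_bases))  # MYFVG
```

Applying `clean_sequence` to the full gene sequence and then `translate(...).rstrip("*")` should give a string that can be compared with the cleaned protein sequence, and `gc_percent` can be checked against the GC content field.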