Gene Nther_2351 details

Gene Information

Locus tag: Nther_2351
Symbol:
ID: 6316938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 2501579
End bp: 2502820
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 40%
IMG OID: 642644739
Product: thiamine biosynthesis protein ThiI
Protein accession: YP_001918504
Protein GI: 188586959
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0301] Thiamine biosynthesis ATP pyrophosphatase
TIGRFAM ID: [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI
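
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (the record states neither explicitly). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record.
length = gene_length_bp(2501579, 2502820)
print(length)                              # 1242
print(expected_protein_length_aa(length))  # 413
```

Both results agree with the Gene Length and Protein Length fields.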


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAACT ATTTACTGAT TCGTTACGGT GAAATAGGCT TAAAGGGTAA AAACAGATCG 
TATTTTGAAA AGAGCTTGGT AAAAAATATG CAGTCAGCCT TAAAAGATTT AGAGATAGGT
AAGATTAAAT CTACCCAGGG ACGCATGTAT ATACCTCTAA GTGGTAGTAG TGATGAATTG
ACTAGAGTTT TAGACAGGGT GACCAGGGTC TTTGGAATCG AAACCGTCAG CCCTGCCGTA
AAAGTGGAGT CCGATTTGGA AGTCATCAAA AAGACGGCTT TACAAGTCTT TAAAAATCAC
ATGGATAGTA TAAGCCCACA AAATCAGGTT TCTTTTAAAG TGGATTGTCG CCGAGCTGAT
AAATTATTTT CCAAAAATTC CATGGAAATG AATCAGATTT TGGGTGCACA CATACTGGAC
CATGTTCCAG GTCTGAAGGT AGATGTAAAG CAACCACAGA TTCTTCTTCA GGTAGAAATT
AGAGAAGATG GGACCTATAT TTTTACGGAA AAAATTCCAG GACATGGTGG TCTACCTATT
GGTACTACTG GTAAAGGAGT GTTAATGTTA TCTGGAGGTA TAGACAGTCC TGTTGCAGGA
TGGCTTGCCA TGAAAAGAGG GATTCAAGTA GTTGGTCTTC ATTTTCACAG TTACCCCTTT
ACCAGTCAGC GGGCTTTAAA AAAAGTTGAA GATATCTCCC AAGTTCTTTC ACGTTACGGT
ACAGGTCCAA CAGGTGGCTT TAAGTTGATT ACCAATCACT TCACCGATAT CCAAAAAGCC
ATTCAGAATT ACTGCAGTGA AAGCATGTGG GTTACAGTAA TGCGCAGATT CATGTTTTAT
ATAGCTAATC GAATGGCTCA GAAAGAACAA GCAATGACAG TGGTCACGGG TGAAAATGTT
GGGCAGGTAG CTAGCCAAAC CCTTGAAAGT ATGCATGCAG TCAGTCAAGA TGTTGTAAAC
CTTCCCATTT TGCGTCCCCT GGCTGGATTG GATAAAAAAG AGATCATGAG CAAGGCTGAA
ACTATTGGCA CCTACGATAT ATCTATCCGT CCTTATGAGG ACTGCTGTAC ATTGTTTTTA
CCTAAAAATC CTAAAACTCG TCCTAGTCTG GAACAAACTA AGAGGGAAAT AAGCAAACTG
AACTTTGAGG AACTGGTGGA AGAATCCCTG GAGAAAACTG AAATTAAGTA TTTTGAACCT
TATCAGGACA TTACAGAGGA AGAATTGACC TTTGATGTTT AA
 
Protein sequence
MYNYLLIRYG EIGLKGKNRS YFEKSLVKNM QSALKDLEIG KIKSTQGRMY IPLSGSSDEL 
TRVLDRVTRV FGIETVSPAV KVESDLEVIK KTALQVFKNH MDSISPQNQV SFKVDCRRAD
KLFSKNSMEM NQILGAHILD HVPGLKVDVK QPQILLQVEI REDGTYIFTE KIPGHGGLPI
GTTGKGVLML SGGIDSPVAG WLAMKRGIQV VGLHFHSYPF TSQRALKKVE DISQVLSRYG
TGPTGGFKLI TNHFTDIQKA IQNYCSESMW VTVMRRFMFY IANRMAQKEQ AMTVVTGENV
GQVASQTLES MHAVSQDVVN LPILRPLAGL DKKEIMSKAE TIGTYDISIR PYEDCCTLFL
PKNPKTRPSL EQTKREISKL NFEELVEESL EKTEIKYFEP YQDITEEELT FDV
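
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch rather than the database's own method: it assumes the space-separated 10-base blocks are display formatting only, and it uses the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence as listed in this record.
first_line = "ATGTACAACT ATTTACTGAT TCGTTACGGT GAAATAGGCT TAAAGGGTAA AAACAGATCG"
print(translate(first_line))  # MYNYLLIRYGEIGLKGKNRS
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the protein sequence and the GC content field in this record.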