Gene Nther_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2002 
Symbol 
ID6314468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2109517 
End bp2110746 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content35% 
IMG OID642644389 
ProductHistidine--tRNA ligase 
Protein accessionYP_001918157 
Protein GI188586612 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis 
TIGRFAM ID[TIGR00443] ATP phosphoribosyltransferase, regulatory subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTCCCAA GAGAGATTCA GCTGTTACAA AAACCATCAG GAGTCAACGA TAACTTACCT 
AATAGAGTAG AAAAGTTTCG ATTATTAGAA AATAAAATTT TAGAAACCTT TAAACTATGG
GGGTATGATG AGGTTTCTGT GCCTTATATC GAATATTCCA ATATATTCTC AGTAAATAAA
GAAAACTTGC TGGAACAACA AATGTTCCAG TTCAATGATG AAACTGGACG TTTGGTTGTA
CTTAGACCTG ATTTTACACC TTCAATTGCT AGATTAGTTG CAACTTACTA TAAAAATCAT
CCCCTGCCAT TGAGATTATG TTATTCTGGT AAAATCTTTC GAGCAAATAA TGGCAATTCA
ATTAATGAAA AAGAACAAAC TCAAGTGGGA GTTGAATTGA TAGGTGGTGA AGCACCGGGT
GGAGATGCCG AATTAGTAGC TATGGCAGCA GAGACATTTC AAAAAATGGA GATAAAGAAT
TTTGTTATCT GTATTGGCAA TCTTAAGTTT ATCAATAATT TATTAGATTG TCTTAAAGTT
GATCAATCCT CTCAAAGTAT GTTAATTGAA GCTTTAAATG AAAATAATTT AGTCCAATAT
CAGGGGATAA TTGATAGTCT GGAATTGGAA TCAGACAGTA AGGATATTCT TAAAAAGTTA
CCAAAATTCA GGGGTAATAA AGAAAATCTT AACTATGTTA GGGAAGTAGC TGGGTTTAAG
CCAGTTATGA ATATTTTAGA TGAACTCGAA AATATCTTTA AAACTCTTGA AAGTTACGGC
TTGCAAGATA AGGTGACATT TGATCTGTCT CTAGTACGAA AATTAGATTA TTATACCGGT
TTTATTATGG AAGGTTATGC CGAAAAAGTT GGCTTTCCAC TTTGTGGAGG TGGCCGATAT
GATAAGCTGA TGGATAAATA CGGTATGAGC TTACCGGCTA CGGGTTTCGC CTTTTCTTTT
GATAGTTTAA TAGATTTAGT GAAAGTTAAA GGTGATGCGT TGTCAAAACT TCATTTTGGA
TATACCCAGG GTAAGCGTGA CCAAGCCTTA GAACATGTAA AAGATTATCG TCAGCGAGGA
TATAGAGTGA GTTTGGAGTT AAATAGCCAG GAATTGGAAG TATCTCGACG CAAAGCGAAA
GAACAGGGAG CTGACCAATT TTATTATATA GGTAATCATG GAATGGTTAA GGAAAATATC
AGCACTAATT CGGAAAGAAA GGATGTCTAG
 
Protein sequence
MVPREIQLLQ KPSGVNDNLP NRVEKFRLLE NKILETFKLW GYDEVSVPYI EYSNIFSVNK 
ENLLEQQMFQ FNDETGRLVV LRPDFTPSIA RLVATYYKNH PLPLRLCYSG KIFRANNGNS
INEKEQTQVG VELIGGEAPG GDAELVAMAA ETFQKMEIKN FVICIGNLKF INNLLDCLKV
DQSSQSMLIE ALNENNLVQY QGIIDSLELE SDSKDILKKL PKFRGNKENL NYVREVAGFK
PVMNILDELE NIFKTLESYG LQDKVTFDLS LVRKLDYYTG FIMEGYAEKV GFPLCGGGRY
DKLMDKYGMS LPATGFAFSF DSLIDLVKVK GDALSKLHFG YTQGKRDQAL EHVKDYRQRG
YRVSLELNSQ ELEVSRRKAK EQGADQFYYI GNHGMVKENI STNSERKDV