Gene Nther_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2066 
Symbol 
ID6316050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2183987 
End bp2185018 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content38% 
IMG OID642644454 
ProductPyridoxal-5'-phosphate-dependent protein beta subunit 
Protein accessionYP_001918221 
Protein GI188586676 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000391247 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.00289425 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATTTTC TTTGTCAAAA TTGCGGTAAA ATGTATGAAA TAACTAGTTT AAATTACACT 
TGTTCTTGTG GGGGTTTGTT TGACTTTCAA GGGACTTTTA TTGGTGAAAT GAAAGAAATG
GTTTCTCTGG GGGAACCAGT AACTCCCATA ATAACTAAGT CTTTTAATAA TTATCCCTTG
AAACTAAAAC TTGATCACTT AATGCCTACT GGTTCATTTA AAGATCGAGG GGCTAAGGTA
CTTATAAGTG CATTGAAAAA GTTGGGTATA GAGGAAGTAG TAGAAGACTC GTCAGGTAAT
GCGGGAGCTG CTATTGCGGC TTACTGTGCT GCTGCCGGGA TTGATTGCCA AATTTATGTT
CCTGAGTCTG CTTCGGGCAG CAAATTAAAG CAAATTCAGG CCTATGGTGC TGAGGTAGTT
AAAGTACCTG GGGATAGAGA TGATACGTCA AGGGCAGTAA AAAAAGCATG TAAAAACAAT
TATTATGCAT CGCATATATA TAATCCTTTA TTTTTTGAAG GGACTAAAAC AATAGCCCAT
GAAATTTATC AGCAAATAGG TATACCAAAA ACAATGGTGA TTCCTGCAGG AAATGGTACT
TTATTGCTTG GTGTATATAA AGGATTTTAT GAATTAGGGA ATCTTCCTAA AATTATTGCT
GTACAAAGTG AACGTTGTGC CCCATTAACT CAAGAAAGCG ATGATACAAC TGTTTCAGCT
AGTTCGAATC CGACAGGTGA CACTATAGCC AAGGGAATTG CGGCTAAAAA CCCACCAAGA
AAACTGGAGA TGCTTAAGGC TATTTCAGAT AGTAGGGGGC AAGTACTCAC TGTATCTGAA
GATGAGATAA AATCCTCTAG ACAAGCTCTC TGGAATAAGG GTATTTTTGT TGAAACAACA
GCTGCCGTTG CAGTGGCTGG TGCTATAAAG TTATTTGATG CAACTTCATC TTTTGAAAGT
ACTGGCACTT TAAAAACTGA TATGCAAGAT GTATTGGTGC CATTAACTGG TACTGGATTA
AAAGAGTTTT AA
 
Protein sequence
MNFLCQNCGK MYEITSLNYT CSCGGLFDFQ GTFIGEMKEM VSLGEPVTPI ITKSFNNYPL 
KLKLDHLMPT GSFKDRGAKV LISALKKLGI EEVVEDSSGN AGAAIAAYCA AAGIDCQIYV
PESASGSKLK QIQAYGAEVV KVPGDRDDTS RAVKKACKNN YYASHIYNPL FFEGTKTIAH
EIYQQIGIPK TMVIPAGNGT LLLGVYKGFY ELGNLPKIIA VQSERCAPLT QESDDTTVSA
SSNPTGDTIA KGIAAKNPPR KLEMLKAISD SRGQVLTVSE DEIKSSRQAL WNKGIFVETT
AAVAVAGAIK LFDATSSFES TGTLKTDMQD VLVPLTGTGL KEF