Gene Tneu_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1028 
Symbol 
ID6164998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp919563 
End bp920924 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content66% 
IMG OID641668180 
Producttryptophan synthase subunit beta 
Protein accessionYP_001794405 
Protein GI171185486 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.992034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0454913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTGA GGGCTCCGCA GGAGGAGTGG ATGGTCCCCC GCCGGTGGTA CAACATCGTC 
CCCGATCTGC CCAAGCCCCT GCCGCCGTAT CTTAAGCCTA ACGGCGAGCC GGTGAGGCCG
GAGGAGTTCG AGGTGTTGTT TGCGAGGGAG CTGGTTAGGC AGGAGTTCTC CCAGGAGAGG
TGGATCCCGG TGCCGCCCCC CGTGAGAGAC GTGTACCTCC TCTGGCGGCC GACGCCGCTT
CTGAGGGCGA GGAGGCTGGA GGAGCTCCTC AAGACGCCGG CGAGGATCTA CTACAAGTTC
GAGGGGGTTT CGCCGCCGGG GAGCCACAAG CCGAACACGG CGGTGGCCCA GCTGTACTAC
GTATCGAAGG AGGGGGCGGG GAGGGTCACC ACCGAGACTG GGGCGGGGCA GTGGGGCTCC
TCGGTGGCCT TCGCCGCCTC TCTATTCGGC GTGAAGGCCA CCGTCTACAT GGTCAGGGCG
TCCTACGGCC AGAAGCCCTA CAGGAGGGTC CTCATGGAGC TGTGGGGGGC GGAGGTGGTG
CCCAGCCCCA GCGACAGGAC CGAGGCCGGG AGGAGGTTCC TGGCCGAGGA CCCCAACCAC
CCGGGGTCGC TGGGGATAGC CATCTCGGAG GCTGTGGAAG ACGCCGTGAA GACGGGGGCG
AAGTACGTCC TCGGCTCGGT CCTCAACCAC GTACTTATCC ACCAGACGGT TATAGGCCTC
GAGGCGGCTG AGCAGATCAG GTACTTCGGC GACTACCCAG ACTACGTGGT GGGGGCCTGC
GGAGGCGGCA GCTCCTTCTC GGGCCTCTTC TGGCCTTTCT ACCACGAGAA GAGGGTGGGG
AAGGCCGAGA GGGATGTGAA GTTCGTGGCG GTGGAGCCCG TGGCGGTCCC CACCTTGACC
AGGGGGGAGT ACATATACGA CCTGGGGGAC ACGGCCGGCT TGACGCCCTT GATCAAGATG
TACTCCGTGG GCCACGGCTA CAAGCCGCCT CCCATACACG CGGGTGGTCT GAGGTACCAC
GGCTGCGCGC CCACCCTCTC CCTGCTCGTG GCGGAGGGGG AGGTCTCCGC GGTGGCCTAC
AGGCAGAGGG AGGTTTTCGA GGCGGCGAGG CTGTTCGCCC AGGCGGAGGG GGTGGTGCCT
GCGCCCGAGT CGGCCCACGC CGTGAAAGCC GCCGTGGAGC TGGCTCTGCA GGCCAAGAGG
GAGGGGAGGC CCGTCACGAT CCTCTTCAAC ATGTCGGGCC ACGGCCTCCT GGATCTGGCG
GCTTACGACG AGTACATGAG GGGGGTTCTG CAGGACGTGG AGCCGACTGC GGAGGAGATT
ATGGCGAACA TCGCCAGAGC TAAGGCCCTT CTCCGTCAGT AG
 
Protein sequence
MGVRAPQEEW MVPRRWYNIV PDLPKPLPPY LKPNGEPVRP EEFEVLFARE LVRQEFSQER 
WIPVPPPVRD VYLLWRPTPL LRARRLEELL KTPARIYYKF EGVSPPGSHK PNTAVAQLYY
VSKEGAGRVT TETGAGQWGS SVAFAASLFG VKATVYMVRA SYGQKPYRRV LMELWGAEVV
PSPSDRTEAG RRFLAEDPNH PGSLGIAISE AVEDAVKTGA KYVLGSVLNH VLIHQTVIGL
EAAEQIRYFG DYPDYVVGAC GGGSSFSGLF WPFYHEKRVG KAERDVKFVA VEPVAVPTLT
RGEYIYDLGD TAGLTPLIKM YSVGHGYKPP PIHAGGLRYH GCAPTLSLLV AEGEVSAVAY
RQREVFEAAR LFAQAEGVVP APESAHAVKA AVELALQAKR EGRPVTILFN MSGHGLLDLA
AYDEYMRGVL QDVEPTAEEI MANIARAKAL LRQ