Gene Msed_1691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1691 
Symbol 
ID5105337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1629255 
End bp1630529 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content47% 
IMG OID640507585 
Producttryptophan synthase subunit beta 
Protein accessionYP_001191770 
Protein GI146304454 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.248453 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATA GTGAGGAAAT AATACCAAGC TATTGGTATA ACATAATACC CGATTTACCC 
AAACCTTTAC CTCCTCCTAG GGATCCCCCA GACGCGGAGT TCTCTAGGAT TGACCTTTTG
AGAAAAATAA TGCCCAGTGA GGTTCTAAGA CAGCAGTTTA CAGTGGAGAG GTTTGTTCCA
ATTCCTAAAG AAGTAAGGGA GGCTTACATC AATATAGGCA GACCTACGCC TTTAGTCAGA
GCTAGAAGGC TTGAGGAATT CCTGAACACT CCGGCTAAAA TATATTTCAA GTATGAAGGG
GCAACTCCCA CTGGTTCCCA TAAGATAAAC ACAGCCCTAC CGCAGGCTTA CTTTTCCATG
AAGGAGGGAG TGGACCACCT GGTCACTGAG ACTGGGGCTG GCCAATGGGG AACAGCTGTA
GCTCTATCTG CGAGAATGTA TGGACTAAAT TCCACCATTT TCATGGTAAA GGTGAGTTAC
GAACAGAAAC CGCAAAGGAG AACCATCATG CAATTGTATG GCGCACGAGT TTTCGCCAGC
CCTACCTCGC ATACCGAGTA CGGCAAGAAG GTTCTAAACG AGAACCCGAA TCATCCAGGT
TCCCTCGGAA TAGCCATGAG CGAGGCAATA GAGTACGCTC TCTCCAACGG CTATAAGTAT
CTCGTCGGGA GCGTGTTAGA TGTTGTGGTG TTACATCAAA GCGTAATAGG GCTTGAGGCT
ATGAAGCAGT TGCAGGAACT AGACGAGGAA CCAGACGTGT TAGTGGGCTG TGTGGGAGGA
GGGAGTAACT TTGGAGGCTT TACCTTCCCG TTCATCGGGT CCAAGAAGGG ATCGAAGTAC
ATTGCAGTTG GATCTTACGA GATACCAAAG TTCAGTAAGG GGGCCTATAA CTACGATTTC
CCAGACAGTG CTGGGCTGTT ACCCCTTGTC AAGATGATTA CCCTAGGAAG GGATTACGTA
CCTCCACCAA TATATTCGGG TGGGCTCAGA TATCACGGGG CCGCGCCCTC CCTAAGTATG
TTGATCAAGG AAGGGATAGT GGATTGGAGG GAGTACAATG AGAAGGAGAT ATTCGAAGCA
GCTCAAATAT TCCTTCAAAC CCAAGGAATA GTTCCAGCAC CAGAATCCTC TCATGCAATT
AGGGCAGTGA TAGAGGAGGC ACGCGAGGCC AAGATCAAGA ACGAGAAAAA GGTGATCGTA
TTCAACCTGA GTGGTCATGG ATTACTGGAT CTACCTAACT ACGAGTCTAT GATGAAGAGG
ATTGGTCAAG ATTGA
 
Protein sequence
MANSEEIIPS YWYNIIPDLP KPLPPPRDPP DAEFSRIDLL RKIMPSEVLR QQFTVERFVP 
IPKEVREAYI NIGRPTPLVR ARRLEEFLNT PAKIYFKYEG ATPTGSHKIN TALPQAYFSM
KEGVDHLVTE TGAGQWGTAV ALSARMYGLN STIFMVKVSY EQKPQRRTIM QLYGARVFAS
PTSHTEYGKK VLNENPNHPG SLGIAMSEAI EYALSNGYKY LVGSVLDVVV LHQSVIGLEA
MKQLQELDEE PDVLVGCVGG GSNFGGFTFP FIGSKKGSKY IAVGSYEIPK FSKGAYNYDF
PDSAGLLPLV KMITLGRDYV PPPIYSGGLR YHGAAPSLSM LIKEGIVDWR EYNEKEIFEA
AQIFLQTQGI VPAPESSHAI RAVIEEAREA KIKNEKKVIV FNLSGHGLLD LPNYESMMKR
IGQD