Gene Msed_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1400 
Symbol 
ID5104610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1372257 
End bp1373261 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content48% 
IMG OID640507289 
Productthreonine synthase 
Protein accessionYP_001191482 
Protein GI146304166 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000170642 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGGTTA AATGCATAAA ATGTGGAAGG GAGAGGGAAG GCGTAGAGGT TAGGTGCAAA 
TGCGGTGGAG TCTTCAAGGT AGAGGTGGAC GTCCCATTTT CTAAAAATCT TAGGGAAAAC
TTCCCTTACG TTAAGAGATG GATTTCCCTA GGTGAATGGA ACACCCCGTC CATTAGGGTT
GAAGGCTTTA CATATAAGCT GGATTTCCTT AATCCCACCG GTTCATACAA GGATAGAGGA
TCGGTAACCC TAATTTCTCA CCTTTCTCAG CTTGGGATAA GGGAGATCTC TGAGGACTCA
TCTGGTAATG CAGGCGCCTC TATAGCTGCC TACGGAGCTA TGGCTGGGAT GAAGGTGAAG
GTCTTTGTTC CCTCAACTGC AAGGGGAGGG AAACTGAAAC AAATTGAATC TTACGGGGCT
GAAGTAGTCA GGGTGGAAGG CACAAGAGAT GACGTATCAC GGGCTGCAGA GAACTCTGGA
GCCTATTATG CGTCCCACGT CCTTCAACCT GAGTTTAGGG ACGGAATAAG GTCCTTAGCC
TACGAGATAG CGAGGGATCA TGGATGGAAA TCCCCAGGAG AAGTATTTTT GCCCACGTCA
GCAGGTACTC TACTCCTAGG AGTTTATGAA GGGTTTAGAC ACATGGTCAG CGAGGGAGTC
CTAGATAGGA TGCCCAAGTT AGTTGCAGTT CAAACGGAAC AAGTGAGTCC AGTATGCTCC
AAGTTCCGTG GAATAGAATA TAGACCGCCA AGCAGGGTCA CCTCCATAGC TGATGCCCTT
GTTTCAACGA ATCCCGTACT CATGGATGAA ATGATTAGGG TTCTTCAAGA GACGGGTGAC
TGCGTAGTTG TTAGTGAAAA TGAGATCATG GACTCTTGGA AATACCTTTC CAGGAAAGGG
ATACTTGCCG AGTATAGCTC TGCGGTGGCA CTAGCCGGAG GAAGGAAATA TGAGGTGTCA
GACCCTGTCA TAGTGCTAAC AGGCAATGGA CTTAAGACTT TATAG
 
Protein sequence
MRVKCIKCGR EREGVEVRCK CGGVFKVEVD VPFSKNLREN FPYVKRWISL GEWNTPSIRV 
EGFTYKLDFL NPTGSYKDRG SVTLISHLSQ LGIREISEDS SGNAGASIAA YGAMAGMKVK
VFVPSTARGG KLKQIESYGA EVVRVEGTRD DVSRAAENSG AYYASHVLQP EFRDGIRSLA
YEIARDHGWK SPGEVFLPTS AGTLLLGVYE GFRHMVSEGV LDRMPKLVAV QTEQVSPVCS
KFRGIEYRPP SRVTSIADAL VSTNPVLMDE MIRVLQETGD CVVVSENEIM DSWKYLSRKG
ILAEYSSAVA LAGGRKYEVS DPVIVLTGNG LKTL