Gene Msed_1460 details

Gene Information

Locus tag: Msed_1460
Symbol:
ID: 5104830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1429812
End bp: 1431572
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 46%
IMG OID: 640507348
Product: thermopsin
Protein accession: YP_001191541
Protein GI: 146304225
COG category:
COG ID:
TIGRFAM ID:
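
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1429812
END_BP = 1431572
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
length_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(length_ok, protein_ok)
```

Under these assumptions both checks agree with the record: 1431572 - 1429812 + 1 = 1761 bp, and 1761 / 3 - 1 = 586 aa.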


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTGT ATGAGTCTAA CACCTCTACC ACCCTTAGCG CAGGTCAATA CGAGTATTTC 
CCGCTCAACG TTAACACAAC AGAGATACTT TCCTACTCCT GGAATTCAAC AGGTAGTGTT
GCAGTGATGG TAATGAACCA GACCCAACTT CAAGAATTTC TCAATGGTAC AGGAAGCCCC
TACAAGGGGT TGGTTATCCT GAATTCATCG TTCAGCAATC AAGTTCTGCT AACTCCCGGT
AAGTATTACT TTGTGCTCTA CGCCTACCTA CAACAGGTAA CCCTGCAGTA CAGCCTGAAA
CTGGTTCCAG CTCAGGTGTC CTATACCCTT CCGGTGGGGT ATCAGGAAAA TTATCAGTTG
AACCTAAGCT ATCCCTTTCA TCTTTACCTC TATCTAGTTT CCAATAACTC GTTCTCTGTG
AGGGTCACTT CAGGCAACGT TACTTACTTC AGTGCAGCTC CCTCCAGGGA TACTCCTCTT
ACTTTCGTCA ATCACACGTT AACCCTCAGC CCAGGCAACT ATTCCATTAC TGTGGTTAAT
CCTGGATCCT CAACGATAGC CGTTTATTCC TCAGTGCTGT ACGCCTCCAC TTATCCAGAT
CCCTTATCCT TGAATAGGAC AGATTACCCT ATGGGAGTGG CAAGTTATGG CCTATTTAAC
AGGTCAGGGG TACCAGTCCC GTACGTGGTC AAGGCATCCT CCGTAGTGGG GTTTGCAAAT
ATTTCCTCTA TCTTTGCTTA TAATCAGACT GCTGAGAAAC TGAATGTCTC CCCCTATTCG
GCGAGTTTAC AGTTAAATGT CCCACTGGTC GTGATAAACG GAAAGCAGAA CCAGACTTAC
TGGGTACAGA ACGTGATAGT TTTCATGACT AATGAGTCGA CCCTATGCTA CGAGTCCTCC
GTCTTAAACG TGACTAACGC GAATGCCACC TTAACAAACA TCTCTATACA AGGGAGAGGC
GGAGTGTATC CGCCCTTCAA TAATGGAATA TACTATACCT ACAAGACTAA GGGAGTGCAG
TACAAGACGC CCCTATCTCT CTTGATCTCC ATAAATGTAT CAGTGATAAA GAAACTTGGA
GTGAGAATAG GCTTCGATTA TAAGGTGCTT GAGAACGGTT CGGTAGTTAA CGGTAGCTGG
AACCAGTTTG ATTCTCCTCT CATTCTAGAC TCGGGAGTGT CACAGGCCTA TCTTTACGTG
GATGGATATA ACTCCCCATC TACCCTGAAT TTCTATGATG CTGAACTAGT TTTTGGAGGC
GGAGGAAACG GAGAGGTGGC ATATTTCCAG AACCTTTCGG CTACCCTTGC CATCTTCTAC
TATAACGGAT CTCTTCATCC CTTCCCAAGT GTGTATAGCT TCGGCGCAGA TACTGCCGAG
GGCACGAGCG ACTTACACGT GTCATTAATG AATGGTCTGG TTTCCGTTTC TAAGGGTCAG
GACAACCCAG TCTTTCTCAC GAACCAGTTT AATGCGTCCA TACCGGTATT GCGAGTTGTC
GTTAATCATG TTCAAAACAA GAGCTCTGTG TCTAACGTTA CCACTACTAC AACTCATACG
AACACATCAA CTTCCACCAG TTCCAATGTT ACCAAGAATA CTGTACCTCC ACCAAGTAAC
ACTAGCCAGA CTTCTAGCGC CCCTACGAAG AAGGGTGGGC TTCCCCCTTA CCTTCTACCT
GGGCTAGTGA TTGCCGTAAT CGTCGTGATA GTAATATGGG TGCTCATTAA CAGGTTCAGA
AAGCCCGACC TGAATATATG A
 
Protein sequence
MNLYESNTST TLSAGQYEYF PLNVNTTEIL SYSWNSTGSV AVMVMNQTQL QEFLNGTGSP 
YKGLVILNSS FSNQVLLTPG KYYFVLYAYL QQVTLQYSLK LVPAQVSYTL PVGYQENYQL
NLSYPFHLYL YLVSNNSFSV RVTSGNVTYF SAAPSRDTPL TFVNHTLTLS PGNYSITVVN
PGSSTIAVYS SVLYASTYPD PLSLNRTDYP MGVASYGLFN RSGVPVPYVV KASSVVGFAN
ISSIFAYNQT AEKLNVSPYS ASLQLNVPLV VINGKQNQTY WVQNVIVFMT NESTLCYESS
VLNVTNANAT LTNISIQGRG GVYPPFNNGI YYTYKTKGVQ YKTPLSLLIS INVSVIKKLG
VRIGFDYKVL ENGSVVNGSW NQFDSPLILD SGVSQAYLYV DGYNSPSTLN FYDAELVFGG
GGNGEVAYFQ NLSATLAIFY YNGSLHPFPS VYSFGADTAE GTSDLHVSLM NGLVSVSKGQ
DNPVFLTNQF NASIPVLRVV VNHVQNKSSV SNVTTTTTHT NTSTSTSSNV TKNTVPPPSN
TSQTSSAPTK KGGLPPYLLP GLVIAVIVVI VIWVLINRFR KPDLNI
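
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be normalised before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the gene sequence is used as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record as sample input.
first_line = "ATGAACTTGT ATGAGTCTAA CACCTCTACC ACCCTTAGCG CAGGTCAATA CGAGTATTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field.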