Gene Msed_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1231 
Symbol 
ID5103845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1207751 
End bp1210525 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content46% 
IMG OID640507123 
Productpeptidase A5, thermopsin 
Protein accessionYP_001191316 
Protein GI146304000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATTA CGGATTATGG CGTAGGGCCT AACGGTTTTT ACTCTTATGA CACTACCCAA 
TTCTTAGGGA CAGTGTACAT AAACAGTTTA CAAGCGTTGA CTTTTACTAG TGGAAGTACT
GCGGTAACTT TTCAGCTCAA TGTCATGCTT AACTATGAGG CAAGAGGATC GTCTTACGCT
CTTTGGGTTC AGGACGTCGC ATGTTTTAAT ACTGTAAACC ATGAAATATA TTTTATTGAT
AACATTTGGA ATTCCACTGT ACCATTTGGT AATGTGACCG GTCTTCAAGG AAATGGTCAG
TATGCTGTAT CCTCATCAAA TGGAAATTAT CCCTCGCAAG TTTTTTATGG CGACGTGTCC
CACGATCCAG GATCAATTGT TACACAATCA CTTCCAACGG CTTTTGACCT TCTAGTAAAT
GTAAGTACCA ATAACCTAGG TCAGCCTGTT ATCTATTTCT GGTACAATGA TGGTTATGGA
TGGATAAACT ATGATACCGT TACAGTGACG AATGTTCAAG GTGCTACCAA CGTTTATTTT
GAGGTAAATG GATATGCCCT TACAGGGAAT GGCAATAATT ACGATGCCGA GCTTGTTCTC
GCCGGACCAG GTGGAGGTTC AAACACTTAT ATTAACCAAG CAAACGTTGA TTTACTTTTA
TTTTACTGGA ACGGACATAA TTTCCAGGAG GTTGCAAATA CATACAACTA TGGTTCAGAT
ACTGCCGAGA CATCGTCCAA TGTTAAGGTA CTTTATTATT ATTATCCATA TTCTGGCTTA
CCCACGGGAT TTATAACCAC CGGGCCTGGG GCTCTTGGTC CATTGTGGAA CAGTACTGAC
GTCACTTGCC TAACTGTAAA CACCGGGTTA AACCAGGGAT ACGTTCAGAT CTACAATGAT
AGTTATTCCT ACCAGTACGC CGAGCAAGTA AATAACCTCT GTTTGTTCCA AGGGGGATCC
GCCACATTCA CTCTTACGCC CTACGTGAAC TATTCAGTCC TAGTTTACAA CACTAACGAT
CAGCTGGTTG GAGAGGCCAA CGTAATGGCT CAGCCTGGCT CCATGACCAC CAGCGTGACG
CAATTTAATG TGGTGATACC TCCAAGCGTA GATCTCACAC TTGGGCAGAC AACCAATATT
CCAGTTGAAA TTCAAGCTTA TGGACACGTA ACAGTCCAGG TTAAGGGCAA CTCTCTCACA
AGGACTTACA ACGTGTACGT AAATGGACAA ACCACGTTGA ACGTTCCCGT AGAAGCAACG
TCACCTGGCT CCTATCAACT TGAGGTTAAC GCAACTCTCT TCCCAGGGTT CTCCGTGGTT
AAGTATGTTA CAGTGAACAT TGCACAACCT CAACTTTACT CCGTGACAGG ATCCTTCTCA
GTGAATGGGC AAACGCCTCC CACTACACCC CTCGTAACCT TTACCTTCCC CAACGGCTCA
AGGGTCACCT TGCCACTAAA TAACCTGAAC ATTAAGGTGC CCCCCGGTAC CACCTATTCC
GCCTCAAATA TAACTGAGTC TGGCTATAGG TGGATCGCAG TTAATGGTCA AGGAGAGATA
CTGGGCACAA CTCAGCTCTC CATAACCTAC TACGAGCAGG TTCAAGTGAA CTTCCAGTAT
CAGGGCCCTG CAGTTCCTAC CGTGAGTTAC TACTACCTGG GGAACCTGGA GACCGCGAAG
TTGCCCGTCA CGTTATGGGT GGACTACGGC ACTAGTTACA CATTCCCACA AATCCTCTCG
AGCAGTTCAG GGGAAAGGGT GATCGCATAT AACTACCAGG GAACAGCAAC CTCTCCCAGC
ACCTTCACAG TCACCTATTA TCAACAGTAC TACGTTGACG TCTCATCTCC TATCCCAGTC
TACGCTCTCG TGAACGGCCA CAATGAGAGC CTAAGCTCGG GATGGTACAA CCAATCTACG
TCAATAGACG TCGAGAACGT TCCTTACTAC CTAGGCCCAG GTGAGAGGGA AGTCATTACT
AGTGTGACGC CTTCATCCTC TCTCTCCGTG AATTCTCCCC TAAACATTAC CGTCACCCCA
GAGACTCAGT ACTATGTTCA GGTGTCGTCT CCTATCCCCG TTTATGCCGA AGTTGACTAT
GAAAACTTCA CCCTCCAGTC AGGTTGGTAC TATCAAGGAA CCTTGATACA GGTTGAGAAC
ATCACATATT ACCCCTCATC TAACGAACGT TACGTGATAA CTTCGATTAC TCCATCAACT
TTCACGGTGA ACCAGCCGGA AACCATTACG ATCTCCACCG TGGTTCAGTA TAGGCTCACC
TTACTTTCAT CAATTCCTAC ATACGCCCTT GTTAACGGAA ATAACGAGAC CTTGACCTCA
GGTTGGTATA ACGCGGGAAC AACCATAAAC GTGGAGAATA TCACATATTA CGTTACCCCT
ACAACTAGGG AACTCGTCAC CCAGATCTCT CCTTCCACCC TGACCATGAA CGGGCCTTCA
GCTATTTCAG TACAGACAGT GAAACAGTAT CTAGTACAGA TAAACTCTCA ATACCCTGTA
ACAATAAATG GGGTTCAGAC AAACAGTGAG TGGGTTAATG CAGGGTCAAG CATCACCTTG
AATGCCAATT TACCTTTCTA CTTGACGGGA AGTTTCAGCG GAACTGCTCC AGTGGCTCTT
GGAGGGAGTA TAGCGGTTAA CCAACCCGTC CAGGAGACGT TACAGACGTC CATCAGTCTA
GTATTTGTGG GGATAGTTGC CGTTATTGCC ATAGTGGCTG TGGGTGTGGT GATAGTACTG
ATAAAGAGAA GGTGA
 
Protein sequence
MGITDYGVGP NGFYSYDTTQ FLGTVYINSL QALTFTSGST AVTFQLNVML NYEARGSSYA 
LWVQDVACFN TVNHEIYFID NIWNSTVPFG NVTGLQGNGQ YAVSSSNGNY PSQVFYGDVS
HDPGSIVTQS LPTAFDLLVN VSTNNLGQPV IYFWYNDGYG WINYDTVTVT NVQGATNVYF
EVNGYALTGN GNNYDAELVL AGPGGGSNTY INQANVDLLL FYWNGHNFQE VANTYNYGSD
TAETSSNVKV LYYYYPYSGL PTGFITTGPG ALGPLWNSTD VTCLTVNTGL NQGYVQIYND
SYSYQYAEQV NNLCLFQGGS ATFTLTPYVN YSVLVYNTND QLVGEANVMA QPGSMTTSVT
QFNVVIPPSV DLTLGQTTNI PVEIQAYGHV TVQVKGNSLT RTYNVYVNGQ TTLNVPVEAT
SPGSYQLEVN ATLFPGFSVV KYVTVNIAQP QLYSVTGSFS VNGQTPPTTP LVTFTFPNGS
RVTLPLNNLN IKVPPGTTYS ASNITESGYR WIAVNGQGEI LGTTQLSITY YEQVQVNFQY
QGPAVPTVSY YYLGNLETAK LPVTLWVDYG TSYTFPQILS SSSGERVIAY NYQGTATSPS
TFTVTYYQQY YVDVSSPIPV YALVNGHNES LSSGWYNQST SIDVENVPYY LGPGEREVIT
SVTPSSSLSV NSPLNITVTP ETQYYVQVSS PIPVYAEVDY ENFTLQSGWY YQGTLIQVEN
ITYYPSSNER YVITSITPST FTVNQPETIT ISTVVQYRLT LLSSIPTYAL VNGNNETLTS
GWYNAGTTIN VENITYYVTP TTRELVTQIS PSTLTMNGPS AISVQTVKQY LVQINSQYPV
TINGVQTNSE WVNAGSSITL NANLPFYLTG SFSGTAPVAL GGSIAVNQPV QETLQTSISL
VFVGIVAVIA IVAVGVVIVL IKRR