Gene Msed_0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0458 
Symbol 
ID5105454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp411401 
End bp412876 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content46% 
IMG OID640506364 
Producthypothetical protein 
Protein accessionYP_001190559 
Protein GI146303243 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGGA TCATGAGACA AATCTTAGGC TCAACATTGT TAATTTTACT ACTCGGATCT 
TTCATAGGAT TAATTGCCGG CTCTATTGGC ACTGGAGCCT CTACCACTCA AACTTACAAC
GTCACCTTTA TGGAGCATGG GTTGCCCAGC GGAACCATGT GGTCCGTTAC CTTTAACGGA
CAGACAAAGA ACTCCACGAG CAATGAGATA GTGTTTCAAG TTCAGGGAGC AAGTTACTCA
TCCTTCTCAA TTCCAAACGT GGGCAACTAT ATTCCAACAC CTTCTAATGG ACAAGTGTTC
GTGAATTCCT CGCTTAGTAT AAACGTGACG TTCGCTCTGC CATTCGTTAA GCTAGTGATT
GTTAAGTTAG TGGTTTTACA GCAAAGTACC GGAGCCTCAG TCACGGAGCT ACAGCCTGGC
ACATCATATG TTGTTGGTGT GGAGGTTCAG AATCAGGGTA ACGTTAATGC GCTCACAGAG
GTGAATGAGA CAGTCCTCTA CAACGGAAAA GTCGTTACGT CTGACGTTCC CATAGCTAGC
ATAGCTCCTG GGGCCTCGGA AACACTAAGC TTTATCTGGA CCCCATCCAC TGCAGGTATA
TATACGTTCC TTGTTAACGT CAAGGCCAAT CCCAACATCT CAGTAAGTGA GACGTATCCA
CTTTACGTGG GAGTATCGCC AGTGAACGTA TACAACGTGA GCTTTGTGCA GACAGGCCTA
CCCGCAGGGA CACAATGGTC CGTTACCCTC AACGGCACAA CTAAATCATC AACCTCTAAC
ATGATAACCT TCCAGGTTCC AGCCGGAACC TACACCTATT CGGTGCAGAA CGTCACAGGT
TATCTAAGCA AGGATGTTAC AGGTGAAGTT ACGGTGAAAA ACAGCTCGGT AACGGTACAA
ATCACTTTCC TCCCCTTGGT GTTTAAGCCA GTGGCAACGC TATTAGTTTC CTATAACGGT
CAGGAAGTTA CCCAGTTACA AACCAACATT ACGTATGACT TGATAGTCAC TGTAAAGAAT
GAAGGGAACA CTTCAGGTCA GGGTTATGTT CTCGTCATAG CATCTCAGGG TTCGACGACC
GTCCTTAACA AGGCCTTGAA CTATACCTTA AAGCCAGGGC AGGCTGAGAA TTTTACCCTG
CTCTTTAACC TCAACTCGAC GCAACCCCTT TCCATCAAGG TAAGCACATA CTCTTTGACC
CCCAAGGGAG AAGTTCCAGT CTACAACTCA TCCTCACAGT TCACAGTTGT TCAACAGCCT
ACCACGACGA AGACAACAAC CACTAACACA TCTACGTCAA CGACAAATAC GACGAAGACA
ACAACCACTA ACACATCTAC GTCAACCACC ACTAACACAA CGACTCCTTC ACCTAAACCA
TCCTCAGGCT CGTCTAATAC CTTACTAATT GTCGGAATAG TTGTAGTTGT TGTCGTTATT
ATAGCGGTGG CAGTGATTTT CCTAAAACGG AAATAA
 
Protein sequence
MFGIMRQILG STLLILLLGS FIGLIAGSIG TGASTTQTYN VTFMEHGLPS GTMWSVTFNG 
QTKNSTSNEI VFQVQGASYS SFSIPNVGNY IPTPSNGQVF VNSSLSINVT FALPFVKLVI
VKLVVLQQST GASVTELQPG TSYVVGVEVQ NQGNVNALTE VNETVLYNGK VVTSDVPIAS
IAPGASETLS FIWTPSTAGI YTFLVNVKAN PNISVSETYP LYVGVSPVNV YNVSFVQTGL
PAGTQWSVTL NGTTKSSTSN MITFQVPAGT YTYSVQNVTG YLSKDVTGEV TVKNSSVTVQ
ITFLPLVFKP VATLLVSYNG QEVTQLQTNI TYDLIVTVKN EGNTSGQGYV LVIASQGSTT
VLNKALNYTL KPGQAENFTL LFNLNSTQPL SIKVSTYSLT PKGEVPVYNS SSQFTVVQQP
TTTKTTTTNT STSTTNTTKT TTTNTSTSTT TNTTTPSPKP SSGSSNTLLI VGIVVVVVVI
IAVAVIFLKR K