Gene Msed_1155 details

Gene Information

Locus tag: Msed_1155
Symbol:
ID: 5103503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1119052
End bp: 1120803
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 41%
IMG OID: 640507047
Product: hypothetical protein
Protein accession: YP_001191240
Protein GI: 146303924
COG category:
COG ID:
TIGRFAM ID:
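
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes one terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1119052, 1120803)
print(length)                  # 1752
print(protein_length(length))  # 583
```

Both values agree with the Gene Length and Protein Length fields of this record.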


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.104357
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTTGA TGCCACATTC ACTAGAATAT ACTGTCTTCA CCAATATTCA ATTGAGGATT 
TCCTCCGATG CAGAGAATGC ATCAACTAAA AGAGTTGAAA AGAATATTAA AATCAAAAAG
ATTGAAATAA CGGTTAAGGA AGAGAAAGCG AAGGTTAGGG TATGGACAGA TCAACAAGAG
CTCGATTTAT TTGCAGAAAA ATTGACTATA GACGATATTA AGATTCCCAC ACAAATTACA
ATCTCCATGA AATCCGTCTC TGAGGTGTTG TCTGAGCGCC GTTTGACCTT TGAAGTGACC
AAGAAATCTC TCGATGTAGC TATCATGGAG GTAAAATTGG ATTCTTCAAA GAAATTGGGT
CCCTTACCGT GCTATGTGAA ATTTTACGGA AAAGTTTACC ACAACGGTAC GGGGATTATT
GCACACGAGT TAAGAATCTA CACCAGGATA AAAATAAGTA GGGTCAGGTA CTTCGTGACT
AATAGGGATA ACAAGAACAT TAATGTTCGA CTCGAGTATC AGATTGAAAA TGACCTGGAC
TTCGTTGTTA AACACGTCTT TGTCCCACTT AAAAATTTCG TGATTGGGTT AATGATAAGG
GACGAGAGCG GAACACAACT AAGTTTCATC AGTAGGAGAG AGATCATGAC AAGGCTCTCG
CTTGACCGCT CACTAGATGA ATTGAATTAC TTTGTGATAG TTCCTCTTCA AGGTGGGCTG
AAGCCAAGGC AAAGGAAAAT CCTCAACTTT GAGGGAATTG AGGTCTTACC GCCTAGTAAA
CCCAAGGAGA AGAGTGGGTC AAAGGAGAAG AATGAGCCAG GGACGGAAAG CGTATTTGAG
ACAGAGTTCA ATGGTGATGC TGCCTTGGGA ATAATCATCG AATCTCCTTC ATCTAACAAG
GACATTGTTG TTAAGAGTAT TACTAGTAAA GTTGTTAAGA AGATTACTAG TGGACAAGAG
GATAATCAAA GTGGTCAAAA GGAAATACCA CTCAACCCTT GCAATGATAT GAAGTCTGAA
GAGAGTTCAA CCCCTACAAC AGTTGCCTCA AGCAATTCAC AGCCCGAGTT TTACTGCGAG
CCCAGGAACT GTGATGATTA TGAAGGTCAA CATAATATAA TCAGATCGTC CCACAGAATA
GACCTAGAAT TCAGATCGAG GGGCGGTAAT ATATCTACAT CATCGCCAGT GACGGCGATT
ATTTCAGTCA CTTATAACAT AGTTCCAGAG AAGAAGGCTC AGGATTACCT GTGGTTCCTT
ACCGCCTTCT ACTGGTTAGC CACAACAACC ATATTTGGGC TATTCGTCGA GGAGTTATCC
ATGGATTTAT TTGGATTTAA CTTCTTGACC TTCGTTGTGG GATTATTAGC AGGTGGTGTT
GCGGGAGCCT TTTTATTCTT CCCCTACTAT ATAAGCCATG TAGGAAATTT ATTTGAGTCA
GATAGTATAA GCTTCCTGAG GGAGGCCTTC AGGAGAGTTA TACGATACAA GTTCAACCTA
GCCCTTTTCA TGGTGAGCAT ATTCATCCTG GTGATATCTA TGAGCCTGAG GATATATCCC
GCCTTGCTCG AGTCAACTCC CGCCATCACC GTAATCAGAG AAGTTTATTT GGACATAAAT
TTTGCCCTGG CAGTTATATT GGAGTTCTCG ACAATAACCT CTGAGGAGCT AGAGTACAAG
TCTCCGTTCG AGGTGCTAAC AATATCCCTT CTAAGCTTGA TACTGATCTC CATGTTCCTC
CTGCTTATTT AA
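
The gene sequence is printed in blocks of ten bases, so the spacing has to be removed before any computation. A minimal sketch of how the GC content field could be recomputed from such a listing follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the listing
first_row = "ATGATTTTGA TGCCACATTC ACTAGAATAT ACTGTCTTCA CCAATATTCA ATTGAGGATT"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1752 bp listing to the same two functions gives the figure to compare against the GC content field.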
 
Protein sequence
MILMPHSLEY TVFTNIQLRI SSDAENASTK RVEKNIKIKK IEITVKEEKA KVRVWTDQQE 
LDLFAEKLTI DDIKIPTQIT ISMKSVSEVL SERRLTFEVT KKSLDVAIME VKLDSSKKLG
PLPCYVKFYG KVYHNGTGII AHELRIYTRI KISRVRYFVT NRDNKNINVR LEYQIENDLD
FVVKHVFVPL KNFVIGLMIR DESGTQLSFI SRREIMTRLS LDRSLDELNY FVIVPLQGGL
KPRQRKILNF EGIEVLPPSK PKEKSGSKEK NEPGTESVFE TEFNGDAALG IIIESPSSNK
DIVVKSITSK VVKKITSGQE DNQSGQKEIP LNPCNDMKSE ESSTPTTVAS SNSQPEFYCE
PRNCDDYEGQ HNIIRSSHRI DLEFRSRGGN ISTSSPVTAI ISVTYNIVPE KKAQDYLWFL
TAFYWLATTT IFGLFVEELS MDLFGFNFLT FVVGLLAGGV AGAFLFFPYY ISHVGNLFES
DSISFLREAF RRVIRYKFNL ALFMVSIFIL VISMSLRIYP ALLESTPAIT VIREVYLDIN
FALAVILEFS TITSEELEYK SPFEVLTISL LSLILISMFL LLI
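
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI amino-acid string for translation table 11, which assigns the same amino acids as the standard code. It is a simplified illustration with three assumptions: the input contains only A, C, G and T, the reading frame starts at the first base, and alternative start codons are not treated specially. The name `translate` is illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record
print(translate("ATGATTTTGA TGCCACATTC ACTAGAATAT"))  # MILMPHSLEY
print(translate("CTGCTTATTT AA"))                     # LLI
```

The two outputs match the first ten and last three residues of the listed protein sequence.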