Gene Msed_0221 details

Gene Information

Locus tag: Msed_0221
Symbol:
ID: 5104087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 182314
End bp: 183693
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 48%
IMG OID: 640506126
Product: hypothetical protein
Protein accession: YP_001190322
Protein GI: 146303006
COG category:
COG ID:
TIGRFAM ID:
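
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a complete, unspliced CDS whose final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 182314
end_bp = 183693
gene_length_bp = 1380
protein_length_aa = 459


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one stop codon (assumed complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```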


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000130576
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0122393
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGTCA GGAGTAAGAG ACACAACAGG GTCAGGGCAC CGTTTGATTG CGAGAGAGGG 
TTGCCCTATA CAGAGGTAAA CGTGAACGGG AAAATTGTGG AGAACTGTGA GCCTCCTACA
AGACTTCATG GGCTTAGCTT TCCCTTAAGC GGGATGTATC TACATCGAGT CAAACTTCTG
AGAACCTTTC CCTGGCTCCT GGAAAAGGTA GCAGAACGGA TGAATGTTCC AGAAGGATAC
TCCACCGTGG AGGATTACAG GGTTGAGAGG GTTAGCGTGG ATACGCTAAT AGTAGGATCA
GGACTTTCAG GTTTAACAGC TCTATCCAAG AGTAAGGCCA TGCTTGTCAC GAATGATCTT
TACACAGACC TCTTTGACGA TCCCTTGAAT CAGGGAGAGC TTTTGGGGAA GGTCAAGGAC
ATTATCAAGC AGAACGAGGG TAGGATAATT CAGGGCGACT TCCTAGGAAA GTTTACTGAG
GGCTTTGTGG TCAGAACGGG TAAGAAACTC GTTCTAGTTT CCCCCTCCAG GGTGATCTTC
GCCGTTGGTG GAAGATATTT GCCTCCTATC TTCGAGGGTA ATGACTATCC CAACGTGATT
TCCAGGAGAC TTTATCTCAA GAGGAGATCT GCCTATAGAA GGATAGTGGT TCTTGGATCT
TCGGATGATG CAATTAAAAC TAGTCTAATT TCAGGAGGTA AAATTCTGAC TCCGCGTGGG
GTAAGGCTCT TCTCAAGAAG GTACTTGGAG TTGGCCGAAA CTAGGGGGGT GGAGATAGAG
GAGGTCGACT CTCTTAAGGT GAAACCAAGG GATGGCAAAC TTTTCGTGGA GTGGAATAGT
AGTAATCTTC TGGTTGACGC TGTGGTTTTC GCTCCAGTTA AACAGCCCAG ACTGGAACCC
ATAGCTAATG CAGGGTGTGA ATACAGGTTT TACCCAAACA TGGGACTATA CGTTCCGGAA
CATGAGATGG ACGGTTACAT GAGGAGTTGT GGGCACTTTG TAGTTGGAGG GGCCAGAGGC
ATCATGGACG AAGAAACGTC GATGTTAAGT GCTGAGGCTC CTTTTAGCGC TGAGGCCCTC
TCTACCCTAG CCAGTCATCT GAAAGAAACT CCCCTTCACG AGTACTACAC AAGGAATTTC
GTATCTGTGA AGAGTCCATA TTACTATTCT CCAGGAGGTT ACGCTTGTTT CTGCGAAGAT
GTGCTCTGGA GTGATGTGGA ACAGGTCATG AAAATGGGTT ACGACAATGT GGAGTTAATC
AAAAGGGTTG GTGGGATTGG TCTTGGCGAG TGTCAGGGCA AGGTTTGCAC ATACGTTACT
GGTAGTATCC TGTCAAGTCA GAGGCTGATA ACCTTCAGAT CACCGCTTTA CCCGATGTGA
 
Protein sequence
MEVRSKRHNR VRAPFDCERG LPYTEVNVNG KIVENCEPPT RLHGLSFPLS GMYLHRVKLL 
RTFPWLLEKV AERMNVPEGY STVEDYRVER VSVDTLIVGS GLSGLTALSK SKAMLVTNDL
YTDLFDDPLN QGELLGKVKD IIKQNEGRII QGDFLGKFTE GFVVRTGKKL VLVSPSRVIF
AVGGRYLPPI FEGNDYPNVI SRRLYLKRRS AYRRIVVLGS SDDAIKTSLI SGGKILTPRG
VRLFSRRYLE LAETRGVEIE EVDSLKVKPR DGKLFVEWNS SNLLVDAVVF APVKQPRLEP
IANAGCEYRF YPNMGLYVPE HEMDGYMRSC GHFVVGGARG IMDEETSMLS AEAPFSAEAL
STLASHLKET PLHEYYTRNF VSVKSPYYYS PGGYACFCED VLWSDVEQVM KMGYDNVELI
KRVGGIGLGE CQGKVCTYVT GSILSSQRLI TFRSPLYPM
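
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining the blocks into one contiguous string and computing a GC fraction is shown here; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq):
    """Fraction of G and C characters in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGAGGTCA GGAGTAAGAG ACACAACAGG GTCAGGGCAC CGTTTGATTG CGAGAGAGGG"
seq = join_blocks(first_line)

print(len(seq))                    # number of bases in the joined line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Applied to the full gene sequence, the joined length can be compared against the Gene Length field, and the GC fraction against the GC content field.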