Gene Msed_1730 details

Gene Information

Locus tag: Msed_1730
Symbol:
ID: 5105093
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1667094
End bp: 1668140
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 42%
IMG OID: 640507625
Product: hypothetical protein
Protein accession: YP_001191809
Protein GI: 146304493
COG category: [S] Function unknown
COG ID: [COG2237] Predicted membrane protein
TIGRFAM ID:
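
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the gene is unspliced (as stated above), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1667094, 1668140)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Both values agree with the Gene Length and Protein Length fields of this record.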


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.234359
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.75509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACGG CGATAATATA CATTGATATA GATGACGATT TGAGTAAGGC TGGTGTTTCT 
ACCCCAGTTA TAGGAGAGGC TAAAGCTAGG GAAGCCATAG AGAAATCCTC GCGATTTTTG
GCCCTAGACT CAGACTTCAA TACCATGGTA ACTGCGTTCA ACATTTACCT TGATATGAAA
GCAAATGGAG AAGATGTGGA GATCGTATTT GTTGCTGGGT CACAAAGAGG TGGTCTCGAT
TCTCAAATGG CACTCTCTAA GCAAGTTGAT GAGGTTATTA GAGCTATTAA GCCTGATCAG
GCAATCCTAG TTTATGATAG CCCTGAGGAC GCAAAGGCTA TCCCGGTCAT CGAAAGCAGG
CTGAAGATAG TGGGAATAGA GAGAGTTATA GTGGAACAGC ATAGAGGGGT GGAGGAGACA
TATATCTTGC TTGGGAAATA TCTCAAGAGA CTAGTAACGG AAAGCAGGTA CTCGAGACTG
TTTCTAGGGG TTCCTGGAAT AATTCTTTTC GTTTCAAGTA TATTGGCAAT TGCTGGCCTA
ACGGCTTACG TCCTTCCCTC CATACTTTTG GTTCTAGGAG GAGCCATGTT AGTTAGGGGA
TTTGGAATAG ATGATGCCAT AGAGAGATGG TGGGAGAACT CAACTATCAT GGTTATTGTG
GCAATTCTAT CCTCCATTTC CCTGGTACTT GCTATAATTA ACGGATATCT AGTTGCTTCC
ACAGCTGGAC CGCTTTCCAT AAGGTCTGCC TCATCGACTT TACTAGCCAT ACTACCGTAT
CTCACGTTTT CTATTATCAT TTTATACTCA GGTAAGCTCT TGTCTAGAGC TCTATCAAAG
GACATAAGGA TTTGGCACGA TCTATTAAAG ATACTCGCAT CCATACTTGC CTACTTCGTG
CTTACTGGGT TACTTAGAAA TCTTGAGAGC GGTGCTTACA TAATTCAGAT TCAATCATTC
TACCTTCTTC TACTTTCCTC GTTCATTTTA ATTGCCACAT ACTTCGTTCT TTCCAATTTC
GAGAAAAACA GATTAAGATC ACAATGA
 
Protein sequence
MKTAIIYIDI DDDLSKAGVS TPVIGEAKAR EAIEKSSRFL ALDSDFNTMV TAFNIYLDMK 
ANGEDVEIVF VAGSQRGGLD SQMALSKQVD EVIRAIKPDQ AILVYDSPED AKAIPVIESR
LKIVGIERVI VEQHRGVEET YILLGKYLKR LVTESRYSRL FLGVPGIILF VSSILAIAGL
TAYVLPSILL VLGGAMLVRG FGIDDAIERW WENSTIMVIV AILSSISLVL AIINGYLVAS
TAGPLSIRSA SSTLLAILPY LTFSIIILYS GKLLSRALSK DIRIWHDLLK ILASILAYFV
LTGLLRNLES GAYIIQIQSF YLLLLSSFIL IATYFVLSNF EKNRLRSQ
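
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce this record. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest)
# paired with the standard amino acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence listed above.
first_row = "ATGAAGACGG CGATAATATA CATTGATATA GATGACGATT TGAGTAAGGC TGGTGTTTCT"
print(translate(first_row))  # MKTAIIYIDIDDDLSKAGVS
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.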