Gene Msed_1156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1156 
Symbol 
ID5103504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1121624 
End bp1122847 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content55% 
IMG OID640507048 
Productprotein of unknown function DUF395, YeeE/YedE 
Protein accessionYP_001191241 
Protein GI146303925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.643231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0925546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAACCT TTACAGCACC CATGTGGGTT GGAATCCTCA TAGGCTTCAT CATAGGCGCA 
GCAGCGGAGG CCTGGGGGAT AGCAAACCCA GAGACGCTAA TAAGACTAGC GAAGTGGGAG
GATAGACTCT TCGTAATATG CATCGCCCTG GGCCTGGCGA TCTCGACTCC CGTACTATTT
GGGCTCTATG CCCTCGGGGT GGGTTTCCAC TTCAGCCCGA AGCCACTCTA CCTAGTTGGA
GTTGGCCTCG GCGGAGTCCT GTTTGGAGCA GGGCTGGCAA TAGCCGGGTA TTTCCCTGGT
TCCATCTGGA TGGCCCTAGG TGAAGGAAGA AGGGACGCAA TCTATGCTTT ACTGGGAGCA
CTCCTCGGAG CTGCCTCGTG GACGGCCCTG TACCAGACCA GCGTTGGCCA GTGGCTAGTG
AGTACTCTGA ACTTCGGTAG CCTCGTGATC GGTGGAAAGC ACGTGTCTAC CTTCGTGATT
CAGCCGTTCC AGGGACTGAC CCCCGTGGAC CTTTTCGGGA TATCCCTAGT TTACGCGGTC
GGCCTCTTCC TAGTCGCATA TTACCTTCCG AGGTATAAGG GAGGACAGAG GAGTTGCATT
AGGGAGAACC TTGAGAGGAG GAACACTCCC GTCGAGGTCC AGAAGCACCT CGACACAGCA
ATCTACTTGA CCGATGGCGG TCTACCCTAC TCCCAGACCT CTCTAGCCAA GAAGGTGAAC
GAGTACTACG CAACGGAGAG CAACGTAACC AGGTGGTTCA TGGTCTCCAT CGCCGGTATC
GTGGGGCTTA CTGTGGTACT GGAGATGTTC CTTCACCAGA TATTCGGCGA ATCCACAACC
TACTCCTGGA TAGTTGGGCA ACTCTTCATG CCATCTTTCA AGTATAGCCA GATAGTCTTC
AAGGGGATTG GATGGGAGCC CTTCAGCGAC ATTGGGACCT TGATGGGAGC CTTCTTCAGC
GCAGTCTTCA TTACTAGGAG GTTCACATCC TTTAGGAACA TCATACCGCC AAGCTGGGCC
CACAGGTTCG GGACAAATGA GGCAGTGAGG TTCGTGGGTT CCTTCCTGGG AGGTTACCTG
ATGCTGTTCG GAGCCAGGAT GGCAGGCGGT TGCGCCAGCG GACACATCCT CAGCGGTGAC
ATGCAGATGG CCCTGAGTGG TCTCGAGTTC ACAGCAGCCG TTTTTGCAGC AATGATCATA
ACTGCGAAGG TGGTGTACAA ATGA
 
Protein sequence
MITFTAPMWV GILIGFIIGA AAEAWGIANP ETLIRLAKWE DRLFVICIAL GLAISTPVLF 
GLYALGVGFH FSPKPLYLVG VGLGGVLFGA GLAIAGYFPG SIWMALGEGR RDAIYALLGA
LLGAASWTAL YQTSVGQWLV STLNFGSLVI GGKHVSTFVI QPFQGLTPVD LFGISLVYAV
GLFLVAYYLP RYKGGQRSCI RENLERRNTP VEVQKHLDTA IYLTDGGLPY SQTSLAKKVN
EYYATESNVT RWFMVSIAGI VGLTVVLEMF LHQIFGESTT YSWIVGQLFM PSFKYSQIVF
KGIGWEPFSD IGTLMGAFFS AVFITRRFTS FRNIIPPSWA HRFGTNEAVR FVGSFLGGYL
MLFGARMAGG CASGHILSGD MQMALSGLEF TAAVFAAMII TAKVVYK