Gene Msed_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0417 
Symbol 
ID5105534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp368272 
End bp369366 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content50% 
IMG OID640506323 
Productmajor facilitator transporter 
Protein accessionYP_001190518 
Protein GI146303202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.117512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCACG TAACGAAGCT GGCCTTTTCC GGTGGGATAA GATCCTTTAC GTCGTCTCTA 
ATCTGGCCGT ATATAGGGTT TGGCCTCTAT AAGTACCTTG GTTTGTCCCT GGTTCAGGTC
AGCCAGTTCT ATCTAACCCA GCTCTTGATC TCGTCGATAG CCTATGTCAT TGGGGGATAC
TTGACAGATT ACCTAGGGAG GAGACTCGTG ATGACGCTAG CTACTTCGCT TTCCTCGCTA
GTGCTCACTC TAGCCTTTTT CCTTAACACT GCGGGAGTCA TAGGAATGGT TCTGCTCCAG
TCAGGATTCA GTAGCATTTA TGCTGTAGCT AACATGGCCA GCGTAGGGGA CATGGGAGGT
AACTTTAAGC AACTTGTGAG GTCGTTTAGT GTAATACGCG TTGGGATCAA TGCTGGATGG
GCCATAGGTC CTGCAATTGG AGGTTTACTT CTGGGGGATA TAGGATTCAA ACCACTGCTA
CTCCTGGGCG GGGTCCTGTC AGTGGTTGCC ATTCCCTTTG TGTACTCCCT TCCAGATCAC
AAGGGGAGGG TTAGGTTCTT CCTTCCCAAC AGGAAGTTCG CCATGTTTCT GATACCCACC
CTTCTCACCT TTACTGTAAT GGGACAGCTG GGATTTCCTC TAGTTACCTA CTACAGTGGA
CTTGGCATTG CGGTCTGGCA GGTGGGTCTC CTCTACGCCG TCAACGGAGG ACTCATTATA
CTCCTGCAGA GATGGATTGG GGAAAGGGTA TCTGGAAATT ATAGGACCTG GATATCCGTA
GGAATGCTCA TGTACTCTTT GAGTTACGGG CTTGTATCTC TGGTCTCTAA CGTATGGGAA
GCCCTTCTAG ACGTCGTGGG AATTACCTTG GCTGAGATGA TTGTGTCTCC CCTATCCCAA
TCCATTTCCA CATCCCTAGC TGAAAGTGAG ACGAGGGGAA CCTACTCCGG GATATATGGA
CTAGTAAGTT CCATGGGGAG AACCCTTGGT TCCTCCATGT CCGCCTTCCT ACTCACTAGG
GGAGGGGAGG TGACGTGGTC GTCAGTGGGA GGTGTTGGGG CAGTCTCAGC TATTCTTTAC
CTAGCATTGA TTTGA
 
Protein sequence
MNHVTKLAFS GGIRSFTSSL IWPYIGFGLY KYLGLSLVQV SQFYLTQLLI SSIAYVIGGY 
LTDYLGRRLV MTLATSLSSL VLTLAFFLNT AGVIGMVLLQ SGFSSIYAVA NMASVGDMGG
NFKQLVRSFS VIRVGINAGW AIGPAIGGLL LGDIGFKPLL LLGGVLSVVA IPFVYSLPDH
KGRVRFFLPN RKFAMFLIPT LLTFTVMGQL GFPLVTYYSG LGIAVWQVGL LYAVNGGLII
LLQRWIGERV SGNYRTWISV GMLMYSLSYG LVSLVSNVWE ALLDVVGITL AEMIVSPLSQ
SISTSLAESE TRGTYSGIYG LVSSMGRTLG SSMSAFLLTR GGEVTWSSVG GVGAVSAILY
LALI