Gene Msed_0889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0889 
Symbol 
ID5103535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp823184 
End bp824587 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content45% 
IMG OID640506792 
Productamino acid permease-associated region 
Protein accessionYP_001190985 
Protein GI146303669 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGCCT TTTCCTTGGG CAATATCATA GGATCGGGAA TTCTGCTCCT GCCCGCTGTA 
ACCGCATCAA TGGCAGGTGG TCTAACTCCA CTTGTGTGGT TAGCTGGAGG AATAATGTTA
CTTCCCTTAG TTTTTGTGTA CTCTCAGCTT GCGAAAATGT ATCCGGTAGT GGGTAGTAAG
GTAAGATTCG TTAATGTAAC CCACGGCAAA GTTCTTGCCT CTTCCGTAGG ATGGCTCTAT
CTCATAGGAA CGTTGTTTGT GGTACCGATT GAGGCTGAGG CTTCCCTACA GTACCTTTCC
TACATCTTTC CATCTTTATG GAGTAACGGT TCCCCAACCT TAGCGGGTGA TCTCGTCGAA
ATAGGCATTG TTTCCGCAAT TTACTTCCTA GTTTACATGG GAATTAGAAC TCAGTCCCTA
AGTGTGAATG TTATAACTTA CACCAAGCTT GGGATATTGG CACTTTACGT AGCTCTTGTG
GGGATTCTCG CGTTTCATCC TTCAAACTTT ACCATTCCAA CCCAATCGGG CACATCAACC
TTCCTAGATG CCATAGCGTT AACGATGTTC GCGTACGGAG GATTTAGAAG CGCTATGGTA
TATGCTGGTG AAAGCAAAAA CAAGAACCAG ACCGGAAAAG CCATACTGAT TGCCTTCCTG
CTTTCCATGA TTGTCTACAC CCTTGTTCCC ATAGTTTTCA TAGGTTCCCT CACGCCTGAA
ATTCTAGGTC ACGGATGGGG ATACGTTTCA AAGATGAGTG CGCCTTTAAC TCAAAGCGCT
TTAATAGCCG GTATACCAGT GCTTGGAGCA CTCTTCATCA TAGATGGTGT AATCTCGCCC
TCCGGTGCAT CTTTAATTGG CGCTGGAGAC ATCAGCAGGT ACATGTATGC TCTGGTCAAG
GTGGGCAGTG CTCCCAAGGG ACTGGGAAAG GTAAGCGAAA AGCGTGGAAT TCCTGTTATT
CCAACGTTAC TCTCCCTCTT AGCTTCGATA GTGCTGTTAT TTGTTTCTCC TACGTTTGAG
CAGTCAATTG GCTACCTCAT TGCAGCTCAT GTGCTGGGTT ACGCAACAGG GCCTATTAGC
CTGTATGTTC TCACTTCCAA CAGAGGATAT AAGGCCATTT CCATGGTTGG ATTTATAACA
TCTGGCCTCA TATATACTTG GCTGGGATTC CCTAAGACTC TATTTGGCAC CCTGATCATA
GGGGTATCCA TGCTAGTAAT GGCCATGATA AACAGACCCG TGAAACCAGC ATTATGGTAC
GTGGGATATG CTATGGTATT AACAACTATA AGTGTTCTGG TTTCTAACAC CATTTATGAG
ATTATTGCAA CTTTAGCTCT GGCACCCGTA TTCTTTGCTC TAGCCATAAA ATCGGCTAAG
GGATCTGGAG AAGTAGAAGC CTAG
 
Protein sequence
MIAFSLGNII GSGILLLPAV TASMAGGLTP LVWLAGGIML LPLVFVYSQL AKMYPVVGSK 
VRFVNVTHGK VLASSVGWLY LIGTLFVVPI EAEASLQYLS YIFPSLWSNG SPTLAGDLVE
IGIVSAIYFL VYMGIRTQSL SVNVITYTKL GILALYVALV GILAFHPSNF TIPTQSGTST
FLDAIALTMF AYGGFRSAMV YAGESKNKNQ TGKAILIAFL LSMIVYTLVP IVFIGSLTPE
ILGHGWGYVS KMSAPLTQSA LIAGIPVLGA LFIIDGVISP SGASLIGAGD ISRYMYALVK
VGSAPKGLGK VSEKRGIPVI PTLLSLLASI VLLFVSPTFE QSIGYLIAAH VLGYATGPIS
LYVLTSNRGY KAISMVGFIT SGLIYTWLGF PKTLFGTLII GVSMLVMAMI NRPVKPALWY
VGYAMVLTTI SVLVSNTIYE IIATLALAPV FFALAIKSAK GSGEVEA