Gene Msed_1373 details


Gene Information

Locus tag: Msed_1373
Symbol:
ID: 5103432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1345361
End bp: 1346815
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 44%
IMG OID: 640507262
Product: amino acid permease-associated region
Protein accession: YP_001191455
Protein GI: 146304139
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
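
The coordinates and lengths listed above agree with each other, as the short sketch below shows. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is illustrative and not part of the record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and that the last codon
    of the CDS is a stop codon, which is not counted as a residue.
    """
    gene_length = end_bp - start_bp + 1
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp as given in the record.
print(cds_lengths(1345361, 1346815))  # -> (1455, 484)
```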


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0348286
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAGG GATCTCAAGA GCAGCGAAAT GAAGAGCCTA AACGGGTCAT AGGAATCCTT 
GACTTAATAT TTATCTCCTT GGGTGGACAA TCGCCTTTCC TCAGTGTGTT AACCTACGGT
GTAGAGGCGT ACCTTCTCGC GGGTACGGGA GCGTCACTGG CCATTATACT AGGTACTATC
CTGGTTCTAG TTAACGGAAT GTCAGTTTAT ATCTTATCTA GGAAATTTAC TAAAACTGGA
GGGTATTACA CCTACGCATA TTATTCCCTA ACGAAGAGGC TTGGCTTTGA GACAGGCTGG
CTGTATCTCC TCTACTCCTC CATGTACGGT TCGGCATACG TATTAGGCGC CTCCTACATT
CTTTCCACAA TAATCCCTGT TCCTGCCATC GCGGTTGCAG CAATAATCCT CACTATTTCC
AGCATATTTG CAATTTTGGG AATAAAACCC ACAGCCAAGT ACGCCATAGT TGCAAGCCTG
ATAGAGATAG GTATGATGGC TGCCCTAGCT GTGCTATTTA TGGCCTCCAC GCATTTCTAC
CTTTACAACC CAATTCCGTC CAACATTAAT CTGTCCACTC TTGCCTTGGC AATTCTTTTC
GGCTCTTCTA TTCCCACAGG TTACGGTTCC ATAACTCCCT TATCAGGGGA AGTTAAGAAT
CCAGAAAAGA GTGTTCCAAG GGCTATAATA ACTGTCATAT TGTTAGGCGG GCTACTGGCC
GCATTTGACA TTTATGGAAT AACGGATCAC GTGATCTATT TCCATTTAGT GGCTAATCAA
TTGAATTTAA TTCAGCTTAT TGAGGACAGA TTCGGTCTAT TGACTCTCGC GTTTGTCCTG
TTCGCAGCTG CCAACGACGG TATACTAGCC ACTTTAACGT ATATTATGGC AACCTCTAGA
ACCATTTTCG CTATGTCTAG GGGAGGATTT TTACCAGAGA TACTTGGAAG ACTTGAAACG
GGAAGAGGTC CGCTTTATGC GGTCATAGTC ACTGTGGTAA GTTTCGTGAT CATAGTTCTA
GGCGGAATTC TGATTACGGG TTTTAATGCC TTCCTTGCCT TTTCAATAAC TGGTTTAGTC
TCACTACTGG CAAACATATT TGTTCACTTG GCCTCAGATT TCTCTCTCTT CAAGATCTCA
CTTTCTAAAA TAAACAAAAG AATAAGCTGG TTAGTGCTTT CATTGGGAGG TATAGCGTTC
TCATCATACG AACTACTTCA GTCAATAAGG ACCTCGTCAC CAGTGATAGT CTACTTCTTT
ATGGGAACCA TTATCCTAGG ATTCCTAGCT GCTGAAATAA TAGAAATGAG CGAATCAGGA
AAGGAAGAGG ACTATCCAGA GAGGAATTCA CAGGCAAATC AACAGGGGAG TCATGACCAA
GCCAATCATC ATGAAAAGCA GCAGGTCAAT GGAAAAATCA GGGTGAAAAG GTCGGACGGC
GAGGAGGTAA CCTAG
 
Protein sequence
MSKGSQEQRN EEPKRVIGIL DLIFISLGGQ SPFLSVLTYG VEAYLLAGTG ASLAIILGTI 
LVLVNGMSVY ILSRKFTKTG GYYTYAYYSL TKRLGFETGW LYLLYSSMYG SAYVLGASYI
LSTIIPVPAI AVAAIILTIS SIFAILGIKP TAKYAIVASL IEIGMMAALA VLFMASTHFY
LYNPIPSNIN LSTLALAILF GSSIPTGYGS ITPLSGEVKN PEKSVPRAII TVILLGGLLA
AFDIYGITDH VIYFHLVANQ LNLIQLIEDR FGLLTLAFVL FAAANDGILA TLTYIMATSR
TIFAMSRGGF LPEILGRLET GRGPLYAVIV TVVSFVIIVL GGILITGFNA FLAFSITGLV
SLLANIFVHL ASDFSLFKIS LSKINKRISW LVLSLGGIAF SSYELLQSIR TSSPVIVYFF
MGTIILGFLA AEIIEMSESG KEEDYPERNS QANQQGSHDQ ANHHEKQQVN GKIRVKRSDG
EEVT
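
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed, with a minimal sketch like the one below. It assumes the gene sequence is pasted in as shown (blocks of ten bases separated by spaces) and relies on the fact that translation table 11 assigns the same amino acids as the standard code. The helper names `clean`, `gc_content` and `translate` are illustrative and not part of the record.

```python
# Codon table in the compact NCBI layout (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGTCAAAGG GATCTCAAGA GCAGCGAAAT"))  # -> MSKGSQEQRN
print(translate("GAGGAGGTAA CCTAG"))                  # -> EEVT
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content listed in the record.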