Gene Msed_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1022 
Symbol 
ID5104325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp945179 
End bp946396 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content38% 
IMG OID640506921 
Producthypothetical protein 
Protein accessionYP_001191114 
Protein GI146303798 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.136217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAATCA GGGACAAACA ACTAATTGAG AAAGTGGGTA AGTATTACCT AGCTTACAGG 
GGAGATATAG ACCTAGACAG ACTGTTAACC CAGGGCCTAG AGAACATAGA AATTAATGAA
ATAACTAAAT ACTTTATTCT GTCATCCATG ACCACATGGA TCGTAGATAG AGCTTTGGCC
TACCTCGAGC AGGCCATCGA AGCCAGGCTA ACCTATAGGG AATTTGTAAT AAGTTATGAT
CGCGAACCTA TGGGAGCTAT TGATTTACCT AGAAGCATTC CTGTGATGTC AAGGGGAATT
TATGCGTACT ATACTTACAT AAAGGGGTAT GATGCGCCGG AGTACGCTAT CATGAATTAT
CTCTTGAAGC GTATATATAG TACAGCTTTA CAATACTACA ATAAAATAAA GGATGTTCGA
GAGGAAATCA AATACTTTCG CGTGAAGGGT AGGATGAAAA CTAGACTGGA TCGGCTAAGG
AAAGGTCTCA GCTACTTTAA GGGAGAATAT TTTAGACCTT TGACAGATTA CGATCCCGAG
TGGCTAAGGG AGACTTTCAA TCTCTACTAT ACATTATCTC AGCTAAAGGA ACTTTCCTTG
GGCATTTCTA CTCAGAAGGC GCCGTCTATG AATAAGAAAA TGCTTAAAGT GATTTTGTGG
AAATTGTATG AGCTTTACGT TTTCTTCATT TTCGTTAAAT ATCTAGAGAG GGAAGGATTC
GATGTAGCGA AGGAAAACGG AAGATATGTG GCCAAGAAAG GAAACAGAAG ACTTAGCTTA
ATCCTTAATA GCGATCTAGA TTTCTCGCAA CTGGACTCCG TTGATGACTT AGATAATACT
GAGATTTTCA GAGGTAGACC TGATCTCTCA TTGGTAGCTG GAAATTCTGT ACTAGTCGAA
TGCAAATATT CTAGCAAGGT TGGATATATT ACCTCTAGCA GATTTAAACT CATGGCATAT
GCTTATGAAT ACAATCCTCT TACCGCGATA CTTATTTATC CAGGATTAGA TAAGGAGGTT
GAGGTCATGG ATTCGGAGGA GAAAGCAACG TACCAGATCA ATGAGAAGGC CAAGGAGGAA
GGATTCGTGG ATATTAATTT CAAAAATTCC AAAAAATTAT ATATAGTGGT CCTAAATCCT
GCTGATGATG ACGAAACTAA CGAGGAGAAA ATAGCAAGGA TATTTACATC AAATAGTTAC
CTAAGCAAGT TATTATGA
 
Protein sequence
MLIRDKQLIE KVGKYYLAYR GDIDLDRLLT QGLENIEINE ITKYFILSSM TTWIVDRALA 
YLEQAIEARL TYREFVISYD REPMGAIDLP RSIPVMSRGI YAYYTYIKGY DAPEYAIMNY
LLKRIYSTAL QYYNKIKDVR EEIKYFRVKG RMKTRLDRLR KGLSYFKGEY FRPLTDYDPE
WLRETFNLYY TLSQLKELSL GISTQKAPSM NKKMLKVILW KLYELYVFFI FVKYLEREGF
DVAKENGRYV AKKGNRRLSL ILNSDLDFSQ LDSVDDLDNT EIFRGRPDLS LVAGNSVLVE
CKYSSKVGYI TSSRFKLMAY AYEYNPLTAI LIYPGLDKEV EVMDSEEKAT YQINEKAKEE
GFVDINFKNS KKLYIVVLNP ADDDETNEEK IARIFTSNSY LSKLL