Gene Msed_1492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1492 
Symbol 
ID5104739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1457763 
End bp1458914 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content46% 
IMG OID640507380 
Productvon Willebrand factor, type A 
Protein accessionYP_001191573 
Protein GI146304257 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTTT CCATGAAGGT AGAGGTAAGC CACAAGTACT CTTTCAACAG CGACCTAAAG 
ATGGCTTTTA AAATTCTCCT AGTCCCAGAG AAGATATCTA CAGCCACAGG ATTTCACTAT
ATTGTTCTCC TGGACACCAG TGGATCCATG GACGGTCTTA AGATTGAAAG TGCTAAGAAG
GGGGCAATAG AGCTACTTAA AAGGATACCA CAGGGCAATA AGGTGTCATT CGTCACCTTT
TCCAGTAGGG TTAACATCGT GAGAGAGTTC GTGGATCCGG AGGATCTTAC GGCAGAGATT
TCGAGCCTAT CCGCTGGCGG TCAAACAGCC TTCTTTACCG CTCTTCTCAC CGCGTTCAAT
CTTCACAACA AGCACGGAAT TCCAAGTTAT GTGATCTTAT TAACGGACGG AAATCCCACT
GATGATACAA ACGTTGAGAC ATACAAGAGG ATAGCCATAC CCAATGGCGT TCAGACCATA
TCCTTTGGAC TCGGTGATGA TTATAACGAA ACCATACTCA AGTCTCTAGC TGACAGATCA
GGTGGAGTCT TCTATCACGT AAATGATGCC ATGGAAATTC CAGAGAAACT TCCCAAAGCT
GCAAAAACCA AGATAGCTGC TAAGAACGTT ACAGTGGATA TAGTCGCTGA GTCCAATGTG
AAACTGCTAA ACTATTCTGG TCCTCCAGTA CAATTGAACG CGGTTGAGGG AGTAGTCAAG
ATACTTGGCG AAGCTGTGGT TCCTCCCAAC TATAGTGGAA ACTTTATGAC AGTTAAGGCA
AACTATGAGG AGCCAGTAGA CGGGAGAAAG CAAGCACTTC TGAGCGTAGT TAACATAAAA
CCGGCAGATA GTCAGGCTAC CTTCGTGAGT GGAGTGAACA AGGATGTTCT CCTAGAGTAC
GAGTACTTCA ACAACCTTCA GAAGATATCC AGCGAAGTAC AGGCTGGTAA CCTGGTGGAG
GCAACCAGGA CACTTAAGAG GATGGAGGAA ATAGCTGGCC AAACAAGAAA GATTGAACTC
ATGGAGACCA CGCGAAGGTT ATCAGATAGC TTAGAGACCA CAAAGAGGTC AGGAAATGCC
ACGGAGCAAA CCAGAAAGCT GTCGAAGGAA GTCTCGAGCG AAGTCACAAG AAAGCTCAGG
GGAGAGAGTT AG
 
Protein sequence
MTLSMKVEVS HKYSFNSDLK MAFKILLVPE KISTATGFHY IVLLDTSGSM DGLKIESAKK 
GAIELLKRIP QGNKVSFVTF SSRVNIVREF VDPEDLTAEI SSLSAGGQTA FFTALLTAFN
LHNKHGIPSY VILLTDGNPT DDTNVETYKR IAIPNGVQTI SFGLGDDYNE TILKSLADRS
GGVFYHVNDA MEIPEKLPKA AKTKIAAKNV TVDIVAESNV KLLNYSGPPV QLNAVEGVVK
ILGEAVVPPN YSGNFMTVKA NYEEPVDGRK QALLSVVNIK PADSQATFVS GVNKDVLLEY
EYFNNLQKIS SEVQAGNLVE ATRTLKRMEE IAGQTRKIEL METTRRLSDS LETTKRSGNA
TEQTRKLSKE VSSEVTRKLR GES