Gene Msed_1383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1383 
Symbol 
ID5104593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1359381 
End bp1360472 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content47% 
IMG OID640507272 
Productvon Willebrand factor, type A 
Protein accessionYP_001191465 
Protein GI146304149 
COG category[R] General function prediction only 
COG ID[COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.130674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000484488 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCAGAA TTGTCCTTGT TCCTGAATCG AAATTTGAAG CTAAGAACCT CCACTATGTG 
ATCCTGATTG ATAGGAGTTA CTCCATGAAG GGTGAGAAGC TGGAGATGGC CAAGGAGGGA
GCTAGGTTAC TTGTTGATAA CCTGCCCAAG GATAGTCGTT TCTCCTTACT GGCCTTCAAC
GAAAAGGTGT CGATAATCAA GGAGCATGAA CATCCTTCAG AGATGGGGAA GGAACTTGAG
AGCCTTAAAG TTGGAAGCGG TACCGCAATG TATAAGGCAT TACAGGAGGC ATTTAACCTA
GCTAGAAAGT ACGGCGAACC AACATACGTT ATATTGCTCA CTGATGGGGT TCCCTCAGAC
ATGGGATGTA TGCCTGGGCT ATCTAGGAAA TTTGACCTAA ACAGATGTCT TCCCGTATAT
CAGGGATTGT CAGTACCTGA GAACGTACAG ATCATATCCT TTGGAATTGG AGATGACTAC
AGCGAAGAAA TACTCACTGA GGTTTCAGAA AAGGGAAGAG GCTTCTTTTA TCATGTTACT
GACCCTGCTC AAATCCCTGA GAAGATGCCC AAGCTGGTAA AATCTGAGGT TGCTGCTAGT
GACGTAACGG TGGATCTAGT GTCCGAGTCC CCTGTGAAAT TACTGAACTA TGATTCCCTG
CCTGTAAGGA TAAACGCTGT TGAAGGCGTG GTAAAAATTT TTGGAGAAAC GGTAATTCCC
AAGGAGTATA CGGGAAAGTT TATGACGCTT AAGGTGAAGT ACAGGGATGA GAAGGGGATT
CGGGATAGGA CACAGGAGTT CTTCCTGACC CGCGCACAGA ACCAGCAGGA CTTCATCTCA
GCGATCGACA GAGACGTCAT CATGGAGTAT GAATATCTGC AAACGCTTCA GAACTACTCC
AGGGATCTAG AGGCTAGAAA CCTGGTTGAG GCCACGAAGA AGTTGGATAG GCTAAGGGAG
ATAGCGGAGC AGACCAGGAG ACAGGACCTT CAGGAGGTGG CGGAGGAACT CACGAGAAAA
ATGACAAGCG GAGAGGGTAA CCCGAAGGAA ATTGCGAGCG AGGTTACAAG GAAGATGAGG
GGTGCGGAGT AG
 
Protein sequence
MFRIVLVPES KFEAKNLHYV ILIDRSYSMK GEKLEMAKEG ARLLVDNLPK DSRFSLLAFN 
EKVSIIKEHE HPSEMGKELE SLKVGSGTAM YKALQEAFNL ARKYGEPTYV ILLTDGVPSD
MGCMPGLSRK FDLNRCLPVY QGLSVPENVQ IISFGIGDDY SEEILTEVSE KGRGFFYHVT
DPAQIPEKMP KLVKSEVAAS DVTVDLVSES PVKLLNYDSL PVRINAVEGV VKIFGETVIP
KEYTGKFMTL KVKYRDEKGI RDRTQEFFLT RAQNQQDFIS AIDRDVIMEY EYLQTLQNYS
RDLEARNLVE ATKKLDRLRE IAEQTRRQDL QEVAEELTRK MTSGEGNPKE IASEVTRKMR
GAE