Gene Msed_0667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0667 
Symbol 
ID5105273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp609730 
End bp611085 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content46% 
IMG OID640506571 
Productvon Willebrand factor, type A 
Protein accessionYP_001190766 
Protein GI146303450 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.949287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0104959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGTC TCCTTAGAGG TGTAGATTAC GAAAGCCCCG TTGTAAAGTA CAGGGGCGAG 
AGGATACTCA ACACCCTAAG GAGGGTCTCT GGGAAGGAAA GCAACGTAGA TCCCCTATTC
CTCATTGATA CCTATTACGT GCACTATCTC CCCCTACCTA TACTAAAAAC AAAGGGGGAA
ATAGATCAAA GCGACTCCAT TAAGTACTCA TTGATTGATC TTACCATTTC GTCAGAGATT
GTAAACAGGA ACAGAAACTA CTCAATTGCA AACTCAGCAG TGAGTATGGC CCTTTCCGTG
AGTTATGTGC AAAACTTGAT AGAGGAATTG GAGAGAATTA GGAGGACTTC ACAGTCGCAG
GAGGAGAGAG AGGCCGCAGA GCAGATCCTT AACGGAATAA TGAAGGGAAG TCAGGGTAAG
GAGGGAAAGC AGAATCAGAA CCAGCAACAG GAAAACCAGA CCACTGGAAA GCTAATGAAG
CAGGTTCATG AAAAGGCTAT GGCAAAGGCG TCTGAGGATG CTAATTCCGT CAGGAGTATG
CAGAGGATAG TTGGAGGTAA TGGGGCCGGT ACAGGATCGA TGATGAACTT TGAGGGAGAT
ATACACGACG TGCTAAGACT AGCTAGGAAC ACGGAGATCA AGAAGATCCT GGAGTTTCTG
AGCGGTATCC CGAAGCTGGG TAGCTTCACC AAGAAGAGGA CCACAAGATA TGCTAGGGGA
GAGCTTTATG GATATGAGGA AGGTTCAGAC CTAGAGAGAC TGGTTCCCTC AGAACTGGCC
TTACCCGAGG AACTCTTTGA TGTGAAGCTT GCAGAGAGCC AGCTATTACT ATATCAGAAA
CAGATTAAGG AAACCCTAGG ACCCATATAT CTACTATTGG ACAAGTCAGG CAGCATGGAT
GGAGAAAAGA TCCTGTGGGC TAAGGCTGTA GCACTGGCCC TCTACAGTAG GGCTAGAAGG
GAGAACAGGG ACTTCTATCT AAGGTTCTTC GATAACATTC CATATCCTCT GATCAAGGTC
ATAAAGAATG CCAAGAGCAA GGACGTGATC AAGATGGTAG AGTATATTGG GAAGATAAGA
GGTGGAGGCG GAACAGATAT ATCTAGATCT GTAATGTCTG CCTGCGACGA TATAAAGGAC
GGTCATGTTA GGGGTGTAAG CGAGGTCATA ATTTTGACGG ATGGAGAGGA TAAGATCGCT
GAAACCACTG TTAGGAGATC CCTTAAAGAG GCAAATGCTA CGCTCATTAG CGTGATGATA
AGAGGAGATA ACGCGGATCT AAAGAGAGTT TCAGATAACT ACCTAGTGGT TTACCGTCTA
GATCAGGGAG ACCTACTTAG GGTAGTTGAA TCCTAA
 
Protein sequence
MTGLLRGVDY ESPVVKYRGE RILNTLRRVS GKESNVDPLF LIDTYYVHYL PLPILKTKGE 
IDQSDSIKYS LIDLTISSEI VNRNRNYSIA NSAVSMALSV SYVQNLIEEL ERIRRTSQSQ
EEREAAEQIL NGIMKGSQGK EGKQNQNQQQ ENQTTGKLMK QVHEKAMAKA SEDANSVRSM
QRIVGGNGAG TGSMMNFEGD IHDVLRLARN TEIKKILEFL SGIPKLGSFT KKRTTRYARG
ELYGYEEGSD LERLVPSELA LPEELFDVKL AESQLLLYQK QIKETLGPIY LLLDKSGSMD
GEKILWAKAV ALALYSRARR ENRDFYLRFF DNIPYPLIKV IKNAKSKDVI KMVEYIGKIR
GGGGTDISRS VMSACDDIKD GHVRGVSEVI ILTDGEDKIA ETTVRRSLKE ANATLISVMI
RGDNADLKRV SDNYLVVYRL DQGDLLRVVE S