Gene Information |
Locus tag | Msed_0335 |
Symbol | |
ID | 5105493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 291950 |
End bp | 293134 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506241 |
Product | von Willebrand factor, type A |
Protein accession | YP_001190436 |
Protein GI | 146303120 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
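The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. The record does not state either convention, so both are assumptions. The function names are illustrative and do not belong to any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 291950
end_bp = 293134
gene_length_bp = 1185
protein_length_aa = 394


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 293134 - 291950 + 1 gives 1185 bp, and 1185 / 3 - 1 gives 394 aa, matching the table.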
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCTTT CTCTACGGCT AGAGGGCTCT CACTTATTTT CGTGGAATGG GGAGCTCAAG TTTGCGTTTC GAGCGACCAT AGTCCCGGAG AGAGTGAAAC CTGTACCCCT TGACCTCTTC ATAGTCCTTG ACGTGAGCGG TTCCATGGGT ATTATTGATA ACCCTCCTGA GGTAGACGAT AGCCTGATTG CAGGCACGGC CGAGGTTGAT GGACACGTAG TTAGGTACTT GAAGGACGAC ATAGGCGTTA ACAACAGGTT AGAGGTGGCA CTCGAGGCCA TAAGGAACCT TTTGGAGAAC GCTGATACTT CCACAAGGGT TACGATTATC ACGTTCTCGG ACCACGTGAA CGTTCTCTGC AGGAGGGTTA CACCTAGTAC GGCCCTGGAG CACTTAGAGG AAATAGTCCC TGACGGAAAC ACTGCCCTCT ACTCCGCAGT CAAGAAGGCC ATTTCCCTCA TTGACGAACA TCCAGCCAGA GTATTACTCA TCACCGATGG CTATCCCACT GATGTGGAGG ATGAGACGGA GTACTCTAAG CTAGAGGTCC CTAGATTCTC GCAGTTCATT CCCATTGGCG TAGGCGAGTA TAACGCGAAA ATCCTACGCA GTTTGGCAGA CCTTAGTAAC GGACGCTTCT ATCACGTGAA TGACGTGAGC GAGATCTCAA GGATAATGGA GGAAGAGAGG GCGAAGCCAT CTGGTGGAGT GAAGGTCAGG GTAGACGTCC TCTCTAAGTT TCCCGTGAAT TATGTGAACT ACACCCCTCC GATCTACATT GGTACAGTTG AGGGTGTCAC AAGGATTTAC GGTTTCATTC AAGTGCCCCC CAAATACTCT GGCGAACTCG TGAGGGTTAA GTTAACCTAC ACAGACACGC TTGATGACAG GGAGTATTCC CTAGAGAAGT TCATCTCCGT AATCCCAGCG ACGGACAGTG CGCAGTTCGT CTCTGGGCTA AACAAGTACC TTCTATGGGA GGCAGAATAT TACGAGAAGA TGAAGGAGAT ATCCAAGCTC CTGGAGTCGG GCATGCAGGT CGAGGCAACC AGGAAGATGC AGGAGCTTAA GGATATTGCA GAAAGGACGA GAAAGGCAGA TCTGATTGAG GCTACGAAGA AGTTGATGAA TTCGAGCGAC GAAAAGGAGA TAAGTAGTGA AATAACGAGG AAAATGAGAT CATGA
|
Protein sequence | MVLSLRLEGS HLFSWNGELK FAFRATIVPE RVKPVPLDLF IVLDVSGSMG IIDNPPEVDD SLIAGTAEVD GHVVRYLKDD IGVNNRLEVA LEAIRNLLEN ADTSTRVTII TFSDHVNVLC RRVTPSTALE HLEEIVPDGN TALYSAVKKA ISLIDEHPAR VLLITDGYPT DVEDETEYSK LEVPRFSQFI PIGVGEYNAK ILRSLADLSN GRFYHVNDVS EISRIMEEER AKPSGGVKVR VDVLSKFPVN YVNYTPPIYI GTVEGVTRIY GFIQVPPKYS GELVRVKLTY TDTLDDREYS LEKFISVIPA TDSAQFVSGL NKYLLWEAEY YEKMKEISKL LESGMQVEAT RKMQELKDIA ERTRKADLIE ATKKLMNSSD EKEISSEITR KMRS
|
| |
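The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It builds the codon table by hand (translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons), strips the spaces between the 10-character blocks, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first 30 bases and the final 15 bases of the gene sequence are used here to keep the example short; the full sequence can be pasted in the same way.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGGTCCTTT CTCTACGGCT AGAGGGCTCT"
protein_prefix = "MVLSLRLEGS"
print(translate(gene_prefix) == clean(protein_prefix))

# Last 15 bases of the gene sequence, which end in the TGA stop codon.
gene_suffix = "AAAATGAGAT CATGA"
print(translate(gene_suffix) == "KMRS")
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field in the table.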