Gene Msed_0335 details


Gene Information

Locus tag: Msed_0335
Symbol:
ID: 5105493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 291950
End bp: 293134
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 49%
IMG OID: 640506241
Product: von Willebrand factor, type A
Protein accession: YP_001190436
Protein GI: 146303120
COG category:
COG ID:
TIGRFAM ID:
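
The coordinates and lengths listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported 1185 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 291950
END_BP = 293134


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1185 394
```

Both results agree with the Gene Length and Protein Length fields of the record.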


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCTTT CTCTACGGCT AGAGGGCTCT CACTTATTTT CGTGGAATGG GGAGCTCAAG 
TTTGCGTTTC GAGCGACCAT AGTCCCGGAG AGAGTGAAAC CTGTACCCCT TGACCTCTTC
ATAGTCCTTG ACGTGAGCGG TTCCATGGGT ATTATTGATA ACCCTCCTGA GGTAGACGAT
AGCCTGATTG CAGGCACGGC CGAGGTTGAT GGACACGTAG TTAGGTACTT GAAGGACGAC
ATAGGCGTTA ACAACAGGTT AGAGGTGGCA CTCGAGGCCA TAAGGAACCT TTTGGAGAAC
GCTGATACTT CCACAAGGGT TACGATTATC ACGTTCTCGG ACCACGTGAA CGTTCTCTGC
AGGAGGGTTA CACCTAGTAC GGCCCTGGAG CACTTAGAGG AAATAGTCCC TGACGGAAAC
ACTGCCCTCT ACTCCGCAGT CAAGAAGGCC ATTTCCCTCA TTGACGAACA TCCAGCCAGA
GTATTACTCA TCACCGATGG CTATCCCACT GATGTGGAGG ATGAGACGGA GTACTCTAAG
CTAGAGGTCC CTAGATTCTC GCAGTTCATT CCCATTGGCG TAGGCGAGTA TAACGCGAAA
ATCCTACGCA GTTTGGCAGA CCTTAGTAAC GGACGCTTCT ATCACGTGAA TGACGTGAGC
GAGATCTCAA GGATAATGGA GGAAGAGAGG GCGAAGCCAT CTGGTGGAGT GAAGGTCAGG
GTAGACGTCC TCTCTAAGTT TCCCGTGAAT TATGTGAACT ACACCCCTCC GATCTACATT
GGTACAGTTG AGGGTGTCAC AAGGATTTAC GGTTTCATTC AAGTGCCCCC CAAATACTCT
GGCGAACTCG TGAGGGTTAA GTTAACCTAC ACAGACACGC TTGATGACAG GGAGTATTCC
CTAGAGAAGT TCATCTCCGT AATCCCAGCG ACGGACAGTG CGCAGTTCGT CTCTGGGCTA
AACAAGTACC TTCTATGGGA GGCAGAATAT TACGAGAAGA TGAAGGAGAT ATCCAAGCTC
CTGGAGTCGG GCATGCAGGT CGAGGCAACC AGGAAGATGC AGGAGCTTAA GGATATTGCA
GAAAGGACGA GAAAGGCAGA TCTGATTGAG GCTACGAAGA AGTTGATGAA TTCGAGCGAC
GAAAAGGAGA TAAGTAGTGA AATAACGAGG AAAATGAGAT CATGA
 
Protein sequence
MVLSLRLEGS HLFSWNGELK FAFRATIVPE RVKPVPLDLF IVLDVSGSMG IIDNPPEVDD 
SLIAGTAEVD GHVVRYLKDD IGVNNRLEVA LEAIRNLLEN ADTSTRVTII TFSDHVNVLC
RRVTPSTALE HLEEIVPDGN TALYSAVKKA ISLIDEHPAR VLLITDGYPT DVEDETEYSK
LEVPRFSQFI PIGVGEYNAK ILRSLADLSN GRFYHVNDVS EISRIMEEER AKPSGGVKVR
VDVLSKFPVN YVNYTPPIYI GTVEGVTRIY GFIQVPPKYS GELVRVKLTY TDTLDDREYS
LEKFISVIPA TDSAQFVSGL NKYLLWEAEY YEKMKEISKL LESGMQVEAT RKMQELKDIA
ERTRKADLIE ATKKLMNSSD EKEISSEITR KMRS
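
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline, that strips the block spacing, computes GC content, and translates the CDS. It assumes the elongation codon assignments of translation table 11 are the same as the standard code (table 11 differs in its permitted start codons) and that the CDS starts with ATG, as it does here. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGGTCCTTT CTCTACGGCT AGAGGGCTCT"))  # prints: MVLSLRLEGS
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check of the record.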