Gene Msed_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1141 
Symbol 
ID5103489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1077940 
End bp1079418 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content51% 
IMG OID640507033 
ProductCRISPR-associated helicase Cas3 family protein protein 
Protein accessionYP_001191226 
Protein GI146303910 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.607397 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCT TAGTTGATTA TTACGGAGAG GCTTGTAAGC TACAAGGGTT TGAGCCTAGG 
AAGGGGATAG AGGAGACCCT TTCCAAGATA GAGGAGGGGA AAGCAGTCAT CCTAACTGCC
CCAACGGGTT ACGGCAAGAC CTCACTTACG TATGCCTTGG GACTGGCCAG TCTCAGGGGG
AACGGGCACT TTGATAGGGT AATACACGTC CTGCCCCTTA GAAGTATTGT CCAGGACCTT
ACCTCAAAGC TCGAGAACTT CATGAGGCTA GCGGGTTACT CCAACACAGT TGTTGGGGCC
TACGACATGG ACTTCCACGA CACTCCTTAC TTCCTGAGAA AGGTGAACGT TACGACGCTT
GACTCATTCG TATTGAACAT GTTCAAACTA CCTGTGTCCG AGATTACTAG GGGTATGAAG
GGGATGGGGA CTCACTACGA GGTTCCGAGG GGGGCGATCT ACTCCTCAGT GGTTGTGTTT
GACGAATTCC ATCTTTTCTC CGATGACGGA GGAAAGGACA AAAGCCTGAC CTCAGTGATA
GCGTCACTTA GGGGACTTGG GGCGATGCAG GTCCCCTTCG TAATCATGAC GGCAACCCTT
CCAGGTTCCC TTAGGGACTT GATCAAGGAG GAGCTTGAGG ACGTCGTTGA GGTCGTGGAG
GTCAAGGACA ATTTCAAGAT AGAGAGGGAT GTTAGCGTTG ACTTCGTGGA TGAGCTGGAC
TTCAATAAGC TTGATAGAAG GACTCTCGTG GTCATGAACA CCAGGAAGGG GGCCATCACC
GCGTACCAAG AGGCCAAGAA GGCCGGACTG TCCCCCGTTC TGATTCACTC GAAGTTCAGC
GCCATGGATA GGAGAAGGAA AGTTGACGAG ATCAAGAACG CTAAACTGGT CATCTCAACG
CAGGTGATAG AGGCCGGAAT TGACGTTTCG TTTGACGTCC TCTACACCGA GGCTGCCCCA
CTCCCCAACT TGGTCCAGAG GGCTGGCAGA GTTGCGAGGT ATGGCGGACA GGGAGAGGTT
CACATTCTTC CCTTCAGCGG TCACGTCTAC GATCGGAACG ATGTTGAGAC GAGCCTTGAA
ATTGTGAGAA GGGAGGGCAA ACTTGATTCG TCACTTATGT CGAGTTTTAA CACCAGTTAC
ATCCTGAACT CCGATCTTCT GTTCTCGTTA AATATCTTGG ATGAGGGGCC CTTCTTCTCG
TCGGAGGCAA CTGCGAAACT CCTTAAGAAG GAATGCTCGA TCACAAGGGA GACATCCCTT
ATCATGGGCT TTCCCCAAGG ATGTAGATCC TCAGCCTGCG GGATCCCGCT AACTGAGGAT
GAGGCTAAGG ACTTGTTGGA GAGAGGGGCC AAGCCACTTC GCGATGGAGA ACTAGTTGAC
TGGAAACCTG GGAACCTTTG CCTCTCAATA GATTTCATGC TGAAGGGAAT TGACGGAATC
TCTGTGGACT ACAATCAGGA GGTTGGGGCG ATACTATGA
 
Protein sequence
MSSLVDYYGE ACKLQGFEPR KGIEETLSKI EEGKAVILTA PTGYGKTSLT YALGLASLRG 
NGHFDRVIHV LPLRSIVQDL TSKLENFMRL AGYSNTVVGA YDMDFHDTPY FLRKVNVTTL
DSFVLNMFKL PVSEITRGMK GMGTHYEVPR GAIYSSVVVF DEFHLFSDDG GKDKSLTSVI
ASLRGLGAMQ VPFVIMTATL PGSLRDLIKE ELEDVVEVVE VKDNFKIERD VSVDFVDELD
FNKLDRRTLV VMNTRKGAIT AYQEAKKAGL SPVLIHSKFS AMDRRRKVDE IKNAKLVIST
QVIEAGIDVS FDVLYTEAAP LPNLVQRAGR VARYGGQGEV HILPFSGHVY DRNDVETSLE
IVRREGKLDS SLMSSFNTSY ILNSDLLFSL NILDEGPFFS SEATAKLLKK ECSITRETSL
IMGFPQGCRS SACGIPLTED EAKDLLERGA KPLRDGELVD WKPGNLCLSI DFMLKGIDGI
SVDYNQEVGA IL