Gene Msed_1147 details


Gene Information

Locus tag: Msed_1147
Symbol:
ID: 5103495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1093126
End bp: 1094034
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 53%
IMG OID: 640507039
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001191232
Protein GI: 146303916
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
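
The coordinates and lengths listed above are mutually consistent, which can be checked with a minimal sketch. It assumes, although the record does not say so, that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1093126, 1094034)
print(length)                  # 909
print(protein_length(length))  # 302
```

Both values agree with the Gene Length and Protein Length fields of this record.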


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.253731
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTCAC TCGTCATCTC GGATTACGGG AGTTACGTTA CCGTGAAGAG GGGAATGTTC 
CTAGTCTCGC GAAAGGTTAA CGACAAGGAG GAGAGGAGGG AGGTATCCCC GAGCGAAGTT
GATGAGATTC TGTTCTGCTC CACGTCTTTG GTCTCAACCC ACGTGTTAAG GGTGGCCTTG
TCAAGGGGAA TAACGGTTGC CTTCCTGGAC TCAAGGGGGC AGATCTGGGG CCTCCTCCTC
CCCTCAGTGG TTACGGAGAC CGTGAGGACA AAGAAGGCCC AGTATGAGGC AGTTGCCTCT
GGACTGGATT ACGGGAAGGA GATCATAAGG GCGAAGATAA ACAACCAGGT GGTCCATCTC
AAGTATTGGG CAAGGAGAGG GGTAAAGACG GATTACCGTG AGCTTGAGGG AAAGGATGAG
GCCACTGCTG CAAGGATTTA CTGGCAGAAC CTGTCTCAGG TTGTCCCTGG CTTTCGCGGA
AGGGACGTTG AGGGAGGGGA TGGATTCAAC TCAGCGTTGA ACTACTCCTA CGCTATCCTG
TACTCTCGGG TAATGAGGGC CCTAGTCCTA GCGGGTCTCG ATCCCTACCT GGGATTTGTA
CACAAGGACA GGCCAGGTAA TGAGAGTTTG GTCTACGACT TCTCGGAGAT GTTTAAGCCC
TACGTGGACC TGGTACTGGC TAAGGCTTTC AAGGATGGTC TAGAGGTGAA GTTGAAGGGA
GGCCTCATGG ACAAGGAAAG CAGGGGAGCA GTTGCTAAAC TCGTGGTAAA GGGCCTAGAG
GAGAAGGTTA AGGAGGAACT TGACCACAAC CCCAAGAGCT TGAACCAGGC GATACGGGCT
CACGCCTTGA AGTTTGCTTC TGCGTTGAGG GAAAAGAGGG AGTATAGGGG GTTCAGGATG
GTGGTTTGA
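
The gene sequence is shown in space-separated blocks of ten bases, so it has to be joined before any analysis. The sketch below strips the whitespace and computes a GC fraction. The record does not say how its GC content figure was derived; the sketch assumes the plain fraction of G and C among all bases, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGAACTCAC TCGTCATCTC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Pasting the full gene sequence into `clean_sequence` should yield a 909-character string if the Gene Length field is accurate.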
 
Protein sequence
MNSLVISDYG SYVTVKRGMF LVSRKVNDKE ERREVSPSEV DEILFCSTSL VSTHVLRVAL 
SRGITVAFLD SRGQIWGLLL PSVVTETVRT KKAQYEAVAS GLDYGKEIIR AKINNQVVHL
KYWARRGVKT DYRELEGKDE ATAARIYWQN LSQVVPGFRG RDVEGGDGFN SALNYSYAIL
YSRVMRALVL AGLDPYLGFV HKDRPGNESL VYDFSEMFKP YVDLVLAKAF KDGLEVKLKG
GLMDKESRGA VAKLVVKGLE EKVKEELDHN PKSLNQAIRA HALKFASALR EKREYRGFRM
VV
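
The protein sequence can be compared against a translation of the gene sequence. The sketch below is a minimal, dependency-free translator, not the tool used to produce this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so the standard table is used here; the function name `translate` is illustrative.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(raw: str) -> str:
    """Translate a DNA sequence (whitespace allowed) codon by codon.

    Stop codons are rendered as '*'.
    """
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks and the final line of the gene sequence above.
print(translate("ATGAACTCAC TCGTCATCTC GGATTACGGG"))  # MNSLVISDYG
print(translate("GTGGTTTGA"))                          # VV*
```

The two fragments reproduce the first ten and the last two residues of the listed protein sequence, followed by the stop codon.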