Gene Mmwyl1_3541 details

Gene Information

Locus tag: Mmwyl1_3541
Symbol:
ID: 5365478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 3998372
End bp: 3999295
Gene Length: 924 bp
Protein Length: 307 aa
Translation table: 11
GC content: 44%
IMG OID: 640805913
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001342379
Protein GI: 152997544
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
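
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates are assumed 1-based and inclusive.
start_bp = 3998372
end_bp = 3999295

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "924 307"
```

Under these assumptions the computed values agree with the reported Gene Length (924 bp) and Protein Length (307 aa).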


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.608984
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0531608
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTGGAT TTATACCGTT AAAGCCTATT CCCATAAAAG ATCGAAATTC CATGATTTTT 
GTTGGTATGG GGCGTATTGA TGTTCGAGAT GGTGCTTTTG TTGTCATTGA CGATGTTAAC
GGCGAGAGAA TGCACATTCC TGTGGGATCA ATTGCATGCT TAATGCTAGA GCCAGGGACG
AGAATTAGTC ATGCGGCCAT TAAACTTGCG TCAACTACCG GAACTTTAGT TATTTGGGTG
GGCGAAGCGG GAGTTCGACT TTATTCAGCG GGGCAGCCAG GCGGGGCTAG ATCAGATAAG
CTTCTTTACC AAGCTCAACT TGCCTTAGAC GAATCATTAC GACTAAAAGT TGTTCGTAAA
ATGTTTGAAA TTCGTTTTGG TGAAGAAGCA CCACAACGAA GAAGCGTTGA GCAACTTCGA
GGCATCGAAG GAGCAAGAGT TCGACAAATT TATCAAATTC TAGCCAAACA ATATAACGTC
CAGTGGAATG GTCGTCGTTA CGATCCAAAA GATTGGGAAT CAGGAGACAA GGTAAATCAA
TGTCTCAGTG CGGCAACGGC TTGTTTATAT GGTGTGACAG AAGCGGCTAT TTTGGCTGCA
GGGTATGCGC CTGCTATTGG TTTCTTGCAT ACAGGTAAAC CTCTGTCGTT CGTCTATGAT
ATTGCTGATT TAATTAAATT TGAAACCGTT GTACCAGTGG CCTTTAAAAT CGCTGCAAAA
TCCCCTTATC AGCCAGACAA AGAAGTTCGG ATTGCTTGTC GTGAAGTGTT TCGAACTGAA
AAAGTCTTAA AGCGATTAAT ACCGCTCATC GAAGAAGTGC TTGCCGCTGG TGAAATAGAG
CCACCAAAGC CACCAGAGGA CTCGCAGCCG CCAGCGATAC CGGAGTCTGC ATCTTTAGGT
GATGCTGGAC ATAGGAGTAA ATAG
 
Protein sequence
MAGFIPLKPI PIKDRNSMIF VGMGRIDVRD GAFVVIDDVN GERMHIPVGS IACLMLEPGT 
RISHAAIKLA STTGTLVIWV GEAGVRLYSA GQPGGARSDK LLYQAQLALD ESLRLKVVRK
MFEIRFGEEA PQRRSVEQLR GIEGARVRQI YQILAKQYNV QWNGRRYDPK DWESGDKVNQ
CLSAATACLY GVTEAAILAA GYAPAIGFLH TGKPLSFVYD IADLIKFETV VPVAFKIAAK
SPYQPDKEVR IACREVFRTE KVLKRLIPLI EEVLAAGEIE PPKPPEDSQP PAIPESASLG
DAGHRSK
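
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of NCBI table 11 (identical to the standard code for internal codons), treats alternative start codons as out of scope since this gene begins with ATG, and stops at the first stop codon. The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, listed in TCAG codon
# order (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last stretches of the gene sequence, copied from the record.
first_block = "ATGGCTGGAT TTATACCGTT"
last_block = "GATGCTGGAC ATAGGAGTAA ATAG"

print(translate(first_block))  # prints "MAGFIP"
print(translate(last_block))   # prints "DAGHRSK"
```

The two fragments reproduce the start (MAGFIP) and end (DAGHRSK) of the listed protein sequence, and the final TAG codon is read as the stop.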