Gene Information |
Locus tag | Mfla_0606 |
Symbol | |
ID | 3999246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacillus flagellatus KT |
Kingdom | Bacteria |
Replicon accession | NC_007947 |
Strand | + |
Start bp | 631099 |
End bp | 632115 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637937506 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_544717 |
Protein GI | 91774961 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
| |
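The coordinate and length fields in the Gene Information table agree with each other. The short sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(631099, 632115)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Both results match the Gene Length (1017 bp) and Protein Length (338 aa) rows.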
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.506127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.66678 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TGCTGAATAC GCTCTATGTA ACGAGGCAGG AAACCTACGT ACATAAGGAG CGCGATACCA TTGTCATCAA CAGCGCAGGG GAGAAGCTGG CTCAATTTCC CGCCCTCACG ATCAGTAATA TTCTCTGCTT TGGTCAAGTA TCTGTTTCTC CACCTATGAT GGGATACTGT GGTGAAAATG GAATAGGACT GGTTTTCTAT ACTGAATATG GTCGGTTCCT AGCCAGGATG ATAGGTAAGC AGACTGGCAA CGTACTACTG CGGCGTACGC AATATCGGCT AGCGGATGAT CCTGAAAAAG CCAATGCCAT TGCCAGGGTA ATGGTGGGCG CGAAGATTGC CAATGCACGT ACTGTGCTGA TGAGGGAAAT TCGTAATCAT GGCGAACATG CGAATATAAG TCGTGCGGTC GAGCAATTGG GATATAGCCT GAAAAGCTGC CAATTCAGTC AATCTGTACC ACAGACGATG GGAATAGAAG GGGATGCTGC CGCAACCTAT TTTTCAGTAT TCAATGGATT GATACGAGGC AGCAGATTTC AGTTTAATGG CCGTATTCGT CGACCAGCCA CAGATCCTGT CAATGCCTTG TTATCCTTCA TCTATTCCCT GATTACTCAA GAGTGTGTTT CTGCCTTGAT GGGGGTGGGC TTGGATCCAT ATGTTGGATT TTTACATCAG GATAGGCCTG GTAGACCTAG CCTTGCTTTG GATTTGTTGG AAGAGTTCCG GGCATCGTGG GCAGATCGAT TTGTCTTGAC ATTGCTCAAT CGAAGACAAC TATCACCGCA AGCATTTGTT ACAGAAGCCA GTGGTGCTGT ACGGCTGACG GATGATACGA GAAAACAAGT ATTAGTTGCT TTTCAAGAGC GCAAACAGGA AGAGGTTGTA CATCCCTACC TTCAGGAAAA GGTGCCTTAT GGGTTATTGC CTCATTGCCA AGCATTGTTA TTGGCCAGAC ATCTGCGTGG AGATATGGAG TTTTATACTC CATATCTGAT TAAGTGA
|
Protein sequence | MKKLLNTLYV TRQETYVHKE RDTIVINSAG EKLAQFPALT ISNILCFGQV SVSPPMMGYC GENGIGLVFY TEYGRFLARM IGKQTGNVLL RRTQYRLADD PEKANAIARV MVGAKIANAR TVLMREIRNH GEHANISRAV EQLGYSLKSC QFSQSVPQTM GIEGDAAATY FSVFNGLIRG SRFQFNGRIR RPATDPVNAL LSFIYSLITQ ECVSALMGVG LDPYVGFLHQ DRPGRPSLAL DLLEEFRASW ADRFVLTLLN RRQLSPQAFV TEASGAVRLT DDTRKQVLVA FQERKQEEVV HPYLQEKVPY GLLPHCQALL LARHLRGDME FYTPYLIK
|
| |
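The sequences above are printed in blocks of ten characters separated by spaces. The following sketch shows one way to strip that spacing, compute a GC fraction, and translate a coding sequence. It assumes the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code; table 11 differs in its permitted start codons, and that does not matter here because this gene begins with ATG. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGAAGAAAT TGCTGAATAC GCTCTATGTA"
print(translate(first_blocks))  # MKKLLNTLYV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to gc_content and translate would allow the GC content and Protein sequence rows to be checked in the same way.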