Gene Information |
Locus tag | Ent638_1401 |
Symbol | |
ID | 5114366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1536099 |
End bp | 1537076 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640491588 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001176133 |
Protein GI | 146311059 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
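The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both are assumptions, although the numbers in the record are consistent with them. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 1536099
END_BP = 1537076
GENE_LENGTH_BP = 978
PROTEIN_LENGTH_AA = 325


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 978 bp, which is 326 codons, or 325 residues plus a stop codon.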
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.612871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.155418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAA ATGGAATTAC GCCTTCGGAT CTCAAAACCA TTCTCCATTC AAAACGTGCC AATATTTATT ATCTGGAAAA ATGTCGTGTT CAGGTCAACG GTGGGCGAGT GGAGTATGTT ACACAGGAAG GTAAAGATTC TTTTTACTGG AACATCCCTA TTGCCAATAC TACGGCAGTC ATGCTCGGAA TGGGTACGTC GGTCACTCAA ATGGCGATGC GGGAATTTGC TCGCGCAGGG GTTATGGTCG GCTTTTGTGG CACGGACGGT ACACCGCTTT ATTCGGCTAA CGAAGTGGAT ATTGATGTTT CCTGGTTTTG TCCACAAAGT GAATATCGAC CTACCAGCTA TTTACAGAAT TGGGTGTCAT TCTGGTTTGA TGAGCAAAAA AGGCTGCAAG CCGCAAAACA ATTCCAGTAT ATCCGGTTGC AACAAATCGA AAAATATTGG CTGGCATCAA AAAAACAGCG TGATAAATCA TTTCATCCAG ACAGTCAAAA TCTGAAAAAT AGTCTTGAAC GCGCCAGACT GGCAATGGAA TCTGCAAACG ATCACACCAC GCTGATGCTT CAGGAAGCAC AGCTGACCAA ATCGCTTTAT AAACTGGTTA GCCAAACCGT TGGCTATGGC CATTTTACTC GCGCTAAACG TGGCGGTGGC GTGGATATGG CAAACCGCTT TCTCGATCAG GGAAATTATC TGGCCTACGG GCTGGCCGCT GTCGCCACAT GGGTAACGGG AATACCGCAT GGTCTTGCGG TGATGCACGG CAAAACCCGT CGCGGCGGGT TGGTATTCGA TATTGCGGAT TTGATTAAGG ACGCGCTTGT CATGCCGCAG GCATTCCTGG CTGCCATGGA AGGCGAGGAT AATCAAATGT TTCGCCAGCG ATGCATAAAT GCTTTTCAGC AGGCTGACGC CCTGGACCTC ATGATTTCCT CCCTTCAGGA AACGGCGGAA GGGAGTGCGC TGCGATGA
|
Protein sequence | MSVNGITPSD LKTILHSKRA NIYYLEKCRV QVNGGRVEYV TQEGKDSFYW NIPIANTTAV MLGMGTSVTQ MAMREFARAG VMVGFCGTDG TPLYSANEVD IDVSWFCPQS EYRPTSYLQN WVSFWFDEQK RLQAAKQFQY IRLQQIEKYW LASKKQRDKS FHPDSQNLKN SLERARLAME SANDHTTLML QEAQLTKSLY KLVSQTVGYG HFTRAKRGGG VDMANRFLDQ GNYLAYGLAA VATWVTGIPH GLAVMHGKTR RGGLVFDIAD LIKDALVMPQ AFLAAMEGED NQMFRQRCIN AFQQADALDL MISSLQETAE GSALR
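The gene and protein sequences can be related with a small translation routine. The sketch below is a minimal, standard-library-only example, not the method used to produce this record. It builds a codon table from the amino-acid assignments of NCBI translation table 11, which match the standard code for internal codons. It also strips the 10-character spacing used above and computes a GC fraction. The function names are hypothetical, and the example translates only the first 30 bases of the gene sequence.

```python
from itertools import product

# Codons enumerated in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 differs from the standard code only in its permitted start codons,
# which this sketch does not model (the gene above starts with ATG anyway).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used for 10-base grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled here).
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "ATGTCTGTAA ATGGAATTAC GCCTTCGGAT"
print(translate(first_30))  # MSVNGITPSD
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein.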
|
| |