Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1974 |
Symbol | |
ID | 8416285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2312845 |
End bp | 2313876 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 645024951 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003182327 |
Protein GI | 257791721 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.522968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACATT TGCTCAACAC GCTTTACGTG TTCACGGAAG ACGCGTATCT CATGCTCGAT GGGGAGAACG TGGTTGTACG TCGCGATGGC GCAGAGCTTG GACGAGTGCC GCTCCATACG CTTGAGGGTA TCCTCTGCTT CTCGTACCGA GGTGCAAGTC CTGCCCTTAT GGGCGCCTGC GTCGAACGGG GCGTCGGCCT GTCTTTCTTC GATCGGAAGG GTCGTTTCCT TGCAGGCGCA TACGGCGAGC AACGCGGAAA CGTGCTGCTG CGAAAAACGC AGTATGCATG GTCTGAAGAA ACCGAAAAGA GCCTTGCTAT CGCTCGGAAT TTCATTGTGG GAAAGCTCTA CAACGGTCGT TGGGTGCTCG AACGGGCGGT GCGCGACCAT GGGTTGCGTA TTGATACGGG ATCGGTGAAG GCAGCCAGCA TGAGGCTTGA CGGTTCGCTT CGCAGTGCGG CTGAGTGCGA AACGATGGAT TCCTTGCGAG GGATTGAGGG CGACGCGGCT GCAGAGTATT TCGGCGTGTT CGATGAGCTG GTGTTGCGAG ATAAGGAGAC GTTTCGTTTT ACAGGTAGAG CGCGGCGTCC TCCGACCGAT GCGATGAACG CCATGCTGTC TCTGTTCTAC ACGGTTCTGG CTTTCGATTG CGCGTCTGCG CTTGAGGGTG TTGGGCTTGA TCCGTTCGTG GGTTTCATGC ATGCGGATAG GCCTGGTCGA CGCTCGCTTG CTTTGGACTT AATGGAGGAG CTGCGGCCGG TGTTCGTCGA TCGGTTCGTG CTTTCCGCCT TGAACAACCG GGTGGTGAGT TCGAAGGATT TTGAGAAGCG GGAATCGGGT GAGGTTCGGT TGGGCGATGA AGGGAGACGG GCTTTGTTCG GCGCTTGGCA AGAGCGAAAG AAGGAGACGA TCACCCATCC CTTTTTGAAG GAGAAGATTC CCCGGGGGCT TGTGCCGCAT GTGCAGGCGC TGTTGCTCGC TCGTTGCTTG CGCGGCGATC TAGACGGCTA CCCGCCGTTT CTATGGAAGT GA
|
Protein sequence | MRHLLNTLYV FTEDAYLMLD GENVVVRRDG AELGRVPLHT LEGILCFSYR GASPALMGAC VERGVGLSFF DRKGRFLAGA YGEQRGNVLL RKTQYAWSEE TEKSLAIARN FIVGKLYNGR WVLERAVRDH GLRIDTGSVK AASMRLDGSL RSAAECETMD SLRGIEGDAA AEYFGVFDEL VLRDKETFRF TGRARRPPTD AMNAMLSLFY TVLAFDCASA LEGVGLDPFV GFMHADRPGR RSLALDLMEE LRPVFVDRFV LSALNNRVVS SKDFEKRESG EVRLGDEGRR ALFGAWQERK KETITHPFLK EKIPRGLVPH VQALLLARCL RGDLDGYPPF LWK
|
| |