Gene Elen_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1974 
Symbol 
ID8416285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2312845 
End bp2313876 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content57% 
IMG OID645024951 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_003182327 
Protein GI257791721 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.522968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACATT TGCTCAACAC GCTTTACGTG TTCACGGAAG ACGCGTATCT CATGCTCGAT 
GGGGAGAACG TGGTTGTACG TCGCGATGGC GCAGAGCTTG GACGAGTGCC GCTCCATACG
CTTGAGGGTA TCCTCTGCTT CTCGTACCGA GGTGCAAGTC CTGCCCTTAT GGGCGCCTGC
GTCGAACGGG GCGTCGGCCT GTCTTTCTTC GATCGGAAGG GTCGTTTCCT TGCAGGCGCA
TACGGCGAGC AACGCGGAAA CGTGCTGCTG CGAAAAACGC AGTATGCATG GTCTGAAGAA
ACCGAAAAGA GCCTTGCTAT CGCTCGGAAT TTCATTGTGG GAAAGCTCTA CAACGGTCGT
TGGGTGCTCG AACGGGCGGT GCGCGACCAT GGGTTGCGTA TTGATACGGG ATCGGTGAAG
GCAGCCAGCA TGAGGCTTGA CGGTTCGCTT CGCAGTGCGG CTGAGTGCGA AACGATGGAT
TCCTTGCGAG GGATTGAGGG CGACGCGGCT GCAGAGTATT TCGGCGTGTT CGATGAGCTG
GTGTTGCGAG ATAAGGAGAC GTTTCGTTTT ACAGGTAGAG CGCGGCGTCC TCCGACCGAT
GCGATGAACG CCATGCTGTC TCTGTTCTAC ACGGTTCTGG CTTTCGATTG CGCGTCTGCG
CTTGAGGGTG TTGGGCTTGA TCCGTTCGTG GGTTTCATGC ATGCGGATAG GCCTGGTCGA
CGCTCGCTTG CTTTGGACTT AATGGAGGAG CTGCGGCCGG TGTTCGTCGA TCGGTTCGTG
CTTTCCGCCT TGAACAACCG GGTGGTGAGT TCGAAGGATT TTGAGAAGCG GGAATCGGGT
GAGGTTCGGT TGGGCGATGA AGGGAGACGG GCTTTGTTCG GCGCTTGGCA AGAGCGAAAG
AAGGAGACGA TCACCCATCC CTTTTTGAAG GAGAAGATTC CCCGGGGGCT TGTGCCGCAT
GTGCAGGCGC TGTTGCTCGC TCGTTGCTTG CGCGGCGATC TAGACGGCTA CCCGCCGTTT
CTATGGAAGT GA
 
Protein sequence
MRHLLNTLYV FTEDAYLMLD GENVVVRRDG AELGRVPLHT LEGILCFSYR GASPALMGAC 
VERGVGLSFF DRKGRFLAGA YGEQRGNVLL RKTQYAWSEE TEKSLAIARN FIVGKLYNGR
WVLERAVRDH GLRIDTGSVK AASMRLDGSL RSAAECETMD SLRGIEGDAA AEYFGVFDEL
VLRDKETFRF TGRARRPPTD AMNAMLSLFY TVLAFDCASA LEGVGLDPFV GFMHADRPGR
RSLALDLMEE LRPVFVDRFV LSALNNRVVS SKDFEKRESG EVRLGDEGRR ALFGAWQERK
KETITHPFLK EKIPRGLVPH VQALLLARCL RGDLDGYPPF LWK