Gene Elen_1976 details


Gene Information

Locus tag: Elen_1976
Symbol:
ID: 8416287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2314538
End bp: 2315437
Gene Length: 900 bp
Protein Length: 299 aa
Translation table: 11
GC content: 60%
IMG OID: 645024953
Product: CRISPR-associated protein, Csd2 family
Protein accession: YP_003182329
Protein GI: 257791723
COG category: [L] Replication, recombination and repair
COG ID: [COG3649] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR01595] CRISPR-associated protein, CT1132 family
            [TIGR02589] CRISPR-associated protein, Csd2 family
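
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record (Start bp, End bp).
gene_length = span_length(2314538, 2315437)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 900 299
```

Under these assumptions the result agrees with the listed gene length of 900 bp and protein length of 299 aa.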


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.856064
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATA CGATCAAGAA CCGTTACGAC TTCGTCGTGT ACTTCGATGT GGAGAACGGT 
AACCCCAACG GCGATCCCGA TGCCGGCAAC ATGCCGCGCA TCGATCCCGA GACGAGCCTC
GGCATCGTGA CTGACGTGTG CTTGAAGCGC AAGATCCGCA ACTACGTGGA GACGGTGAAG
GAAGGCGAGC GGGGTTTCGA GATATACATC AAGGACGGCG TGCCGCTCAA TGCGAGCGAC
CGACGGGCGC TCGACGAGTT CGGCGTGGGA ACCGATGATA AAGCCATCAA AAAACTCAAG
AAAGACGATC CTGCGCTCGA CGAGAAGATT CGCGATTTCA TGTGCGAGAC ATTCTATGAT
GTGCGCACGT TCGGTGCGGT GATGACCACG TTCGTCAAAG GCGCGCTCAA CTGCGGGCAG
GTGCGCGGGC CGGTGCAGCT GACGTTCGCG CGCAGCGTCG ATCCTATCAT TCCGCAGGAG
GTCACCATCA CGCGCGTAGC CATCACCACC GAAGCCGATG CCGAGAAGAA GGGCACCGAG
ATGGGTCGCA AGTACGTGGT TCCCTATGCG CTCTATCGCG GGGAGGGCTA CGTTTCGGCG
AACTTGGCGC GCAAATCGAC GGGATTCTCG GAGGACGACC TCGCCCTTCT GTGGGATGCT
ATCGTCAACA TGTTCGAGCA CGATCACTCG GCTGCGCGCG GCAAGATGGC GGTTCGTGCG
CTCGTGGTGT TCAAGCATGA CAGCGAGCTT GGCAATGCGC CGTCGTACAA GCTGTTCGAC
GCTGTTTCAA CCCAGAAGAA AGCCGGCGTG GAGGCTCCGC GTTCCATCGA CGACTACGAG
GAGATCACCG TTGACGAGGG CGCCGTGCCC GAGGGAGTCA CGGTTCTGAG GATGGTGTAG
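
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be computed. Below is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so its GC fraction need not equal the rounded 60% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGAGCGATA CGATCAAGAA CCGTTACGAC TTCGTCGTGT ACTTCGATGT GGAGAACGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing all fifteen lines of the gene sequence to `clean_sequence` at once would give the full-length string to compare with the reported gene length and GC content.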
 
Protein sequence
MSDTIKNRYD FVVYFDVENG NPNGDPDAGN MPRIDPETSL GIVTDVCLKR KIRNYVETVK 
EGERGFEIYI KDGVPLNASD RRALDEFGVG TDDKAIKKLK KDDPALDEKI RDFMCETFYD
VRTFGAVMTT FVKGALNCGQ VRGPVQLTFA RSVDPIIPQE VTITRVAITT EADAEKKGTE
MGRKYVVPYA LYRGEGYVSA NLARKSTGFS EDDLALLWDA IVNMFEHDHS AARGKMAVRA
LVVFKHDSEL GNAPSYKLFD AVSTQKKAGV EAPRSIDDYE EITVDEGAVP EGVTVLRMV
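
The protein sequence can be checked against the gene sequence by translating the coding sequence codon by codon. The sketch below assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted), and it builds the codon table from the usual TCAG ordering. The name `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown in the record.
print(translate("ATGAGCGATA CGATCAAGAA CCGTTACGAC"))  # MSDTIKNRYD
```

The output matches the first ten residues of the protein sequence listed in the record.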