Gene Lcho_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_1154 
Symbol 
ID6163077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp1237367 
End bp1238398 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content68% 
IMG OID641663908 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001790188 
Protein GI171057839 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTCC TCAACACCCT CTACATCACC CTGCCCGACA GCTACCTGCG GCTCGACAAC 
GACACCCTGC GCGTGGTCGA CGAAGACAAG GAAACCCGCC TGCGCGTGCC GCTGCACCAT
CTGCAGGCGG TGGTGTGTTT CGGCCACGTC GGCCTGAGCG CCAAGCTGAT GCACCGGCTG
GCCGAAGACG GCATCGCCCT CGTGCTGCTC GATGCCAACG GCCGCTTCAA GGCGCGGCTG
GAAGGCGAGA CCAGCGGCAA CGTGCTGTTG CGCCGCGCCC ATCACCAGGC GGTCGACAGC
GCCGCGTTCA CGCTCGAAGC GGCTCGTTGC ATCGTGGCCG GCAAGCTGCG CAACCAGCGC
CAGGTGCTGC TGCGCGGCGC CCGTGAATCG AAGGATCCGG GTGAAGAAGC CCAGCTCACC
CGCGCCGCAC AAGACCTGGC GGCCAGCCTG CGCGCACTGC CCGCGGCGGC CGATCTCGAC
GTCCTCCGCG GCATCGAGGG CGAGGCCGCG CGCACCTACT TTGCCGCGCT CAACCTGCTG
GTACGTGCCG ACCGGCGCGA TCATTTCCAG ATGAACGGCC GCAGCCGCCG CCCGCCGCGC
GACCGCATGA ACGCGCTGCT CAGCTTCTTC TATGCAATGT GGATGAACGA CTGCCGCAGT
GCCATCGAGG CCGCCGGGCT CGATCCGCAG ATGGGCTTTC TGCATGCACT GAGGCCGGGG
CGCGCGGCGC TGGCGCTCGA TCTGATGGAG GAGTTTCGCC CGTTCGCCGA CCGGCTGGCG
CTCACGCTGG TAAACCGCGC GCAGGTCAAC GAAGACGACT TCGTGGAGCG TGAAGGCGGC
GCCGTACTGC TGGAGGGCGA TGCGCGCAAG GCGGTGGTGG TGGCGTATCA GGAGCGCAAG
CAGGAGGAGT TGACACACCC GCTGCTGGCC GAAAGCGTTC CGCTCGGACT GGTGCCGCTG
GTGCAGGCGC GGTTGCTGGC GCGCCATGTG CGCGGCGAGG CGCCGAGTTA CGTGCCATTT
GCGATGCGCT GA
 
Protein sequence
MQLLNTLYIT LPDSYLRLDN DTLRVVDEDK ETRLRVPLHH LQAVVCFGHV GLSAKLMHRL 
AEDGIALVLL DANGRFKARL EGETSGNVLL RRAHHQAVDS AAFTLEAARC IVAGKLRNQR
QVLLRGARES KDPGEEAQLT RAAQDLAASL RALPAAADLD VLRGIEGEAA RTYFAALNLL
VRADRRDHFQ MNGRSRRPPR DRMNALLSFF YAMWMNDCRS AIEAAGLDPQ MGFLHALRPG
RAALALDLME EFRPFADRLA LTLVNRAQVN EDDFVEREGG AVLLEGDARK AVVVAYQERK
QEELTHPLLA ESVPLGLVPL VQARLLARHV RGEAPSYVPF AMR