Gene Ppha_1662 details

Gene Information

Locus tag: Ppha_1662
Symbol:
ID: 6463252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelodictyon phaeoclathratiforme BU-1
Kingdom: Bacteria
Replicon accession: NC_011060
Strand:
Start bp: 1760549
End bp: 1761580
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 48%
IMG OID: 642727879
Product: CRISPR-associated protein Cas1
Protein accession: YP_002018516
Protein GI: 194336722
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype
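
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1760549, 1761580)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Both values agree with the Gene Length and Protein Length reported in the record.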


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.250727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAAAT ATCTCAATAC CCTTTTTGTC ACTACACAGG GCGCCTACCT CTCAAAGGAG 
GGTGAGTGTG CTGTTATCAA AATAGAAAAA GAGGTCAAAA CGCGCATCCC GCTGCATATG
CTTGACGGCA TCATCTGTTT TGGTTCGGTA ACCTGCAGCC CATTTTTGCT TGGCCATTGT
GCTGAAAGCG GTGTAACAGT AACATTTTTA AGCTTGTATG GAAAATTTCT CTGTCAGGTT
CAGGGAGCGA CACGAGGGAA TATTTTACTG CGGAGAGCAC AGTACCGTAT GGCTGACAAT
AACCTTCAAA CCGCTCGCCT CGCCCGCTCT TTTGTTATCG GCAAAATCGG TAATGCCCGA
ATCACACTTG CACGGACGGT AAGGGATCAT CCCGAAAAAG TGAACTCTGT TAAACTGAAA
AATGCCCAGC ATATCCTTGC TGGTTGCATA AAGCGCTTGC AGGATGAAAC CGATCAGGAA
CGTATCAGGG GAATTGAGGG CGAAGCAGCA AAAGTTTACT TCGACGTCTT CGATGAGTGT
ATCACCTCTA CAGATCCATT GTTCCGCTTT ACCGGTCGCA ACCGCCGTCC GCCACTTGAC
CGAATAAACT GTCTGCTCTC TTTTCTCTAT ACCCTGCTGA CACACGATAT TCGCTCTGCG
CTTGAGTCGT GCGGTCTTGA TCCTGCTGCA GGTTTTTTAC ACAAAGACCG ACCGGGACGC
CCAAGTCTTG CTCTTGACAT GATTGAAGAG TTCCGTTCGT ATATTGCGGA CAGGTTGGCA
CTATCGCTGA TCAATCGAGG TCAGATTCAA TCCAAAGATT TTACCATCTC GGAAACGGGT
GCTGTACTCC TGAAAGATGA TGCTCGCAAA ACGCTTTTAA CCGCCTATCA AAATCGGAAG
CAGGAAGAAA TCGAGCACCC GTTTGTCAAA GAAAAAATGG CCATCGGTTT ATTGTGGCAC
ATGCAAGCTA TGCTGCTTGC CCGCCATATC CGGGGCGATA TCGACACCTA CCCGCCATTT
GTCTGGAGAT AA
 
Protein sequence
MKKYLNTLFV TTQGAYLSKE GECAVIKIEK EVKTRIPLHM LDGIICFGSV TCSPFLLGHC 
AESGVTVTFL SLYGKFLCQV QGATRGNILL RRAQYRMADN NLQTARLARS FVIGKIGNAR
ITLARTVRDH PEKVNSVKLK NAQHILAGCI KRLQDETDQE RIRGIEGEAA KVYFDVFDEC
ITSTDPLFRF TGRNRRPPLD RINCLLSFLY TLLTHDIRSA LESCGLDPAA GFLHKDRPGR
PSLALDMIEE FRSYIADRLA LSLINRGQIQ SKDFTISETG AVLLKDDARK TLLTAYQNRK
QEEIEHPFVK EKMAIGLLWH MQAMLLARHI RGDIDTYPPF VWR