Gene Cphamn1_2158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2158 
Symbol 
ID6375852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2335040 
End bp2336824 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content46% 
IMG OID642684645 
ProductCRISPR-associated helicase Cas3 
Protein accessionYP_001960544 
Protein GI189501074 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATT CAGGCTATAT TGTTGAAGCT CCCATGGGTA TCGGAAAAAC TGAAGCCTCG 
CTTTATGCTG CATATTCCAT AATAGCCGCA GGTGGAGCTA CAGGGCTATA CTTTGCGCTT
CCAACACAAC TCACTTCCAA CAAAATCCAT GACCGAGTCA ATGATTTCCT GAAAAAAATA
TTGGCTCCTG ATTGTATTCA TCGACAGGCC CTTTTGCTGC ACAGCAATGC TTGGTTGAAA
GAGAATGAGA TGGGTGCGGA TGGTGGACCG GGAGGGAACT GGTTCAGTTC CGGAAAGCGG
GGCATTCTTG CCCCATTTGC AGTTGGCACA ATTGACCAGG CTCTGATGGC GGTAATGAAT
GTCAAGCATG GCTTTGTTAG GGCATTTGGC TTAGCAGGGA AAGTTGTCGT TCTCGATGAG
GTACACTCCT ACGATGCCTA TACCGGAACC ATACTCGACA AACTGGTTGA GGCACTGCGC
AAACTCCAGT GTACAGTAAT CATTTTGAGC GCTACCCTTA CCAAAGAACG CCGGATGAAT
ATTTTACAAC AACCGGTACA TGATATTTCA TATCCGCTCA TCACAGTCAG CCAAAATCAT
GGCGAATCTG TAATCGAAAC AACTGCCGTT GTTCCAGGAA AAACCGTTGT ATCCATTCAT
TCTTCTTCAT TTTTCGATTT AGCTATCGAA GAGGCATTGA GCAGGGCTGA AGGTGGTCAG
CAAGTGCTCT GGATAGAAAA TACGGTTGCT GAAGCACAAA ACTCTTTCAG CATTCTTGCT
GCTCGATCTG CCGGAATGGA AATTGAATGT GGTTTGCTTC ACTCTCGTTT CATAAAAGCC
GATCGTGAAC GCAACGAGGA GTATTGGGTT ACTCGCTATG GAAAAGGATG CGATGAACTA
AGGCGACAAA GTGGTCGAAT ACTTGTTGGT ACCCAGGTGT TGGAACAGTC TCTTGATATT
GATGCCGATT TTCTTGTATC AAAAATATGT CCGACAGACA TGCTGCTGCA AAGGATTGGT
CGACTATGGC GACATGGTGA TACGTTTCGT CCGGTCGGTG CCGTCTGTGA AGTGTTGATA
CTGAAGCCTG AGTTCCATAG TGCACTTTTA AATCCTGAAA AAGAATTCGG GCCTACAGCA
AATGTCTACA GCCCTTATGT GCTTTTGCGA ACTCTTGTTG TCTGGAATGA TATGGCCGAC
ATTGTTCTTC CGGAGCAGAT ACGAAGCTTG ATCGAAGCGA CCTATATGGA AATTGAAGAG
GATCCTATGA TGCTCAAGTA TAAAGCGAAG CTCCAGCAGG AAAAAGCGAA GCTTGAACTG
CTTGCCTTAG GTGGCGTTTC GGAAGGCACC AAAACTTTGC CGGAAAGCAA GGCAAGCACT
CGATATAGCG AACAGGAAAG TGTTGAGGTT TTACTGTTAC GCTCTTTTCA CTTTGACCGT
AATCAGAATG TGACAAATGT CAAACTACTT GACGGCAGTG ATCTTTTTCT GCCATTGGTA
TGCTCGAAGA AGAGTAAAAA AGAACAACGT TCACTTGCAG CATCTCTTGC GCAATACACC
CTGCACGTTG CCGACTATCT CGCTCCGGAA GTAATTTCGG TTAAAAACCT CGACTGGTTG
AAAAATTATT TCTATCTCGG AGACCGTGAC CATGACGAGA GCCTCCTTCG GGTAGCTATT
GTAGGGCAAG ACGAGGAACT CAAATCCCTA AACGGTCGGA ATGCGTCGTC ATCATACGGA
CTCAGTTACA ATCCAAGGCT GGGATATCGG GCCATAAAAA TATAA
 
Protein sequence
MNYSGYIVEA PMGIGKTEAS LYAAYSIIAA GGATGLYFAL PTQLTSNKIH DRVNDFLKKI 
LAPDCIHRQA LLLHSNAWLK ENEMGADGGP GGNWFSSGKR GILAPFAVGT IDQALMAVMN
VKHGFVRAFG LAGKVVVLDE VHSYDAYTGT ILDKLVEALR KLQCTVIILS ATLTKERRMN
ILQQPVHDIS YPLITVSQNH GESVIETTAV VPGKTVVSIH SSSFFDLAIE EALSRAEGGQ
QVLWIENTVA EAQNSFSILA ARSAGMEIEC GLLHSRFIKA DRERNEEYWV TRYGKGCDEL
RRQSGRILVG TQVLEQSLDI DADFLVSKIC PTDMLLQRIG RLWRHGDTFR PVGAVCEVLI
LKPEFHSALL NPEKEFGPTA NVYSPYVLLR TLVVWNDMAD IVLPEQIRSL IEATYMEIEE
DPMMLKYKAK LQQEKAKLEL LALGGVSEGT KTLPESKAST RYSEQESVEV LLLRSFHFDR
NQNVTNVKLL DGSDLFLPLV CSKKSKKEQR SLAASLAQYT LHVADYLAPE VISVKNLDWL
KNYFYLGDRD HDESLLRVAI VGQDEELKSL NGRNASSSYG LSYNPRLGYR AIKI