Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2158 |
Symbol | |
ID | 6375852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2335040 |
End bp | 2336824 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642684645 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_001960544 |
Protein GI | 189501074 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATT CAGGCTATAT TGTTGAAGCT CCCATGGGTA TCGGAAAAAC TGAAGCCTCG CTTTATGCTG CATATTCCAT AATAGCCGCA GGTGGAGCTA CAGGGCTATA CTTTGCGCTT CCAACACAAC TCACTTCCAA CAAAATCCAT GACCGAGTCA ATGATTTCCT GAAAAAAATA TTGGCTCCTG ATTGTATTCA TCGACAGGCC CTTTTGCTGC ACAGCAATGC TTGGTTGAAA GAGAATGAGA TGGGTGCGGA TGGTGGACCG GGAGGGAACT GGTTCAGTTC CGGAAAGCGG GGCATTCTTG CCCCATTTGC AGTTGGCACA ATTGACCAGG CTCTGATGGC GGTAATGAAT GTCAAGCATG GCTTTGTTAG GGCATTTGGC TTAGCAGGGA AAGTTGTCGT TCTCGATGAG GTACACTCCT ACGATGCCTA TACCGGAACC ATACTCGACA AACTGGTTGA GGCACTGCGC AAACTCCAGT GTACAGTAAT CATTTTGAGC GCTACCCTTA CCAAAGAACG CCGGATGAAT ATTTTACAAC AACCGGTACA TGATATTTCA TATCCGCTCA TCACAGTCAG CCAAAATCAT GGCGAATCTG TAATCGAAAC AACTGCCGTT GTTCCAGGAA AAACCGTTGT ATCCATTCAT TCTTCTTCAT TTTTCGATTT AGCTATCGAA GAGGCATTGA GCAGGGCTGA AGGTGGTCAG CAAGTGCTCT GGATAGAAAA TACGGTTGCT GAAGCACAAA ACTCTTTCAG CATTCTTGCT GCTCGATCTG CCGGAATGGA AATTGAATGT GGTTTGCTTC ACTCTCGTTT CATAAAAGCC GATCGTGAAC GCAACGAGGA GTATTGGGTT ACTCGCTATG GAAAAGGATG CGATGAACTA AGGCGACAAA GTGGTCGAAT ACTTGTTGGT ACCCAGGTGT TGGAACAGTC TCTTGATATT GATGCCGATT TTCTTGTATC AAAAATATGT CCGACAGACA TGCTGCTGCA AAGGATTGGT CGACTATGGC GACATGGTGA TACGTTTCGT CCGGTCGGTG CCGTCTGTGA AGTGTTGATA CTGAAGCCTG AGTTCCATAG TGCACTTTTA AATCCTGAAA AAGAATTCGG GCCTACAGCA AATGTCTACA GCCCTTATGT GCTTTTGCGA ACTCTTGTTG TCTGGAATGA TATGGCCGAC ATTGTTCTTC CGGAGCAGAT ACGAAGCTTG ATCGAAGCGA CCTATATGGA AATTGAAGAG GATCCTATGA TGCTCAAGTA TAAAGCGAAG CTCCAGCAGG AAAAAGCGAA GCTTGAACTG CTTGCCTTAG GTGGCGTTTC GGAAGGCACC AAAACTTTGC CGGAAAGCAA GGCAAGCACT CGATATAGCG AACAGGAAAG TGTTGAGGTT TTACTGTTAC GCTCTTTTCA CTTTGACCGT AATCAGAATG TGACAAATGT CAAACTACTT GACGGCAGTG ATCTTTTTCT GCCATTGGTA TGCTCGAAGA AGAGTAAAAA AGAACAACGT TCACTTGCAG CATCTCTTGC GCAATACACC CTGCACGTTG CCGACTATCT CGCTCCGGAA GTAATTTCGG TTAAAAACCT CGACTGGTTG AAAAATTATT TCTATCTCGG AGACCGTGAC CATGACGAGA GCCTCCTTCG GGTAGCTATT GTAGGGCAAG ACGAGGAACT CAAATCCCTA AACGGTCGGA ATGCGTCGTC ATCATACGGA CTCAGTTACA ATCCAAGGCT GGGATATCGG GCCATAAAAA TATAA
|
Protein sequence | MNYSGYIVEA PMGIGKTEAS LYAAYSIIAA GGATGLYFAL PTQLTSNKIH DRVNDFLKKI LAPDCIHRQA LLLHSNAWLK ENEMGADGGP GGNWFSSGKR GILAPFAVGT IDQALMAVMN VKHGFVRAFG LAGKVVVLDE VHSYDAYTGT ILDKLVEALR KLQCTVIILS ATLTKERRMN ILQQPVHDIS YPLITVSQNH GESVIETTAV VPGKTVVSIH SSSFFDLAIE EALSRAEGGQ QVLWIENTVA EAQNSFSILA ARSAGMEIEC GLLHSRFIKA DRERNEEYWV TRYGKGCDEL RRQSGRILVG TQVLEQSLDI DADFLVSKIC PTDMLLQRIG RLWRHGDTFR PVGAVCEVLI LKPEFHSALL NPEKEFGPTA NVYSPYVLLR TLVVWNDMAD IVLPEQIRSL IEATYMEIEE DPMMLKYKAK LQQEKAKLEL LALGGVSEGT KTLPESKAST RYSEQESVEV LLLRSFHFDR NQNVTNVKLL DGSDLFLPLV CSKKSKKEQR SLAASLAQYT LHVADYLAPE VISVKNLDWL KNYFYLGDRD HDESLLRVAI VGQDEELKSL NGRNASSSYG LSYNPRLGYR AIKI
|
| |