Gene Information |
Locus tag | Ent638_1402 |
Symbol | |
ID | 5114367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 1537073 |
End bp | 1540291 |
Gene Length | 3219 bp |
Protein Length | 1072 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640491589 |
Product | CRISPR-associated helicase Cas3 family protein |
Protein accession | YP_001176134 |
Protein GI | 146311060 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR02562] CRISPR-associated helicase Cas3 |
|
|
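The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1537073
END_BP = 1540291


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3219 1072
```

Both results match the Gene Length (3219 bp) and Protein Length (1072 aa) rows of the record.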
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.23535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTTC TGATTATTTC CCGGTGCACC AAAAATGCCC GCGTAGAGAG CTGTCGGATA GTTGATCAGT TTGCAGAACG AACGGGGAAT GCCGCCTGGC AAACCGTGAT TACGCTGGAA GGGGTCAATA CACTCAGAAA GTTGCTGCGT AAAACGGCGC GCCGCAATAC GGCGGTGGCC TGTCACTGGC TTAAAAAGAA CGGACAAACA GAATTATTAT GGATTGTCGG TAATCTTCGC CGGTTTAATG CGCAGGGGCG TGTACCCACT AACCGCACGA CGTCAGATGT GGTCAAAAGC GGGGCTGAAC ATGGCTGGCA CAGCGCAGAA AGCATTGCGT TACTGGCCGC CATTGCTGGA CTTTTTCATG ATTTTGGCAA GGCAGGGCGC TGTTTTCAGC AGACCTTGAA AGGCGCCAGT AAGCATCTTT GCCAGCCTTT TCGTCATGAG TGGATTTCAG TGCGTCTATT TCAGGCATTT GTGGATAAAA AAACGGATGC AGAATGGCTC TCAGAGCTAA CGAATCTGAA GGCATCTGAT GAAACTGTCA TCATGAATGC GCTTGTCACG GATACATCCT CGAAAAGTGA CAGCCCGCTC AGCACGTTGC CGCCGCTGGC GAAGACCGTC GCCTGGCTGA TGCTGTCGCA TCACCGTTTG CCGCAGGCGT TATCGTCGAG GCCGCAACTG GAATATTGTA CCGGCTGGAT GGACAGGCAA CTCAACGCTG ACTGGAACTC ACTGAATCAC AAAATCAGCG ACTCGCACAA ATGGACAGAT CGCGATTTTA AGCAGGTGTG GCAATTTCCT GAAGGTACGC CTCTGCGAAG CGCTGTCTGG CGTGAAAAAG CCAGGCAGAT TGGCAAGCGA GCCAGGAATG CGGTTTCGCT CCTTCACTTT GGCTCGCTTG AAAACAGTTT TACCGTCCAT ATGGCGCGGC TGTCGCTGAT GCTGGCCGAT CATTTTTACT CCTCTCAGCC ACCGTCTGTC GTCTGGCAGC AGGACAACTA TGACGTCTGG GCCAACAGCG ATCGCCACAC CAAACAGCTG AAACAAAAGC TGGATGAGCA CAACACTGGT GTCGCGCATC ATGCTTTGTT GTTAGGACGT TCTTTACCAC AGATACGTCG TTCGTTACCT GCGATAACCC GTCATAAAAC CTTTCGCGAA CGGGCAAAAG ATGCGCGTTT TCAGTGGCAA AACCGGGCCT GGGATGTCGC GTTAGCTTTA CGTGATCGCA GTGCAGAGCA GGGCTTTTTT GGTGTCAATA TGGCCTCTAC CGGCTGCGGT AAAACTTTCG CCAATGCCCG CATTATGTAT GCCCTCGCCA GTGAACAGGA GGGTTGCCGG TTCAGCGTTG CTCTTGGTTT GCGTACACTA ACGTTGCAAA CCGGGCAGGC GCTGCAGTCT CGTCTAGGTC TGGATGACGA TACGCTCGCG GTCGTGACAG GGTCCGCCGC CGTGCGTGAC CTCTTCCAGC AGGGACACGA TGATGAGGAT ACCAGCAGCG CCAGCGACGA GGCATTTTTT GCCAGTCATC ACTACGTGCA CTATGAAGGC AGCACCAGTA ACGGTGTTGT TCAGCAATGG CTGGCCGCGG AGCCGGCACT GAATCGTCTG GTTAGCGCGC CGGTGCTGGT GACGACTGTT GACCATCTGA TGCCCGCGAC CGAAGGCGTG CGAGGCGGCA GGCAGATCCC GGCAATGCTA CGTCTGTTGA CCAGCGATCT GGTTCTTGAT GAGCCGGATG ATTTCGATAT CGATGATTTG CACGCTTTGT GTCGGTTGGT GAACTGGGCG 
GGAATGCTCG GCTCTCGCGT GTTGCTCTCG TCTGCAACGT TACCGCCGGC GTTAACAGAA GCTTTATTCG AAGCCTATAG GAAAGGACGT GAAGCCTATC AGGCTGCCTG CGGCATTCCT GGAAAACCGG TCAATATTTG CTGCGCGTGG TTCGACGAAT ATGGCGCGCA GTCGCAAGAG GCTTGCGATG AGACGGTATT TCGACAAGCT CATAATCAGT TTGTAGCCCA TCGCGCGCGA CATTTACTGG AACAACCCCG CCTGCGCTTT GGCAAGTTGG CAAAAATAGA GCCAGCTTCT GACATCATTA AAACGGTGGC GAGCACGCTT TATCAGCACA TGCTGGCGCT ACATCAGCAC CATCACACCG TTCATCACTG TGGAAAAACG GTGTCGTTGG GTTTGATTCG CATGGCGAAT ATCAACCCGC TTGTTGACGT GGCACAGTCG CTTATGGCGA TGCCTGCGCT AGAAAATTAC TGCATCCATT ATTGCGTTTA CCATAGTCAG CATCCGCTGG CGGTACGGTC GAGTATTGAG AAACGGCTGG ATGCGGCATT TACCCGTCAC GATCCGCAAC ATATCTGGCA CTTACCGGAA GTGAAACAGG CGCTCGCATC GCCCTTTCAA CATCATCTCT TTGTGGTGGT GGGGACATCC GTGCTGGAGG TTGGGCGCGA TTTTGACGCT GACTGGGGCA TTATCGAGCC CAGTTCGATG CGCTCACTCA TTCAATTTGC TGGGCGTATT CAGCGCCACC GCCAACAACT CCCGGAGAGT GAAAATGTGG TGATCCTGAG TCGTAATATC CGTGCGCTAC AGGGTAAAAC CATCGCGTAC TGCAAGCCGG GTTTTGAGAC GAATGAACAT CTTTTTCCAC ATCATGATTT AGCTGATCTG CTGCCCGACG CCAATTTTCA GCACATCAAC GCTATCCCGC GCATTGTGGA CGATGTAACT GATAACGCGT TTGCCAGCCT TGAACATACC CGGTTGCGTG CCGCGTTACT TGATGGGGGC GATAAACGTG ACGTTATTGC CGCCCAGTGG TGGCGCTTAC CGCTGACATG GAATGGCGAA TTACAGAGAC GTACGCCTTT TCGCCATGCT TCGCCACAGG AATCATTTTT CCTGACCATG GATCAGGATG ACGATACGCC TGTTTTTTGT CTGATGCAGA GTGATGGCGT GCTGAAACCC TCGCATCTGT TTCGCGAACA TTCGTTTTTA CTGGCTGAAA GAGTGCAGGC GTGGTTTGCC GTTGATTATC AGGCGGTTCT ACTTGAGCTG GCAGAATCAA AAAATATGGA ACTTCACGCG GTATCACGAC GGTATGGCGA AATAACATTA CCCGTTGGTA AAGAGAACGC CACGGAGCAA TGGCGATATC ACCCGATTTT GGGTGTGTTT AGGGAATAA
|
Protein sequence | MNVLIISRCT KNARVESCRI VDQFAERTGN AAWQTVITLE GVNTLRKLLR KTARRNTAVA CHWLKKNGQT ELLWIVGNLR RFNAQGRVPT NRTTSDVVKS GAEHGWHSAE SIALLAAIAG LFHDFGKAGR CFQQTLKGAS KHLCQPFRHE WISVRLFQAF VDKKTDAEWL SELTNLKASD ETVIMNALVT DTSSKSDSPL STLPPLAKTV AWLMLSHHRL PQALSSRPQL EYCTGWMDRQ LNADWNSLNH KISDSHKWTD RDFKQVWQFP EGTPLRSAVW REKARQIGKR ARNAVSLLHF GSLENSFTVH MARLSLMLAD HFYSSQPPSV VWQQDNYDVW ANSDRHTKQL KQKLDEHNTG VAHHALLLGR SLPQIRRSLP AITRHKTFRE RAKDARFQWQ NRAWDVALAL RDRSAEQGFF GVNMASTGCG KTFANARIMY ALASEQEGCR FSVALGLRTL TLQTGQALQS RLGLDDDTLA VVTGSAAVRD LFQQGHDDED TSSASDEAFF ASHHYVHYEG STSNGVVQQW LAAEPALNRL VSAPVLVTTV DHLMPATEGV RGGRQIPAML RLLTSDLVLD EPDDFDIDDL HALCRLVNWA GMLGSRVLLS SATLPPALTE ALFEAYRKGR EAYQAACGIP GKPVNICCAW FDEYGAQSQE ACDETVFRQA HNQFVAHRAR HLLEQPRLRF GKLAKIEPAS DIIKTVASTL YQHMLALHQH HHTVHHCGKT VSLGLIRMAN INPLVDVAQS LMAMPALENY CIHYCVYHSQ HPLAVRSSIE KRLDAAFTRH DPQHIWHLPE VKQALASPFQ HHLFVVVGTS VLEVGRDFDA DWGIIEPSSM RSLIQFAGRI QRHRQQLPES ENVVILSRNI RALQGKTIAY CKPGFETNEH LFPHHDLADL LPDANFQHIN AIPRIVDDVT DNAFASLEHT RLRAALLDGG DKRDVIAAQW WRLPLTWNGE LQRRTPFRHA SPQESFFLTM DQDDDTPVFC LMQSDGVLKP SHLFREHSFL LAERVQAWFA VDYQAVLLEL AESKNMELHA VSRRYGEITL PVGKENATEQ WRYHPILGVF RE
|
| |
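The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates a coding sequence with the amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as start codons). The helper names are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence that begins with ATG, as this gene does; it does not handle alternative start codons or ambiguity codes such as N.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAACGTTC TGATTATTTC CCGGTGCACC"))  # MNVLIISRCT
```

The printed residues agree with the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` in the same way gives values that can be compared with the GC content and protein sequence rows.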