Gene Ent638_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1402 
Symbol 
ID5114367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1537073 
End bp1540291 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content53% 
IMG OID640491589 
ProductCRISPR-associated helicase Cas3 family protein 
Protein accessionYP_001176134 
Protein GI146311060 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR02562] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.23535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTC TGATTATTTC CCGGTGCACC AAAAATGCCC GCGTAGAGAG CTGTCGGATA 
GTTGATCAGT TTGCAGAACG AACGGGGAAT GCCGCCTGGC AAACCGTGAT TACGCTGGAA
GGGGTCAATA CACTCAGAAA GTTGCTGCGT AAAACGGCGC GCCGCAATAC GGCGGTGGCC
TGTCACTGGC TTAAAAAGAA CGGACAAACA GAATTATTAT GGATTGTCGG TAATCTTCGC
CGGTTTAATG CGCAGGGGCG TGTACCCACT AACCGCACGA CGTCAGATGT GGTCAAAAGC
GGGGCTGAAC ATGGCTGGCA CAGCGCAGAA AGCATTGCGT TACTGGCCGC CATTGCTGGA
CTTTTTCATG ATTTTGGCAA GGCAGGGCGC TGTTTTCAGC AGACCTTGAA AGGCGCCAGT
AAGCATCTTT GCCAGCCTTT TCGTCATGAG TGGATTTCAG TGCGTCTATT TCAGGCATTT
GTGGATAAAA AAACGGATGC AGAATGGCTC TCAGAGCTAA CGAATCTGAA GGCATCTGAT
GAAACTGTCA TCATGAATGC GCTTGTCACG GATACATCCT CGAAAAGTGA CAGCCCGCTC
AGCACGTTGC CGCCGCTGGC GAAGACCGTC GCCTGGCTGA TGCTGTCGCA TCACCGTTTG
CCGCAGGCGT TATCGTCGAG GCCGCAACTG GAATATTGTA CCGGCTGGAT GGACAGGCAA
CTCAACGCTG ACTGGAACTC ACTGAATCAC AAAATCAGCG ACTCGCACAA ATGGACAGAT
CGCGATTTTA AGCAGGTGTG GCAATTTCCT GAAGGTACGC CTCTGCGAAG CGCTGTCTGG
CGTGAAAAAG CCAGGCAGAT TGGCAAGCGA GCCAGGAATG CGGTTTCGCT CCTTCACTTT
GGCTCGCTTG AAAACAGTTT TACCGTCCAT ATGGCGCGGC TGTCGCTGAT GCTGGCCGAT
CATTTTTACT CCTCTCAGCC ACCGTCTGTC GTCTGGCAGC AGGACAACTA TGACGTCTGG
GCCAACAGCG ATCGCCACAC CAAACAGCTG AAACAAAAGC TGGATGAGCA CAACACTGGT
GTCGCGCATC ATGCTTTGTT GTTAGGACGT TCTTTACCAC AGATACGTCG TTCGTTACCT
GCGATAACCC GTCATAAAAC CTTTCGCGAA CGGGCAAAAG ATGCGCGTTT TCAGTGGCAA
AACCGGGCCT GGGATGTCGC GTTAGCTTTA CGTGATCGCA GTGCAGAGCA GGGCTTTTTT
GGTGTCAATA TGGCCTCTAC CGGCTGCGGT AAAACTTTCG CCAATGCCCG CATTATGTAT
GCCCTCGCCA GTGAACAGGA GGGTTGCCGG TTCAGCGTTG CTCTTGGTTT GCGTACACTA
ACGTTGCAAA CCGGGCAGGC GCTGCAGTCT CGTCTAGGTC TGGATGACGA TACGCTCGCG
GTCGTGACAG GGTCCGCCGC CGTGCGTGAC CTCTTCCAGC AGGGACACGA TGATGAGGAT
ACCAGCAGCG CCAGCGACGA GGCATTTTTT GCCAGTCATC ACTACGTGCA CTATGAAGGC
AGCACCAGTA ACGGTGTTGT TCAGCAATGG CTGGCCGCGG AGCCGGCACT GAATCGTCTG
GTTAGCGCGC CGGTGCTGGT GACGACTGTT GACCATCTGA TGCCCGCGAC CGAAGGCGTG
CGAGGCGGCA GGCAGATCCC GGCAATGCTA CGTCTGTTGA CCAGCGATCT GGTTCTTGAT
GAGCCGGATG ATTTCGATAT CGATGATTTG CACGCTTTGT GTCGGTTGGT GAACTGGGCG
GGAATGCTCG GCTCTCGCGT GTTGCTCTCG TCTGCAACGT TACCGCCGGC GTTAACAGAA
GCTTTATTCG AAGCCTATAG GAAAGGACGT GAAGCCTATC AGGCTGCCTG CGGCATTCCT
GGAAAACCGG TCAATATTTG CTGCGCGTGG TTCGACGAAT ATGGCGCGCA GTCGCAAGAG
GCTTGCGATG AGACGGTATT TCGACAAGCT CATAATCAGT TTGTAGCCCA TCGCGCGCGA
CATTTACTGG AACAACCCCG CCTGCGCTTT GGCAAGTTGG CAAAAATAGA GCCAGCTTCT
GACATCATTA AAACGGTGGC GAGCACGCTT TATCAGCACA TGCTGGCGCT ACATCAGCAC
CATCACACCG TTCATCACTG TGGAAAAACG GTGTCGTTGG GTTTGATTCG CATGGCGAAT
ATCAACCCGC TTGTTGACGT GGCACAGTCG CTTATGGCGA TGCCTGCGCT AGAAAATTAC
TGCATCCATT ATTGCGTTTA CCATAGTCAG CATCCGCTGG CGGTACGGTC GAGTATTGAG
AAACGGCTGG ATGCGGCATT TACCCGTCAC GATCCGCAAC ATATCTGGCA CTTACCGGAA
GTGAAACAGG CGCTCGCATC GCCCTTTCAA CATCATCTCT TTGTGGTGGT GGGGACATCC
GTGCTGGAGG TTGGGCGCGA TTTTGACGCT GACTGGGGCA TTATCGAGCC CAGTTCGATG
CGCTCACTCA TTCAATTTGC TGGGCGTATT CAGCGCCACC GCCAACAACT CCCGGAGAGT
GAAAATGTGG TGATCCTGAG TCGTAATATC CGTGCGCTAC AGGGTAAAAC CATCGCGTAC
TGCAAGCCGG GTTTTGAGAC GAATGAACAT CTTTTTCCAC ATCATGATTT AGCTGATCTG
CTGCCCGACG CCAATTTTCA GCACATCAAC GCTATCCCGC GCATTGTGGA CGATGTAACT
GATAACGCGT TTGCCAGCCT TGAACATACC CGGTTGCGTG CCGCGTTACT TGATGGGGGC
GATAAACGTG ACGTTATTGC CGCCCAGTGG TGGCGCTTAC CGCTGACATG GAATGGCGAA
TTACAGAGAC GTACGCCTTT TCGCCATGCT TCGCCACAGG AATCATTTTT CCTGACCATG
GATCAGGATG ACGATACGCC TGTTTTTTGT CTGATGCAGA GTGATGGCGT GCTGAAACCC
TCGCATCTGT TTCGCGAACA TTCGTTTTTA CTGGCTGAAA GAGTGCAGGC GTGGTTTGCC
GTTGATTATC AGGCGGTTCT ACTTGAGCTG GCAGAATCAA AAAATATGGA ACTTCACGCG
GTATCACGAC GGTATGGCGA AATAACATTA CCCGTTGGTA AAGAGAACGC CACGGAGCAA
TGGCGATATC ACCCGATTTT GGGTGTGTTT AGGGAATAA
 
Protein sequence
MNVLIISRCT KNARVESCRI VDQFAERTGN AAWQTVITLE GVNTLRKLLR KTARRNTAVA 
CHWLKKNGQT ELLWIVGNLR RFNAQGRVPT NRTTSDVVKS GAEHGWHSAE SIALLAAIAG
LFHDFGKAGR CFQQTLKGAS KHLCQPFRHE WISVRLFQAF VDKKTDAEWL SELTNLKASD
ETVIMNALVT DTSSKSDSPL STLPPLAKTV AWLMLSHHRL PQALSSRPQL EYCTGWMDRQ
LNADWNSLNH KISDSHKWTD RDFKQVWQFP EGTPLRSAVW REKARQIGKR ARNAVSLLHF
GSLENSFTVH MARLSLMLAD HFYSSQPPSV VWQQDNYDVW ANSDRHTKQL KQKLDEHNTG
VAHHALLLGR SLPQIRRSLP AITRHKTFRE RAKDARFQWQ NRAWDVALAL RDRSAEQGFF
GVNMASTGCG KTFANARIMY ALASEQEGCR FSVALGLRTL TLQTGQALQS RLGLDDDTLA
VVTGSAAVRD LFQQGHDDED TSSASDEAFF ASHHYVHYEG STSNGVVQQW LAAEPALNRL
VSAPVLVTTV DHLMPATEGV RGGRQIPAML RLLTSDLVLD EPDDFDIDDL HALCRLVNWA
GMLGSRVLLS SATLPPALTE ALFEAYRKGR EAYQAACGIP GKPVNICCAW FDEYGAQSQE
ACDETVFRQA HNQFVAHRAR HLLEQPRLRF GKLAKIEPAS DIIKTVASTL YQHMLALHQH
HHTVHHCGKT VSLGLIRMAN INPLVDVAQS LMAMPALENY CIHYCVYHSQ HPLAVRSSIE
KRLDAAFTRH DPQHIWHLPE VKQALASPFQ HHLFVVVGTS VLEVGRDFDA DWGIIEPSSM
RSLIQFAGRI QRHRQQLPES ENVVILSRNI RALQGKTIAY CKPGFETNEH LFPHHDLADL
LPDANFQHIN AIPRIVDDVT DNAFASLEHT RLRAALLDGG DKRDVIAAQW WRLPLTWNGE
LQRRTPFRHA SPQESFFLTM DQDDDTPVFC LMQSDGVLKP SHLFREHSFL LAERVQAWFA
VDYQAVLLEL AESKNMELHA VSRRYGEITL PVGKENATEQ WRYHPILGVF RE