Gene ECD_03189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03189 
SymbolchiA 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3327394 
End bp3329454 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content49% 
IMG OID 
Productperiplasmic endochitinase 
Protein accessionACT44993 
Protein GI253979323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.407335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAA ATATATTTAC TAAATCTATG ATTGGTATGG GGCTGGTGTG TTCCGCTCTG 
CCAGCATTGG CAATGGAAGC ATGGAATAAC CAACAAGGTG GTAATAAATA TCAGGTTATT
TTCGATGGCA AAATTTATGA AAATGCCTGG TGGGTTTCTT CTACAAATTG CCCGGGAAAA
GCGAAAGCAA ATGATGCAAC TAACCCGTGG CGTTTAAAGC GTACCGCAAC AGCTGCTGAA
ATTAGTCAGT TTGGCAATAC ACTTTCCTGC GAAAAGAGCG GCAGCTCATC TTCTTCAAAT
TCAAATACGC CTGCATCCAA TACGCCGGCT AACAGCGAGC CATCAACACC AGCGGATAGC
GGTAACGATT ACTCATTGCA AGCGTGGAGC GGCCAGGAAG GTAGCGAAAT TTACCATGTT
ATTTTCAATG GTAATGTTTA CAAGAACGCC TGGTGGGTTG GGTCTAAAGA TTGCCCACGG
GGTACCAGCG CTGAAAACTC CAATAACCCA TGGCGTCTCG AGCGTACAGC TACCGCTGCG
GAATTGAGTC AGTACGGTAA CCCGACTACC TGTGAAATTG ATAACGGCGG CGTCATTGTT
GCGGATGGTT TCCAGGCCAG CAAAGCGTAC AGCGCGGACA GCATCGTAGA TTATAACGAT
GCACATTATA AAACTTCTGT CGATCAAGAC GCATGGGGCT TTGTCCCGGG CGGCGATAAC
CCGTGGAAGA AATACGAACC GGCGAAAGCA TGGTCCGCAT CCACTGTGTA CGTGAAAGGT
GATCGCGTTG TTGTTGATGG GCAGGCTTAT GAAGCGCTGT TCTGGACGCA AAGTGACAAC
CCTGCTCTGG TGGCGAACCA AAACGCCACC GGTAGCAATA GCCGCCCGTG GAAGCCGTTA
GGTAAGGCTC AGAGCTATAG CAACGAAGAG CTGAATAATG CGCCGCAGTT TAATCCAGAA
ACGCTTTATG CCAGCGATAC GCTGATTCGC TTTAACGGTG TGAACTACAT TTCTCAGAGT
AAAGTGCAGA AAGTTTCTCC TTCTGACAGC AACCCGTGGC GTGTTTTTGT TGACTGGACC
GGAACCAAAG AACGTGTAGG TACGCCGAAG AAAGCATGGC CGAAACACGT TTATGCACCG
TATGTTGACT TTACGCTGAA TACGATCCCG GATTTGGCTG CGCTGGCTAA GAATCATAAC
GTCAACCACT TCACGCTGGC ATTTGTGGTG AGTAAAGATG CGAACACCTG TCTGCCGACA
TGGGGTACCG CTTATGGTAT GCACAATTAC GCTCAGTACA GCAAAATCAA AGCTCTGCGT
GAGGCTGGCG GCGATGTGAT GCTGTCTATC GGTGGTGCTA ACAACGCTCC GCTGGCTGCT
TCCTGTAAGA ACGTAGACGA TCTGATGCAG CATTATTATG ACATCGTTGA TAACCTGAAC
CTCAAAGTCC TGGACTTCGA TATCGAAGGC ACCTGGGTTG CGGATCAGGC ATCTATTGAA
CGTCGTAACC TTGCTGTGAA GAAAGTGCAG GATAAATGGA AGTCAGAAGG CAAAGATATT
GCTATCTGGT ACACCTTGCC AATTCTGCCG ACTGGCCTGA CGCCGGAAGG GATGAATGTC
CTGAGCGATG CCAAAGCGAA AGGTGTTGAG CTGGCGGGTG TGAACGTGAT GACAATGGAC
TACGGTAACG CGATTTGTCA GTCTGCAAAT ACCGAAGGCC AGAACATTCA CGGTAAGTGT
GCAACGTCTG CGATTGCCAA CCTGCATTCA CAATTGAAAG GCCTCCATCC CAATAAGAGC
GATGCAGAAA TTGACGCTAT GATGGGTACC ACGCCGATGG TTGGCGTGAA CGACGTTCAG
GGCGAGGTGT TCTATCTCTC TGATGCTCGT CTGGTCATGC AGGATGCGCA GAAGCGTAAT
CTCGGTATGG TTGGTATCTG GTCAATCGCG CGCGACCTGC CGGGCGGCAC TAACCTGTCT
CCGGAATTCC ACGGCCTGAC TAAAGAACAG GCACCGAAGT ACGCATTTAG CGAAATCTTC
GCGCCGTTTA CTAAGCAATA A
 
Protein sequence
MKLNIFTKSM IGMGLVCSAL PALAMEAWNN QQGGNKYQVI FDGKIYENAW WVSSTNCPGK 
AKANDATNPW RLKRTATAAE ISQFGNTLSC EKSGSSSSSN SNTPASNTPA NSEPSTPADS
GNDYSLQAWS GQEGSEIYHV IFNGNVYKNA WWVGSKDCPR GTSAENSNNP WRLERTATAA
ELSQYGNPTT CEIDNGGVIV ADGFQASKAY SADSIVDYND AHYKTSVDQD AWGFVPGGDN
PWKKYEPAKA WSASTVYVKG DRVVVDGQAY EALFWTQSDN PALVANQNAT GSNSRPWKPL
GKAQSYSNEE LNNAPQFNPE TLYASDTLIR FNGVNYISQS KVQKVSPSDS NPWRVFVDWT
GTKERVGTPK KAWPKHVYAP YVDFTLNTIP DLAALAKNHN VNHFTLAFVV SKDANTCLPT
WGTAYGMHNY AQYSKIKALR EAGGDVMLSI GGANNAPLAA SCKNVDDLMQ HYYDIVDNLN
LKVLDFDIEG TWVADQASIE RRNLAVKKVQ DKWKSEGKDI AIWYTLPILP TGLTPEGMNV
LSDAKAKGVE LAGVNVMTMD YGNAICQSAN TEGQNIHGKC ATSAIANLHS QLKGLHPNKS
DAEIDAMMGT TPMVGVNDVQ GEVFYLSDAR LVMQDAQKRN LGMVGIWSIA RDLPGGTNLS
PEFHGLTKEQ APKYAFSEIF APFTKQ