Gene SeHA_C0271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0271 
Symbol 
ID6490109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp281916 
End bp283679 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content56% 
IMG OID642740549 
Productendochitinase 
Protein accessionYP_002044223 
Protein GI194451487 
COG category[R] General function prediction only 
COG ID[COG3979] Uncharacterized protein contain chitin-binding domain type 3 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.527688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.775272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGC AAAAAACACT GGCGCTCAGC GCCGTGGCGG CAGGAATCAT GTTGAGCTTA 
TCCGGTGCGC AGGCCGCGCC GCTGCTTAGC AGTAGTGAGC CAATGACCAT CAACGCCAGC
GATCTGGCGG CGAAAGAGAA AGCGCTGACG GATTTTCCGT TAATGGAGGC CGTGAAATCC
TCTATCCAGA CGTTGGATAA CAGCGCGGTC GAACAAATCG AACCGGGGCG CGCCGCTAAC
CCGGCAAACG TAAAACGCGT TGAAAGTATT CTGAAAGAGG CCGACTGGGA TTATCTGTTC
CCGATGCGCG CGCCGGAATA CACTTACTCT AACTTCCTGA AAGCGATAGG TAAATTCCCG
GCGGTTTGTG GTACCTACAC CGATGGACGC GATAGCGACG CTATCTGCCG TAAAACCCTG
GCTACTATGT TTGCGCATTT TGCCCAGGAG ACGGGCGGTC ACGAAAGCTG GCGTGACATT
CCGGAATGGC GTCAGGCGCT GGTCTATCTG CGCGAAGTCG GCTGGACTGA AGGGCAGAAA
GGCGGCTACA ACGGCGAATG TAACCCGGAT GTATGGCAGG GCCAGACCTG GCCGTGCGGT
AAAGATAAGG ACGGCGATTT CCTCAGCTAT TTTGGTCGCG GGGCAAAACA GCTTTCTTAT
AACTACAACT ATGGGCCTTT CTCTGACGCG ATGTATGGCG ACGTTCGCCC TCTGCTGGAT
AAACCCGAGC TGGTGGCGGA TACCTGGATG AACCTGGCGA GCGCCGTCTT CTTCTTTGTG
TATCCGCAGC CGCCGAAGCC GTCTATGCTA CATGTGATTG ACGGTACCTG GCAGCCAAAC
GATCGCGATA AAGCAAACGG CCTGGTATCA GGTTTCGGCG TCACCATTCA GATCATCAAT
GGCGGCGTGG AGTGCGGCGG CGCAGATGAG AATGCGCAGT CGCTTAACCG TATCGCCTAC
TACAAAGAGT TTGCCAACTA CCTGAAAGTG CCGGTGCCGG CTGACGAAGT GTTGGGCTGT
AAAAAGATGA AGCAGTTCGA TGAAGGCGGC GCTGGCGCGT TACCGATCTA TTGGGAACAA
GACTGGGGCT GGAGCGCCGA TACTGCGGAC GGTAAAACCT ATTCCTGCCA GTTAGTGGGA
TATCAGACGC CGTATACCGC CTTTAAAGAG GGCGACTACA CGAAATGTGT ACAGCATTAT
TTCAACGTTA ATGTCGTCGA TGATAGCGGA ACCACTGAAC CGGATGTCAC GCCAACTCCG
GCGCCAGTGA CGGATGAAAA CGTGGCGCCA GTCGCGCGCA TTGCCGGACC GGTCGGGGCG
GTGGAAGCCG GTAGCCCGGT TTCACTCAGC GCGGAAGGAT CGACCGACGC GAATGGCGAC
AAGCTCACCT ATACCTGGAT GTCGCAGGAT GGCAAAACGC TGAGCGGCCA GGATAAAGCC
GTTGTGATTT TCAACGCGCC GGATGTCACT CAGAACACCC AGTATGTGGT GAATCTGACC
GTTAGCGACG GTACGCTCTC CAGTACAGCG GTTTATACGC TGAATGTGAA AGCGAAGGCC
GCCGCTGCGG ATGACGAAGA TAAGACCACC AGCTATCCTG CCTGGAGCAG CAGCCAGAAA
TGGAATCCGG GCGACATCGT CAACAGTAAT GGCGCATTGT ACCAGTGCAA ACCGTTCCCG
GAAGGCTCAT GGTGTAATGT TGCGCCTGCC TACTATGAGC CCGGCGTAGG GATTGCCTGG
GCCGATGCAT GGAACGCATT GTAA
 
Protein sequence
MGLQKTLALS AVAAGIMLSL SGAQAAPLLS SSEPMTINAS DLAAKEKALT DFPLMEAVKS 
SIQTLDNSAV EQIEPGRAAN PANVKRVESI LKEADWDYLF PMRAPEYTYS NFLKAIGKFP
AVCGTYTDGR DSDAICRKTL ATMFAHFAQE TGGHESWRDI PEWRQALVYL REVGWTEGQK
GGYNGECNPD VWQGQTWPCG KDKDGDFLSY FGRGAKQLSY NYNYGPFSDA MYGDVRPLLD
KPELVADTWM NLASAVFFFV YPQPPKPSML HVIDGTWQPN DRDKANGLVS GFGVTIQIIN
GGVECGGADE NAQSLNRIAY YKEFANYLKV PVPADEVLGC KKMKQFDEGG AGALPIYWEQ
DWGWSADTAD GKTYSCQLVG YQTPYTAFKE GDYTKCVQHY FNVNVVDDSG TTEPDVTPTP
APVTDENVAP VARIAGPVGA VEAGSPVSLS AEGSTDANGD KLTYTWMSQD GKTLSGQDKA
VVIFNAPDVT QNTQYVVNLT VSDGTLSSTA VYTLNVKAKA AAADDEDKTT SYPAWSSSQK
WNPGDIVNSN GALYQCKPFP EGSWCNVAPA YYEPGVGIAW ADAWNAL