Gene Haur_2177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2177 
Symbol 
ID5734064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2758018 
End bp2759754 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content51% 
IMG OID641279318 
Productglycoside hydrolase family protein 
Protein accessionYP_001544945 
Protein GI159898698 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0141597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC AACGCGCCTT GAGCATGAAG GTTTATTTGC TGTTGGCTCT GATCATGCTG 
GCTGGGAGTT TTGGTGCTGG ATTTGATCAA CCCAAATCAG CCCAAGCCCA AATCGCCTAT
AAAATTGTGG GCTATTTGCC ATCGTGGCAA GGCAGTGTCA ACGGCACTCA AATTGATAAA
CTGACCCACA TTAACTATGC TTTTCTGTTG CCCAACAACG ATGGCAGCCT CAAGCCAATC
GAAAACTCCA GCAAATTGCA AGAGTTGGTA TCGGTCGCTC ATAGCAAAAA CAAAAAAGTG
CTAATTTCAG TTGGTGGCTG GAACGACGGC GATGATAGTG CCTTCGAAAG TATCGCCGCC
AACGCCAGCT ATCGCACGAA TTTTGCCAAT AACCTCAATA ATTTCGTCAA CCAATATAAT
CTCGATGGGG TTGATATTGA TTGGGAATAT CCTGAGGCTG GCGATTTTTA CTTTGAAACG
ATGTCGGCGA TCCGCAATCG CATCGGCTCG GGCAAATTAT TAACTGCGGC AGTCGCTGCA
ACCAATGCTG GCGGCTCAGG CGTAACTAGC AATGCCATCG AAATTATGGA TTACATCACG
CTCATGGCCT ACGATGGTGA TGGCGGCGCT GGTCATTCGC CCTATAGTTT AGCTCAGCAA
TCGCTCGATT ATTGGGGCAC CAAAACTAGC AATAAGGCTA AATTAATCTT GGGCGTGCCG
TTTTATGCTC GCCCAGGCTG GTATGGCTAC AACACCTTAC GCGCTGGTGG TTGCTCAGCC
GATAGCGATA GCTGCTGGTA TGGCGGCGCA ACCCAATACT ACACGGGCCG CCCAACCATG
CGCGCCAAAA TCGATTTGAT GAAAAGCAAG GGCGGCGGTG GGATTATGAT TTGGGAATTG
AGCCAAGATA CGGCTGTTAC CAGCAGCGAT TCCTTGCTCA AAACCATTGC CGATCAGCTT
GGCACACCCA GCACGCCAAC TGGCAATTTA GCCCTGAACA AAGCCGCTAC TGCTTCATCA
ACCGAAAATA GCAGCTATGG CGCAAATCTC GCGTTCGATG GCAACAGCTC CACCCGTTGG
TCGAGCACAT GGAGCGATCC ACAATGGCTA CAGGTTGATT TAGGCGCGGT GTATGCGATC
AAACAAGTGG TTTTAAAATG GGAAGCCGCC TTTGGCCGCG CCTATCAAGT GCAAGTTTCC
AACGATGGCA ATACCTGGCG CTCGATCTAT AGTACAACCA ATAGCGATGG CGCAACCGAT
GATTTAGCAG TTTCCGGCAT TGGGCGCTAT CTGCGCGTCT ACGCCACAAC TCGCGCCACC
GAGTGGGGCT ACTCGCTCTG GGAAGTCGAG GTTTATGGCA ATGCGTTGGC AACCAGCGCT
TCATCGACCG AAGCAGGCGG CAGCACGACC AACGCGATTG ATACTAACGG CACTACTCGT
TGGAGCGCAG GCTTGCCCCA AGCAGCAGGC CAGTGGTATC GGGTCGATTT TGGCAATAAC
CAAAGTTTCA GCCAAATCAC GCTTGATGCA GGCCCATCCT ACGGCGATTA TCCACGCAAC
TTCCAAGTGC AAGTTTCCAA CGATGGCAAT AGCTGGGCAA CCGTAGCTAC TGCCACTGGT
ACAACCCAAG CAGTAACCGT CAACTTCGCT GCTCAAAATA GCCGTTATCT GCGGGTATGG
CTCACCAGCA CTGGCGGCGG CAGTTGGTGG TCGATCCACG AATTAACCGT GCGCTAA
 
Protein sequence
MNRQRALSMK VYLLLALIML AGSFGAGFDQ PKSAQAQIAY KIVGYLPSWQ GSVNGTQIDK 
LTHINYAFLL PNNDGSLKPI ENSSKLQELV SVAHSKNKKV LISVGGWNDG DDSAFESIAA
NASYRTNFAN NLNNFVNQYN LDGVDIDWEY PEAGDFYFET MSAIRNRIGS GKLLTAAVAA
TNAGGSGVTS NAIEIMDYIT LMAYDGDGGA GHSPYSLAQQ SLDYWGTKTS NKAKLILGVP
FYARPGWYGY NTLRAGGCSA DSDSCWYGGA TQYYTGRPTM RAKIDLMKSK GGGGIMIWEL
SQDTAVTSSD SLLKTIADQL GTPSTPTGNL ALNKAATASS TENSSYGANL AFDGNSSTRW
SSTWSDPQWL QVDLGAVYAI KQVVLKWEAA FGRAYQVQVS NDGNTWRSIY STTNSDGATD
DLAVSGIGRY LRVYATTRAT EWGYSLWEVE VYGNALATSA SSTEAGGSTT NAIDTNGTTR
WSAGLPQAAG QWYRVDFGNN QSFSQITLDA GPSYGDYPRN FQVQVSNDGN SWATVATATG
TTQAVTVNFA AQNSRYLRVW LTSTGGGSWW SIHELTVR