Gene Haur_2750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2750 
Symbol 
ID5734631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3506594 
End bp3508159 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content52% 
IMG OID641279893 
Productchitinase 
Protein accessionYP_001545516 
Protein GI159899269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGA TCCATCGTCG GTTTCGGTTT CTAGCCTTTA TCTTAATTCC CGTTCTTGTT 
ATGTTGATGG CATTGAGCCA ACAAAAGGTT CAGGCAGCCG CACCTGCATG GGATGGTAAC
TATCATGCCT ATGCGATTGG GGCACGAGTA AGCTATGCTG GTGGCGAATA TGAATGTATT
CAACCGCATA CTTCATTGCC AAATTGGAAT CCAGTTGATG TTCCAGCCTT GTGGCGAGCG
GTTTCACCAA GCAATCCAAC CGCGACTTCA ATTCCAGCCA CCGCAACTCC ACGCCCTGCA
ACATCAACGC CTCTGCCACC AACGGCAACG CCACGACCTG GCACGGCCAC GCCAACGCCA
AATAATCCAA CCGCAACCCC ACGCCCAGCA ACTGCAACCC CAGTCTTGCC AACTGCAACC
ACCACACCTG GCGGTGGCAA ACGCATCATC GGCTACTTCG CTGAATGGGG CGTCTACGGT
CGCAACTATC ACGTTCGCAA TATCAAAACC AGTGGCTCAG CTGCTAAGTT GACCCACATC
AACTATGCCT TTGGCAATGT TGTCAATAGC CGCTGCCAAT TGGGCGACAC CTACGCCGAT
TATGATCGGG CTTATAGCGC TGCTGAAAGC GTCGATGGCG TAGCTGATAC TTGGGATACT
GGCGTATTGC GCGGTAGCTT TGGCCAATTG CGCAAACTCA AAGCCGAATT CCCCCACTTG
AAAGTGTTGA TTTCATTGGG TGGTTGGACG TGGTCGGCAG GTTTCTCGGA TGCTGCCTTG
CCAGCTAACC GCGCAGCCTT CGTCAAATCA TGTGTTGACC TGTTCATCAA AGACCCACGC
TGGGCTGGCG TATTCGATGG CATCGATATC GACTGGGAAT ACCCCGCAGC TTGTGGTAAC
ACCTGTAACT ATCGCCCTGA AGATACCCAA AACTTCACCG CCTTGTTGAG CGAATTCCGC
AGCCAATTGA ACGCAGTTCG CCCAGGCTTG TTGTTGACGA TTGCTGCCCC AGCCGACCCA
GCCAAGATCG CCAAGATTCA GGTTGGTCAA ATTCACCAAT ATCTCGATTT CATCAACATC
ATGACCTACG ACTTGCACGG CGCTTGGGAA GCTAACACCA ACTTCCAATC AAACTTATAT
TCAATCGCTG GCGATCCCGG CCCAGTCTAC TCGGTTGATA TTGCTGTCAA CGCTTGGTTG
AATGGTGGAA CTCCAGCCGA TAAAGTTGTG GTTGGTGTAC CATTCTATGG TCGCGGCTGG
AAAGATGTAC CAAGCACCAA CAATGGCTTG TTCCAACCTG GCTCAGCTGC TCCAGCAACT
TACGAAGCTG GCATCGAAGA TTACAAAGTG TTGAAAACCA AAGGCTTGAC CCGCTACTCA
AATAGCGCTG CTGGCGCTGC TTGGCTCTAC GGCAACGGCC AATTCTGGAC CTATGATGAC
CCAGCCATTA TGAAAGTCAA GACTGACTAT GTAAAAGCCA AAGGTCTTGG CGGCACGATG
TTCTGGGAAT TGAGCGGCGA CACCACGAAT GGCGAATTGA TCAACGCCCT TTACCAAGGT
CGCTAG
 
Protein sequence
MNLIHRRFRF LAFILIPVLV MLMALSQQKV QAAAPAWDGN YHAYAIGARV SYAGGEYECI 
QPHTSLPNWN PVDVPALWRA VSPSNPTATS IPATATPRPA TSTPLPPTAT PRPGTATPTP
NNPTATPRPA TATPVLPTAT TTPGGGKRII GYFAEWGVYG RNYHVRNIKT SGSAAKLTHI
NYAFGNVVNS RCQLGDTYAD YDRAYSAAES VDGVADTWDT GVLRGSFGQL RKLKAEFPHL
KVLISLGGWT WSAGFSDAAL PANRAAFVKS CVDLFIKDPR WAGVFDGIDI DWEYPAACGN
TCNYRPEDTQ NFTALLSEFR SQLNAVRPGL LLTIAAPADP AKIAKIQVGQ IHQYLDFINI
MTYDLHGAWE ANTNFQSNLY SIAGDPGPVY SVDIAVNAWL NGGTPADKVV VGVPFYGRGW
KDVPSTNNGL FQPGSAAPAT YEAGIEDYKV LKTKGLTRYS NSAAGAAWLY GNGQFWTYDD
PAIMKVKTDY VKAKGLGGTM FWELSGDTTN GELINALYQG R