Gene Haur_0324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0324 
Symbol 
ID5732234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp385895 
End bp387979 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content52% 
IMG OID641277448 
Productcellulose-binding family II protein 
Protein accessionYP_001543104 
Protein GI159896857 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGTCGA ACCTGAGCCG TTCGCGGTCG TTGTTGACCA TTGCGCTCCT TGGGGTCTTG 
CTTGGTTCAC TGAGTTTGTT ACCAACGCAC ACCACCAAGG CCGCGATTAC CGCCTATCAA
GCCGCCGATC CAAATCTCGC CCCGTATGCC CAAAATGTGC TCGATCGTTT TGTCCAATAC
AAAGGTCAGT ATTGGCTAGG TGGTCAGCAA GAAGTTCACT GGGATAATTC GCGCAAAGAT
GAAATGTCGA ACGCAGTTTT TGCCCGCACC AATCCACAAC GCTACCCCGC GCTGCGTGGC
TGGGATTTCC CGATCGGCGG CGCACTGCCC AACGATGGCC AATGGATGAT CGACGCGATC
ATCAGCGATT GGACCAATGC CAACGTTATT CCAACCATCA GCCAACACTG GACACCGCTT
GCCAGCCAAG GCACTAACCA TCAAGATATG TTCACGGTGG TTGATATTGA TCGGATGTTT
GTTGATGGCA CGACTGAACG CACTAATTAT CTGATTTGGC TCGATAATAT TGCTGATGAT
TTGCAACAGC TCGAAGATGC CAACGTGCCA GTGCTATGGC GACCCTACCA CGAAGCTGGT
GGTGGTTGGT TCTGGTGGGA TAAAGATAAT GGAGCCACCA ATTATCGCCG CCTTTGGGAT
GATATGTTTA CCTATTTGGT AACCACCCGT GGCCTGCACA ACCTGATCTG GGTTTGGACA
CCTGGGGTAA AAGGAGTCAG CACGGCTTGG TATCCCGCTG GCCAAGCCGA TATTTTGGGC
AGCGATGTAT ATAACGAAAC TTCGGGCAAT TACGTGAGTT GGTATGAAGA TCTTGGGCGT
TTCTCGCAAA CCAAGATTAA AGCCCTCAGC GAAACCGACT ACATGATGGA CCCTGCTCTG
TTGAGCAGTG CGCCATTTGC CTACTTTATG ATTTGGCACA CCGATATGTT TTATCGCAAT
ACCGATAGCC GCATTCAAAG CACCTATGCG CATTGGGCAA CCCTGAATCG AACCAATGTT
GGCCAGATTT GGAATGGTAC TCTTGGGATC GCTCCCACAG CAACGCCTGG TACACCAACG
CCTACGCCAA TTCCTGGGAT CGTGGTCAGC GATTTTGAAG ATGGCACGCT GCAGGGCTGG
ACTGGCACAA ACCTGGTTTC AGGGCCAACT GTCAACAACG AATGGGCCGC CAATGGTCAA
CGTTCGATCA AAGCTCAAGT TAATTTAGCC GCTACGCCTG CCGATATTCG GCTCGCACAA
GCACTGGATT TAACTGGTCA ATCACGCATC CAAATTCGCC TGAGTGCCCA AAATGTTGGG
AGCGGTCTCA GTGCTAAGCT CTACATCAAG ACGGGAAGCG CCTGGAATTG GAAAGATAGC
GGAACTGTGC TGATCGATTC GGGCATCAGC TTGCTGACAA TTGAATTAGC GGGTGTGCCT
GATATCAACC AAGTGCGCGA GTTGGGGGTT GAATTCAACG CACTCACGGG GAATAGTGGC
ACAGCAACGA TCTACGCCGA CTATCTGACC GTGGGTGTGG TCAATAGCAA CCAACCAACC
CCAACCACGG GGCCAACGGC GACTGCAACG CGCACGCCAA CCCAAGCGCC AACGGCTACG
CCAACCAATA TTCCAACCGC AACGCGCACG CCAACCCAAG GCCCAACCGC GACTGCAACC
ACCATTCCAA CCGTCACGCC AACGAATATT CCAACCGCAA CGCGCACGCC AACCCAAGTG
CCAACTGCTA CGCCAACCAG CACTGGCGGC GCTTGTAAGG TTGATTTCAA GGTGACAAGC
CAATGGGGCG TAGGCTTTAT CGCCGATGTT ACCGTAACCA ATTTGCAGCC AAGCGCCTTA
AATGACTGGA ATGTGAAATT CAACTTCCCC AGTGGCCAAA CCATCAGCAA CCTATGGAAT
GGCACACTCA GCCAGACTGG TAGCGCAGTT ACCGTCACGA ATGCTGGCTG GAATGGCTAT
CTTGCAGGTA ATGGTGGCAC AGCCAACTTT GGTTTCCAAG GTGTTGGTAG TGTTCCAATC
TTGCCAAGCA ACAGCTTCCA ACTCAATGGC GTAACCTGCC AATAA
 
Protein sequence
MWSNLSRSRS LLTIALLGVL LGSLSLLPTH TTKAAITAYQ AADPNLAPYA QNVLDRFVQY 
KGQYWLGGQQ EVHWDNSRKD EMSNAVFART NPQRYPALRG WDFPIGGALP NDGQWMIDAI
ISDWTNANVI PTISQHWTPL ASQGTNHQDM FTVVDIDRMF VDGTTERTNY LIWLDNIADD
LQQLEDANVP VLWRPYHEAG GGWFWWDKDN GATNYRRLWD DMFTYLVTTR GLHNLIWVWT
PGVKGVSTAW YPAGQADILG SDVYNETSGN YVSWYEDLGR FSQTKIKALS ETDYMMDPAL
LSSAPFAYFM IWHTDMFYRN TDSRIQSTYA HWATLNRTNV GQIWNGTLGI APTATPGTPT
PTPIPGIVVS DFEDGTLQGW TGTNLVSGPT VNNEWAANGQ RSIKAQVNLA ATPADIRLAQ
ALDLTGQSRI QIRLSAQNVG SGLSAKLYIK TGSAWNWKDS GTVLIDSGIS LLTIELAGVP
DINQVRELGV EFNALTGNSG TATIYADYLT VGVVNSNQPT PTTGPTATAT RTPTQAPTAT
PTNIPTATRT PTQGPTATAT TIPTVTPTNI PTATRTPTQV PTATPTSTGG ACKVDFKVTS
QWGVGFIADV TVTNLQPSAL NDWNVKFNFP SGQTISNLWN GTLSQTGSAV TVTNAGWNGY
LAGNGGTANF GFQGVGSVPI LPSNSFQLNG VTCQ