Gene Hhal_0209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0209 
Symbol 
ID4710979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp241890 
End bp242945 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID639854668 
Productendoglucanase 
Protein accessionYP_001001805 
Protein GI121997018 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.405756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCACC GCCGACTGAT CCGCTGGGCC GGCGGCGCCC TGATCTGCGC CCACGGACTG 
CTCCCCGCCG GACTCCCGGC GGGCAGCGCC CCGCCCCCCG CCGCACCCCC GGTGGCCTTT
GGCGTGGCCC TGGACGGAGC TGCCAGCCGC GAACGCCTGG CCCACCTGGA ACAGGCACTG
GAGCTCGAGA TCGGCCTGGT GGCCATCTTC ATCCAGTTTC CCGAGGATCC GCGGCACGAC
AACTTCCCGG CGGAGCAGCT GGACGCCATC CGTCAGGCCG GCGGGCGGCC GGTCCTGACC
TGGGAGCCGA TGTACATCGC CGATGGCGAG GAGCACGCCA TCCCCGCCGA GGAGCTGACC
GGCGGCGCCT ACGACGCCTA CATCCGTCGT TTTGCGCGCG GCGTCAAGGC GTTCCCCGAG
CCGGTAATCA TCCGCTTTGC CCACGAGATG AACCTCGACC GCTACCACTG GGGCTCCACC
GCCGAGGACT ACGGCCCATC GGCCCCGACG CGGTATCGGG CGATGTTCCG GCACGTGGTG
GAGATCTTCC GCGACGAGGG GGCAGCGGAG CACGCCCGCT TCGCCTTCAA CCCCAACGCC
GAGTCGGTCC CCTCGCCCGA CCGCGACCCG GACGCCGACT GGAACCGGCC GGAGGCGTAC
TACCCCGGCG ACGCCTACGT CGACGTCCTG GGCATGGACG GCTACAACTG GGGCACCACC
CGGACCCGCG AGGAACACGG CTGGGACAGC CGCTTCCAGT CCTTCCAGAC GATCTTCGAG
CCGCTCTACC GCACCCTGCG GGATCTCGCC CCCGACAAAC CCATCTACGT CTTCGAGACC
GCCACCGTCA CCGACGGGGG CGACAAGGCG GCCTGGATCG AGCAGGCCGC CGCGTCCGCC
GTGGCCTGGG AGCTGGCCGG GCTGGTCTGG TTCCATAACG ACAAGGAAGA GAACTGGCGG
CTGGATACCG GTGTCACCCC GGAGGACCTT GAACCGCTGC GGCGGATGAT CACCGACCCC
GAGGCCCTGC TGGAGGGACG ATCCCGTGGT GACTGA
 
Protein sequence
MNHRRLIRWA GGALICAHGL LPAGLPAGSA PPPAAPPVAF GVALDGAASR ERLAHLEQAL 
ELEIGLVAIF IQFPEDPRHD NFPAEQLDAI RQAGGRPVLT WEPMYIADGE EHAIPAEELT
GGAYDAYIRR FARGVKAFPE PVIIRFAHEM NLDRYHWGST AEDYGPSAPT RYRAMFRHVV
EIFRDEGAAE HARFAFNPNA ESVPSPDRDP DADWNRPEAY YPGDAYVDVL GMDGYNWGTT
RTREEHGWDS RFQSFQTIFE PLYRTLRDLA PDKPIYVFET ATVTDGGDKA AWIEQAAASA
VAWELAGLVW FHNDKEENWR LDTGVTPEDL EPLRRMITDP EALLEGRSRG D