Gene Hore_21810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_21810 
Symbol 
ID7313729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp2371686 
End bp2372810 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content41% 
IMG OID643612634 
Productchitinase 
Protein accessionYP_002509922 
Protein GI220933014 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGTAT CTCCGGTTAT ATACTGTGCT GCAACATCCA GTGAAGAAGG CCCCGGTACT 
TTTGACTGGT TAAAAGGTAT TCTGTTATTA ATTATTTCTT TTTTTATTAA TAACTTTGTT
GATAAAAATG AAGAAGATCA GGAAGCAAGA CCCGGTGAAT CGCCTCTGGA TGAAGATATT
ATCTCAAATC GGGAAATACT GGGCTTTTAT GTCAACTGGC TAACCCCATA TGCTAATTCA
TATGATGCCA TGGTTTCTAA CCACAGGTAT GTTGACATGG TAGCACCCTT CTGGTTTACA
GCCAACCCTG ATGGTACAAT CAAGAGTAGA TACGGGGGGC ACCAGTATGA GGTAGATTCC
TTTTCCAAAA GACAGGGTCT TGAATTACTA CCTCTGATTA ATAACAACCA GAAAAATAAC
ATGATCCTGG TTGATTCAGA TGTCAGGAGT AAGACGATAA AAAATATAGT TAAGCTGGTG
GAAAAATATA ATTATAATGG AGTAAATATT GACTTTGAAT TTATTCCACC CTGGACCCGT
AATGGTTATA CCCAGTTTAT TAAAGAGCTT TCCAGTGAGT TAAACAAGAA AAACAAAAAA
CTTACAATCT CCGTTTTTCC TAAAATAGAT GTCCCGATGG AGTTACAGGG AGCCTATGAT
TATGCAGCCC TGGGAAAACT GGTTGACAGG GTAGTTATCA TGACCTATGA CCACCACTGG
CCCTCCGGTG ACCCCGGACC GATTGCCCCC ATAAACTGGG TCGAAAAGAA TATTAAATAT
GCACTGGAAT ATATACCAAA TGAGAAACTT CTAATAGGAG TAGCTAACTA CGGCTATGAC
TGGCCTGAGG GGGGACCCGG TAGGCCCATC AGTGCTAAAG AAGCAATGAA CCTGGCCCGG
GAAAAGGGCG TTAAAGTTCA ATGGGATACA CCTTCCCAGA GCCCCTATTT CTATTACCAG
GATAACAGTG GCATTAAACA CGAAGTCTGG TTTGAATCAA GTAGTAGCCT TGCCTTCAAA
CTGGAGCTGG TTAAGAAATA TAATCTGAAA GGTATAGCCA TCTGGCGGCT GGGAAATGGT
ACTGACCGGT TCTGGGAGAT TATAGACAAT AAATTAGGTC AGTGA
 
Protein sequence
MLVSPVIYCA ATSSEEGPGT FDWLKGILLL IISFFINNFV DKNEEDQEAR PGESPLDEDI 
ISNREILGFY VNWLTPYANS YDAMVSNHRY VDMVAPFWFT ANPDGTIKSR YGGHQYEVDS
FSKRQGLELL PLINNNQKNN MILVDSDVRS KTIKNIVKLV EKYNYNGVNI DFEFIPPWTR
NGYTQFIKEL SSELNKKNKK LTISVFPKID VPMELQGAYD YAALGKLVDR VVIMTYDHHW
PSGDPGPIAP INWVEKNIKY ALEYIPNEKL LIGVANYGYD WPEGGPGRPI SAKEAMNLAR
EKGVKVQWDT PSQSPYFYYQ DNSGIKHEVW FESSSSLAFK LELVKKYNLK GIAIWRLGNG
TDRFWEIIDN KLGQ