Gene Hoch_2130 details

Gene Information

Locus tag: Hoch_2130
Symbol:
ID: 8544516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2953199
End bp: 2954305
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 70%
IMG OID: 646386837
Product: peptidase S1 and S6 chymotrypsin/Hap
Protein accession: YP_003266568
Protein GI: 262195359
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
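
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2953199
end_bp = 2954305
gene_length_bp = 1107
protein_length_aa = 368

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(span, codons, coding_codons)  # prints "1107 369 368"
```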


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCATGC CTCGTCCTCT TGTCGTATGG ACCGGCCGCG CGCGCTGGTT GGTATCTCTG 
GCCATCTGCG CCGCGCTGCT GAGCGGCGGA CACCTGGCCC ACGCTCAGAG CACCGCGCCC
GGCGATGACG ATCTCGCCGA CCTGCTGCCC GAGGAGCGCA ACACCGTCCG CCTGTTCGAG
CGGACCGCGC CCTCGGTGGT CTTCGTGATC AACCGCGGCG TCCAGCGCGA TCTGTTCTCG
CGCCACACCG GCGAGTATCA GCGCGGCACC GGCTCGGGCT TCGTCTGGGA CAAGAGCGGC
CACATCGTCA CCAACTACCA CGTCATCCAG GGCGCCTCCT CGGTCGCCGT GGTCATCGAC
AACGAGGAGT ACCCGGCGCG CGTGCTCGGC GCCGAACCCA AGCGCGACAT CGCCGTGCTG
GCGCTCGACG GCGCCGCCAA GCGCGCGCTC ACGCCGGTGC GTCTGGGCCA CGACGAGCGC
CTGCGCGTGG GCCAGCACGT CATCGCCATC GGCAGCCCCT TCGGCCTCGA CCGCACGCTC
ACCACCGGCG TGATCTCGGC CCTGGGCCGC GACATCGTCG GCATCGGCGG CGTCACCATC
CCCGACATGA TTCAGACCGA CGCGTCGATC AACCCCGGCA ACTCGGGCGG CCCCCTGCTC
GACTCGGCCG GTCGCCTGAT CGGCATGAAC ACCATGATCT ACTCCAAGAG CGGCTCCAGC
GCCGGCATCG GCTTTGCCGT CCCCGTGCGC TTTCTGCGCC GCCTGGTGCC GCAGATCATC
CGCACCGGCC ACGCCATCAC CCCCGACCTC GGCGCCCGCT ACTTCGATGA CGACGTCGCC
CGCCGCCTGC GCGTCGAGGG CGTGATCATC CGCGCCGTGC CGCGCGGCTC CAGCGCCGCA
CGCGCCGGCT TCCGCGGCAC CGCGCGCACG CGCCGGGGCA ATATCCGCCT GGGCGACATC
ATCGTCGGCG TCGATAGCCA CCGCGTGCGC AACTACGACG ATCTCTACAA CACCTTCGAC
AACTACAAGC CCGGCGACCG CGTGGTCATC CACATCGTGC GCGACGGTCG CCGACAACAG
CTCGAGGTCG TCCTCGAAGC GCTGTAG
 
Protein sequence
MLMPRPLVVW TGRARWLVSL AICAALLSGG HLAHAQSTAP GDDDLADLLP EERNTVRLFE 
RTAPSVVFVI NRGVQRDLFS RHTGEYQRGT GSGFVWDKSG HIVTNYHVIQ GASSVAVVID
NEEYPARVLG AEPKRDIAVL ALDGAAKRAL TPVRLGHDER LRVGQHVIAI GSPFGLDRTL
TTGVISALGR DIVGIGGVTI PDMIQTDASI NPGNSGGPLL DSAGRLIGMN TMIYSKSGSS
AGIGFAVPVR FLRRLVPQII RTGHAITPDL GARYFDDDVA RRLRVEGVII RAVPRGSSAA
RAGFRGTART RRGNIRLGDI IVGVDSHRVR NYDDLYNTFD NYKPGDRVVI HIVRDGRRQQ
LEVVLEAL
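
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that relationship, not the tool used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence into blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCATGC CTCGTCCTCT TGTCGTATGG"))  # prints "MLMPRPLVVW"
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should, under the same assumptions, yield the complete protein.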