Gene Hoch_3767 details

Gene Information

Locus tag: Hoch_3767
Symbol:
ID: 8546160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5178765
End bp: 5179907
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 67%
IMG OID: 646388437
Product: signal peptidase I
Protein accession: YP_003268160
Protein GI: 262196951
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0681] Signal peptidase I
TIGRFAM ID: [TIGR02227] signal peptidase I, bacterial type
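
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short check can illustrate. This is only a sketch: it assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5178765
end_bp = 5179907

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints "1143 380"
```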


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0219598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.176079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGACTT CTGCAGCAGA CGCGCGCATC GAGCGCAAGC TTCACACCGA GGTCAAGAAG 
CTGGTGCGCG AGACCCGCGC CAGTCTCGGC CGGCACGGCT CGCGACTCAG CAACCGGGTG
CGCGAGGATA TCGAGGGCCG GGTCGAGCGA CTCGAGACCG CGATGCGCGA GAAGGACGGC
ACCACCATGC GCGTGGAGCT GCCGGTTCTC GACGCCATGG TCGACGAACA GCTCGCGTTT
GCGCGCAAGT CGGCATTTCG TGAGTACGCC GAGTCGATCG GCATCGCCGT GATCATCGCC
GTGCTGCTGC GCACCTTCGT GATCGAGGCG TTTAAGATCC CGTCGGGCTC GATGATCCCG
ACCATGGAGA TCGGCGATCA CATCTTCGTC AACAAGTTCC TCTACGGCAT CCGCATCCCG
GTGCTGGGCG TGAAGTTCTT CCAGTTCCGC AAGCCCGAGC GCGGCGAGGT CATCGTCTTC
GAGAAGCCGC GCGACCGCGA GCGCCGCGAC TTCATCAAGC GTATCGTGGC CGTGGCCGGC
GACACCCTGG AGGTGCGCTG CGGCATGCTG TACGTCAACG GTGAGCGCGT GAGCCGCGAG
CTGGTGGCGG CCAGCGATTT CCACTGGGAT GACCCGCCCG AGCCCGGCAC CGGCGACACC
TGGACGCGGG TGGAGAGCAG CCGCTACCGC GAGACCCTGG GCGAGACCCG CTACGACACG
CTCTACGATC CCGACCGGCC CGAGTACGAG CACCTGGTCG ACGCCGGCGG GGCCGCGGGC
TGGGGCGCGT CCTCGAGCCT GACCAGCCGC GACTTCCCCA TGCAGAGCAG CGCGATCTTC
CCCGACTTCA ACCGCATCCC GCGCTGCGCC GACCACAGCG AGGAGAGCAG CTCGATCGGC
TGCTACGCGC CCTCGCCGCA GACGCAGAAG GGCGACGCCG GGGCGTGCGC GCTGCAGCGG
CACTACGTGG TGCCCGAGGG CCACGTCTTC GGCATGGGCG ACAACCGCGA GAACTCCAGC
GACTCGCGGC AGTGGGGTCC GGTGCCGCTC GACAATATCA AAGGCAAAGC GCTGTTCATC
TGGTGGTCGT CGAACGACAA GGTAGGTGTG CAGTGGGATC GTATCGGTAA GGTCGTAGAA
TGA
 
Protein sequence
MATSAADARI ERKLHTEVKK LVRETRASLG RHGSRLSNRV REDIEGRVER LETAMREKDG 
TTMRVELPVL DAMVDEQLAF ARKSAFREYA ESIGIAVIIA VLLRTFVIEA FKIPSGSMIP
TMEIGDHIFV NKFLYGIRIP VLGVKFFQFR KPERGEVIVF EKPRDRERRD FIKRIVAVAG
DTLEVRCGML YVNGERVSRE LVAASDFHWD DPPEPGTGDT WTRVESSRYR ETLGETRYDT
LYDPDRPEYE HLVDAGGAAG WGASSSLTSR DFPMQSSAIF PDFNRIPRCA DHSEESSSIG
CYAPSPQTQK GDAGACALQR HYVVPEGHVF GMGDNRENSS DSRQWGPVPL DNIKGKALFI
WWSSNDKVGV QWDRIGKVVE
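
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating the first line of the gene. The sketch below is a minimal illustration, not the tool used to produce this record. It builds the codon table from the standard NCBI base ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCGACTT CTGCAGCAGA CGCGCGCATC GAGCGCAAGC TTCACACCGA GGTCAAGAAG"
peptide = translate(first_line)
print(peptide)  # prints "MATSAADARIERKLHTEVKK"
```

The result matches the first 20 residues of the protein sequence (MATSAADARI ERKLHTEVKK). Passing the full gene sequence to the same function should reproduce the whole protein, with the final TGA codon read as the stop.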