Gene Hoch_3453 details


Gene Information

Locus tag: Hoch_3453
Symbol:
ID: 8545842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4767256
End bp: 4769652
Gene Length: 2397 bp
Protein Length: 798 aa
Translation table: 11
GC content: 70%
IMG OID: 646388121
Product: Spore coat protein CotH
Protein accession: YP_003267848
Protein GI: 262196639
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5337] Spore coat assembly protein
TIGRFAM ID:
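
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes one terminal stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4767256
end_bp = 4769652
gene_length_bp = 2397
protein_length_aa = 798


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # True True
```

Under these assumptions, the reported gene length (2397 bp) and protein length (798 aa) agree with the listed coordinates.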


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0950404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.13091
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGT CGCGTCTGTG TGTTCCGTTG GGTTTTCTTG CCGTCGCGAT CGCGGGCGGC 
GGGTGCACGG GCACCTTTGG CAATCCCTTC GAACCGCCGG CGCCACCGGC CAATAATCCC
GACGGAGGCA GCGGCGACGG CGACGGCGGC GTGGTCACCG GACCGCCCCC GCCGCAGATC
TTCCTGTCCG AGATCATGTA TCACCCGATC CTCGAGGACG ATTTCATCGA TCGGCACGAG
TTCATCGAGA TCTACAACCC CAACGACGAA GCGGTGTCGC TCGCGGGCTG GTCGCTCGGC
GGCGGCATCG ACTACACCTT CCCGGCCGGG GCCGAGATCG CGGCCGGCGG CTACGTGGTC
GCGGCCAAGA GCCGGGACGA CCTGCTCAGC CTCGAGAGCT ACGCGCTCGA CGCCAGCGCG
GTGTACGGCG ACTTCGACGG CGCGCTGGCC AACGGCGACG ACACCGTCCT GCTCTTCGGC
CCCGGCGGCC GGGTGGTCGA CAGCGTGGTC TACGAGGACA GCGCGCCCTG GCCCGAGGCC
GCCGACGCGC TCGGCGCCAG CCGCCAGTTT TTGCCCGCCG AGATCGGCGA CAATCTCGAA
GAACATCGCT ATTTCGGACG CTCGCTCGAA CGCATCAGCT TCGACCTGCC GGCCACCGAG
GTGGCCAACT GGGACGTGTC CCCGCTCGAC GGCGCCACGC CCGGTCGCGT CAACAGCGCG
AGCCGCGAGA CCCCGCTGGC GATCGCGCTC ACGGTGAGCG CCAGCCCGGC GCGCCTGGGC
CAGGCGGCCG GCGATGAGGA CCCGCTGATC CGGGCCGGTG ACGAGGTCGC CGTGCGCATG
CGCCTGAGCG ATGTCGGCTC CATCGACGGC GCCGAGATCG AGTATTTCGT CGACGACCTC
AACCGCGAGG ACGAGGAGCT GTTTACCGCG ACGCTGCTCG ACGACGGCGC CGGCAACGAC
CTCAGCGCTG GCGACCGGGT GTACGTGAGC ACGCTGCCGG CGCTGCCCGA GCGCAGCATC
GTGCGCTACC GCGTGCGCGT GAGCGAGAGC GGCGCCACCC GCCGGCTCAG CCCGCGCGAG
AGCGCGCCCT ACGAGTGGCA CGCCTACTAC GTGAGCCCGG TCATCGAGGC CCAGACCCGG
GTCTACGAGG TCTTCGTCGG CATCGACGAG TGGACGCGCA TGCACACCAA CATCGAGGCC
CGCCGCGCGG TCGGCTGCGG CATCAACCCG CTGTGGGACG AGCGCGTCCC GGCGGTGTTT
GTCCACGAGG GCAAGGTCTA CGACGTGCGC GCCCGCTACC AGGGCAGCCG CTACCAGCGC
ATGAACGGCG ACGTGGTCAA CACCGATAGC TGGGTCGGCC CCCTGCCCAG CCGCCCCGAT
CCCCTGCGCG CGCTGAGCTG GAGCCTCAAG TTCCCCCGCT ACGCGCGCTT CGAGGGCAAG
CGCACGGTCA CGCTCAACAA GCTCAAGCAG AGCTGCCCCG GCCTCACCGC CGGCGTCGGC
ATGCGCCTGT TCGAGGCGGT CGGCGTCCCG GCCGCCAACA CCCGCTACGC CCAGCTTCAC
GTCAACGGCA ACTACTACCG CTACACCATC GAGATCGAGC ACCCGGGCGA GGACATGCTC
GAGCGCATCC ACGAGGACCA GGCGGCCGCG GGCGAGCAGC CGGATCAGGT CGGCCACCTG
TTCAAGGCCA CCGGCTACGG CGGCAGCGAG GGCCCCTGGG GCCGCGCCCA CGGCGGCGTG
CTGCCCGAGA ACTGCGGCTA CACCCCGCGC GAGCGCTACA GCTACAGCTA TGAGCGCAAG
ACCTACGACT GGCTGGGCAT CGACGCGCTC GCCGATCTCA TCGAGGAGCT CGACGCGGTC
CGCGGCGACG TGCCCGCGCT GCGCGCCTAC CTCGAGCAGA ACTTCGATGT CGACGCCACG
CTCAGCTACC TGGCGGTCAT GAACTGGGCG GTGCCCTTCG ACGACCAGTT CCACAACTAC
TACATCTATC GCCAGTACGA GACCGGCCTG TGGCAGCTCA TGCCCTGGGA CCTCGACCGC
AACTTCGGCG AGTACACCGG CGACAGGGGA GAAGGCCCGG CCTCGAGCAT CTACATCGGC
CAGGACGGCG ACCCCGACAA CCGCGGCGGC GAGTGGAACT ACTTCAAAGA CGCGTTCCTG
CGCGCGTTCC GCAGCGAGTT CGAGCAGCGC CTGCGCGAGG TCAACGCCAA CGTGCTCACG
CCCGAGAACG TGAACCGCGT GGTCGACGAG ATCGCCGCCA GCATGCACGA GGACGAGGCC
GACGCGGCGC TCAGCAGCGT CGGCGGCGAG TGCGCGCTGC AGCCCGCGGT GCAGCAGTTC
AAGGACTTCG CGAGCGCGCG CCACGCGCAC GTGAGCAACC TGCTCGGCAC ACCGTAG
 
Protein sequence
MKLSRLCVPL GFLAVAIAGG GCTGTFGNPF EPPAPPANNP DGGSGDGDGG VVTGPPPPQI 
FLSEIMYHPI LEDDFIDRHE FIEIYNPNDE AVSLAGWSLG GGIDYTFPAG AEIAAGGYVV
AAKSRDDLLS LESYALDASA VYGDFDGALA NGDDTVLLFG PGGRVVDSVV YEDSAPWPEA
ADALGASRQF LPAEIGDNLE EHRYFGRSLE RISFDLPATE VANWDVSPLD GATPGRVNSA
SRETPLAIAL TVSASPARLG QAAGDEDPLI RAGDEVAVRM RLSDVGSIDG AEIEYFVDDL
NREDEELFTA TLLDDGAGND LSAGDRVYVS TLPALPERSI VRYRVRVSES GATRRLSPRE
SAPYEWHAYY VSPVIEAQTR VYEVFVGIDE WTRMHTNIEA RRAVGCGINP LWDERVPAVF
VHEGKVYDVR ARYQGSRYQR MNGDVVNTDS WVGPLPSRPD PLRALSWSLK FPRYARFEGK
RTVTLNKLKQ SCPGLTAGVG MRLFEAVGVP AANTRYAQLH VNGNYYRYTI EIEHPGEDML
ERIHEDQAAA GEQPDQVGHL FKATGYGGSE GPWGRAHGGV LPENCGYTPR ERYSYSYERK
TYDWLGIDAL ADLIEELDAV RGDVPALRAY LEQNFDVDAT LSYLAVMNWA VPFDDQFHNY
YIYRQYETGL WQLMPWDLDR NFGEYTGDRG EGPASSIYIG QDGDPDNRGG EWNYFKDAFL
RAFRSEFEQR LREVNANVLT PENVNRVVDE IAASMHEDEA DAALSSVGGE CALQPAVQQF
KDFASARHAH VSNLLGTP
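
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the database's own method. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs mainly in its permitted start codons), and it translates only the first and last lines of the gene sequence, both of which begin on a codon boundary because each full line holds 60 bases. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_block = "ATGAAGCTGT CGCGTCTGTG TGTTCCGTTG GGTTTTCTTG CCGTCGCGAT CGCGGGCGGC"
last_block = "AAGGACTTCG CGAGCGCGCG CCACGCGCAC GTGAGCAACC TGCTCGGCAC ACCGTAG"

print(translate(first_block))  # MKLSRLCVPLGFLAVAIAGG
print(translate(last_block))   # KDFASARHAHVSNLLGTP
```

The two outputs match the first 20 and last 18 residues of the protein sequence listed above.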