Gene Hoch_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0033 
Symbol 
ID8542403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp48302 
End bp49774 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content63% 
IMG OID646384821 
ProductSpore coat protein CotH 
Protein accessionYP_003264568 
Protein GI262193359 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACG CGTTGGGCGT TCTGGGCTGG TTGCCATTCT CGCTGGGCGT AGGCGCCCTG 
GGAGTCGCGG CCTTGGCTAC TGGTTGCACG GTGGTGGAAG ATCCGACCTT CGTCGGTGGA
GAGGCTGTGG GCGGGGAGGA AGCAGCCGCT ATCGGTGGCA GCGACAGCGT GACGCTTGCG
GCCAACGAGC CGCTGCCGCC CGACGCGGAG GGATGCCCCG GACTCTATGC CCAGGACCTC
CTGCCCGAAT TTCATCTCAC GATCAGCGAA GCCGTGATGT CCACCCTGTA CGAAGACTGG
GTGCACGGTC GCGAGCGCAA CTCGCTCGAC GACGACGACA CGCCGTATCA CCCGGTGTCC
GAGTTCCGCT ACGGCGACAT CGCCGTGCAG AACGCGATGA TCCGCCTGCG CGGCAACCCC
AACTTCTGGC TCGAACAGAA CAAGATGCAG TTCCAGGTCT CGTTCGACGA GATCAACAAG
AACGGGCGCT TCATGGGCCA GCGCAAGATG CTCTTCGATG CGGCCACCTT CAACCGCCAC
TTCATGCGCG ACCGCTTGTC GCTGTGGATC TTCCGCCAGG CCGGCGTGCC CGCGCCCTGC
GCCAACAACG CCCGCCTGTA CATCAACGAC GAGTACTACG GACTGTTCAC CAGCATCGAG
AAGCTCGACA ACGTGTTTCT CAAGCGCGTG TTTCCGAACA ACGACCACGG CGATCTGTTC
GAGCGCCGCG GCTACGAGCA GAAGACCAAC GAGGACACCT CGGATGACAC GCGTCTCGAC
CAGATGCGCA CTTACGCGGA CGCCTTCCCC AGCTCGACCA ATCCCGACGC GGGCGACGGC
CTGAATCCCA ACCTCGAGGG ACTCGCGGCG TACATGGACT TCGAGCAGGT GCTGCGCCTG
TACGCGGTGG ACGCGATCTT GCCCAACTCC GACGGGCCGT GGGCGGGCGG CTTGAACTAC
TACTTCTACG ACCACCCGGG CCAGAACAAG TTCTTCATCC TGCCCTGGGA CCTCGACAAT
ACCTTCACGC GGCTCACGCC CGACGTGCAG CCGTACACCT ACAGCAAGCC CAGCCCGTAT
CACGGCCGCC CGTACTTCAA CGCCATCCTC AGCGATGACG TCTGGCTGGA TCGCTATGTC
GAGATCATCG AGGAGGTGGT CAACGAGATC TACAACACCG ACACGCTGTG GGCGCTGACC
GGCACCAACG CGACGATTCC GCTCGAGAAT CTCAGCGACC AGGACCGCCA GCTTCTGGGC
GTGTGGTCGG GTCAGATTCG CGACTCGGCC TTTGCCGACA TCAACAAGCC GTACACCAAC
CGGCGCATGG AAGACCGCCG CGCGTTCTTC GCCAACTTCC TGGCGGAGCG CGAGACCTGC
CTGCGGCAGT GGCTCGAGTG TGTGCACGGC AGCGACACGA CCCGCCAGGG CGTCGAGACC
TGCGTCTGCG ATGTCGATCT GCTGAGCGAG TAG
 
Protein sequence
MKYALGVLGW LPFSLGVGAL GVAALATGCT VVEDPTFVGG EAVGGEEAAA IGGSDSVTLA 
ANEPLPPDAE GCPGLYAQDL LPEFHLTISE AVMSTLYEDW VHGRERNSLD DDDTPYHPVS
EFRYGDIAVQ NAMIRLRGNP NFWLEQNKMQ FQVSFDEINK NGRFMGQRKM LFDAATFNRH
FMRDRLSLWI FRQAGVPAPC ANNARLYIND EYYGLFTSIE KLDNVFLKRV FPNNDHGDLF
ERRGYEQKTN EDTSDDTRLD QMRTYADAFP SSTNPDAGDG LNPNLEGLAA YMDFEQVLRL
YAVDAILPNS DGPWAGGLNY YFYDHPGQNK FFILPWDLDN TFTRLTPDVQ PYTYSKPSPY
HGRPYFNAIL SDDVWLDRYV EIIEEVVNEI YNTDTLWALT GTNATIPLEN LSDQDRQLLG
VWSGQIRDSA FADINKPYTN RRMEDRRAFF ANFLAERETC LRQWLECVHG SDTTRQGVET
CVCDVDLLSE