Gene Hoch_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3656 
Symbol 
ID8546046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5031589 
End bp5032794 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content65% 
IMG OID646388325 
Producttail sheath protein 
Protein accessionYP_003268051 
Protein GI262196842 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0120421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA GTTATCTCTC ACCCGGTGTC TATGTAGAAG AGGTCGACCG CGGCTCGAAA 
CCGATCGAAG CTGTCGGAAC CAACACCGTC GGCTTCATCG GCGAGTCTGC CAAAGGCCCC
ACGAACGAGG CGGTCCTGGT CACGAACTGG TCGCAGTTCG TCAAGACCTT CGGCGACTTC
AAGGACTGCT CGATGCACCT GGCGCACGCG GTCTACGGCT TCTTCAATAA CGGCGGTTCG
CGTTGCTTCG TTGTCAACTG CGGCGCGCCG GACGAAGAAG AAGTGAGCGC CACGGCCAGT
GCCGGAGACA AGAAGGACGA CAAGCCCAAG ACCGCCTCGC GCGGTGACCG CGACGGCCGC
TTCATCGGTC GCGACAACGG TCCGGGTCAG CGTTCGGGTC TCAAGTGCTT CGAGGAGATC
GACGAGATCG CGATCCTCTG CGCCCCCGGT CAGGTTTCGC CCGCCGTCCA AGACGCCGTG
CTCACGCACT GCGAGACCCG CAAGGACCGC TTCGCGATCC TCGACTCGCC CGAGACCATC
TCGGGTGGCG TCGACCGCCT GCCCAAGCCG CGCGACTCGA AGTACGGCGC CTACTACTTC
CCGTGGATCC AGGTCTACGA CCCGGACAAG GGCAACCTCT TCGTGCCGCC CTCGGGTCAC
ATCGCCGGCG TCTACTCGCG CGTGGACAAC GAGCGCGGCG TGCACAAAGC TCCGGCCAAC
GAGCTGGTGC GCGGTGCCCT GGGCCTCAAG TACAACGTGT CGAAGGGCGA GCAGGACCTG
CTCAACCCCA AAGGCATCAA CGCCATCCGC ATGATGAACG GCGGCATTCG CATCTGGGGC
GCGCGTACGC TGTCCTCGGA TCCGTCGTGG AAGTACATCA ACGTCCGGCG TCTGTTCATT
ATGGTGGAGT CGTCGATCGA GCGCGCGACC CAGTGGGTCG TATTCGAGCC CAACGATCAT
CGCCTCTGGA AGCGCGTGGT GCGGACGATC TCGTCGTTCC TCACCCTGCT GTGGCGCAAC
GGCGCGCTCA TGGGCACCTC GCCCGAGCAG GCCTTCTTCG TCAAGTGTGA CGACGAGACC
AACCCGCCCG AGGTCGTGGA CGCCGGCCAG CTGGTCGTCG AGATCGGCCT CGCCCCGGTG
AAACCGGCCG AGTTCGTCAT CTTCCGCATC GGCCAGATGC CCTCTGGCGG CGGCGTCGAA
GAGTAG
 
Protein sequence
MATSYLSPGV YVEEVDRGSK PIEAVGTNTV GFIGESAKGP TNEAVLVTNW SQFVKTFGDF 
KDCSMHLAHA VYGFFNNGGS RCFVVNCGAP DEEEVSATAS AGDKKDDKPK TASRGDRDGR
FIGRDNGPGQ RSGLKCFEEI DEIAILCAPG QVSPAVQDAV LTHCETRKDR FAILDSPETI
SGGVDRLPKP RDSKYGAYYF PWIQVYDPDK GNLFVPPSGH IAGVYSRVDN ERGVHKAPAN
ELVRGALGLK YNVSKGEQDL LNPKGINAIR MMNGGIRIWG ARTLSSDPSW KYINVRRLFI
MVESSIERAT QWVVFEPNDH RLWKRVVRTI SSFLTLLWRN GALMGTSPEQ AFFVKCDDET
NPPEVVDAGQ LVVEIGLAPV KPAEFVIFRI GQMPSGGGVE E