Gene Hoch_3062 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3062 
Symbol 
ID8545450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4224645 
End bp4225937 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content66% 
IMG OID646387733 
ProductSpore coat protein CotH 
Protein accessionYP_003267461 
Protein GI262196252 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.977271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC TATATTACGT AGTTGCATTG GTATCGATTG GCCTATTCGC GTGCGGCTCC 
ACCTCGGGCG AGGATCCGGG GCCGGGCGGC ACGGTAGACG CTGGGCCGGG CGGCGGTGGC
GACGGCGGCG GCGGCGGCGG CGTATGCGGC GACGGTCAGC GCGATGTCTT CGAGGCCTGC
GACGGTGAGG ATCTGGTGGG CGCGAGCTGC CAGAGCCTGG GCTACACCCT GGGCGAGCTG
TCGTGCCGCG CCGATTGCAC CTTCGACTCG AGCGGTTGCT CGGGCAAACC CGAGGACGCC
GACCTCGACG GCGCGAGCGG CTGCGAGGGT ATCTTCAGCC CCGAGCAGGT GCTGGCCTAC
CAGGTCGAGG CCAGCAACCT CGAGGTCGAT GGGCCCGGGC GCTTCCGCTG CGGCGACGAG
CTCACGCTCG ACGTCGAGGT CGAGCGCAAA CGCGGCGGCG GCTTCAAGAT CGACTTCAAC
GAGTACCTCG ACGGCCAGAC CTACTTCGGC ATCAAGAAGC TGGTCTACGA CACCGGCGGC
ACCACCAGCG TGACCGATAT CGTCCGACAG TATCTCGCCT GGCGCATGAT GCACAGGGCC
GGCGTGATCG CCAGCCGCGC GGCGCTCGCC GAAGTGTACG TCAACGACAG TTATTTCGGC
CTGCTGGTCA ACATCGAGGC GGTCGACAAG CGGCTGCTCA AGAACCGCTT CGGCAACGAC
GACGGCTGGC TGTACAAGAA GAGCGGCGGC GAGGGCGATG GTCTCAAGAC CCACGAGAGC
GACGGCCTGG GCGACGCCAA TCCCTACGAC GACTATCTAT GCTTCTGGAA ATCGGGCAAC
GCCTGCCCGG TGCCGGACGA CGTCGCCACC GCCTTGCCCG AGCACCTCGC CGTCGAGCAG
TTTCTGCGCA TGGGCGTGGT CAACGCGCTC ATCGCCAACA CCGACGCGCC GCTGTTCAAG
GACAACAACT ATTACTTCTA CGATTGGTTC GGCGGCCGCC ACTATCTGCC CTGGGACCTC
GACACGGTGA TGAAGGACGA CGACTTTGCG ATGGTGCAGG AGACCGCCTT TGCCGAGGCG
CTGATGCCGC ATTTCGGCGA CCAGTACCAG GCCATCGCGG TCGAGCTGCT GGCCGGCCCG
CTGGCGCTGG CCGAGATTCA CGCCGAGATC GACCGCGTGG GTGAGGTTGC CGGAGCCGCG
ATCGCCAACG CCGGCGGCGA CCCGGACGCC ATCGTCAGCG AGCTCAAGAC CTGGTGGAGC
GAGCGGCACG CCCTGGTCAG CGCGCAGTTC TGA
 
Protein sequence
MKILYYVVAL VSIGLFACGS TSGEDPGPGG TVDAGPGGGG DGGGGGGVCG DGQRDVFEAC 
DGEDLVGASC QSLGYTLGEL SCRADCTFDS SGCSGKPEDA DLDGASGCEG IFSPEQVLAY
QVEASNLEVD GPGRFRCGDE LTLDVEVERK RGGGFKIDFN EYLDGQTYFG IKKLVYDTGG
TTSVTDIVRQ YLAWRMMHRA GVIASRAALA EVYVNDSYFG LLVNIEAVDK RLLKNRFGND
DGWLYKKSGG EGDGLKTHES DGLGDANPYD DYLCFWKSGN ACPVPDDVAT ALPEHLAVEQ
FLRMGVVNAL IANTDAPLFK DNNYYFYDWF GGRHYLPWDL DTVMKDDDFA MVQETAFAEA
LMPHFGDQYQ AIAVELLAGP LALAEIHAEI DRVGEVAGAA IANAGGDPDA IVSELKTWWS
ERHALVSAQF