Gene Hoch_3788 details

Gene Information

Locus tag: Hoch_3788
Symbol:
ID: 8546181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5206768
End bp: 5207850
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 74%
IMG OID: 646388458
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_003268181
Protein GI: 262196972
COG category:
COG ID:
TIGRFAM ID:
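
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(5206768, 5207850)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Both results agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields of this record.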


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.10732
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.21805
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTTA CCAACACCTT TCGCACTGGC GGCCGCAGCC GCCCCCGTGG CGCCGCGTGT 
GCGCGCCTGC TCTGGGCGGC CGCGGCCGCC CTCTTCATCG CCCTGTGGGC GGCGCCGGCC
TCGGCGCAGC AGGCCGCGGC CGAGGCGCTC TTCAATCAGG GCCGGGCGCT GATGCAGGAG
CGCAACTACG CGGAGGCCTG TGAGAAGTTC GCGGCCAGCC ACGAGCTCGA CCCCAGCGTC
GGCGCCTTGC TCAACCTCGG CGATTGCCGC GAGAAGAACG GCCAGACCGC CACCGCCTGG
GCCACCTATC GCGAGGCCGT GTCGCTGTCG CGGCGCACCG GCGACCGTCG CCGCGAGCGC
TTTGCGCAGT CGCGGGCCGC GGCGCTCGAG GGCAAGCTCT CGTATCTGGT CATCGAGGTG
AGCGACGAGG CGCGGGTTCC CGGGCTCACG CTCACGCGCA GCGGCGAGCC CGTGCTCGAG
GCTGTGTGGG ACCAGCGTGT GCCCACCGAT CCCGGCTCCT ACGTCATCCG CGCCGAGGCC
CCCGGGTATC GCCCGGCCGA GGTCGAGGCC GAGGTCGGCG AGGGCGGCGG CGAGGCCCGG
GTAACGATCC CCGAGCTCGA GAAAGCGGCC GCGGGCGAGG TCACCGGGCC TTCGGCCGCG
GACGCGCCGG TGCGGGCTGC CGGTGACGGC GAGGTCGGCG TCAGCGCGAG CGGCGGCTCG
GAAGGCGGCA TGCCGACCGG CCGCAAGATC GCCATCGGCG TGGGCGCTGC GGGCGTGGTC
GCGTTGGCCG CGGGCGCGGT CTTTGGGCTC AACGCCAGCT CCAAGTGGGA CAAGGCCAAG
AGCCACTGCG TGGACGGCGA CTTCAGCAAC TGCGACGACC AGGGCGTGCA GCTCAGCAAG
GACGCGACGG TCCAGGCCAA CCTGTCCACG GTCTCGCTCA GCGTGGGCGT GTTGGCCGCG
GCCGGCGCCG CGGTGCTGTG GTTCACCAGC GCCCCCGACG ACACCGGCGC CGAGCGCAGC
GCCCGGTTCA CCCCGCTGCT CACCCCCGAC ACCGTCGGGG CCAGTCTCCT CCTCCACTTC
TAA
 
Protein sequence
MTVTNTFRTG GRSRPRGAAC ARLLWAAAAA LFIALWAAPA SAQQAAAEAL FNQGRALMQE 
RNYAEACEKF AASHELDPSV GALLNLGDCR EKNGQTATAW ATYREAVSLS RRTGDRRRER
FAQSRAAALE GKLSYLVIEV SDEARVPGLT LTRSGEPVLE AVWDQRVPTD PGSYVIRAEA
PGYRPAEVEA EVGEGGGEAR VTIPELEKAA AGEVTGPSAA DAPVRAAGDG EVGVSASGGS
EGGMPTGRKI AIGVGAAGVV ALAAGAVFGL NASSKWDKAK SHCVDGDFSN CDDQGVQLSK
DATVQANLST VSLSVGVLAA AGAAVLWFTS APDDTGAERS ARFTPLLTPD TVGASLLLHF
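
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, computes the GC fraction, and translates a nucleotide sequence. It uses the standard codon assignments, which translation table 11 shares with the standard code for elongation (the two differ in which start codons are allowed). The helper names are hypothetical and chosen only for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGACGGTTA CCAACACCTT TCGCACTGGC GGCCGCAGCC GCCCCCGTGG CGCCGCGTGT"
print(translate(first_line))  # MTVTNTFRTGGRSRPRGAAC
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` should reproduce the whole protein, and `gc_fraction` can be compared against the GC content reported for this gene.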