Gene Hoch_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3304 
Symbol 
ID8545692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4557408 
End bp4558709 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID646387971 
ProductTetratricopeptide repeat protein 
Protein accessionYP_003267699 
Protein GI262196490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.253328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.400775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGCG TATGCCTCGC CCTGTCTCTC GCGCTGTGCA CGCTGGCCGC GTGCGCGGGA 
CCTCGTCAGC CGGCCGCCGC GCCCGGAGCC GCGGGCCCTG CGCAGGCGGG GGCGGGCCCG
GTGGACAGCG CCGACGCGGG CGCCGACGCG GGCGACGACA CGCTTGCTGC GGCGGGGCAG
GGCGAGGGCC AGGAGAACCA TAACGCTGCC GCTGTCCCCG TGTACGACCT CGAGGGCATG
CGCATCGAGG TCGCCGGGCG CACGGCCGAG GGCGAGCCCG AGCTGGTGTC CTACGACGCG
CAGTCGCTGC TCGACGAGGG CAACCAGGCC CTGGCCGACG AGCGCTTCGA CGCCGCCGCG
GCCCGCTACG AGCAGCTCCT GCGCATGTTC CCCGACTCGC GCCTGGTGCC CGACGCGCTC
TACAACCTGG GCCTGAGCTA CGAGCTGCGC GACCAGCCCG AGCGCGCCCT GGCCATGTAT
CGCCAGGTCG GCGATCTCGC GGCCGAGCTG CGCAGCGCCG CGGTGCTGGC CGAATACGCG
CGCTGGTCCG AGGCCCGGCG GGTGCTCGAG CGCGCGGCCG AGCGCGAGCA GCTCACCGCG
GCCGAGCGCA TCGAGGTGTT CGCGCGTCTG GGCTACGTGG CCCTATCGCA AGAGGACGAC
GCCGCCGCCG AGCTGGCCCT GGGCGAGGCG CTGGCGGACT TCGATGCGCT CACCGCGGCG
CCGGCCGATC TGTATTACCC GGCGATGGCG CGCTACTACC TGGCGCAGAT TCCCCACCGG
CAATTGCAAC GACTGACGCT GCGCCTGCCC GACGCGCAGC TTCAGCGCGA CCTGGAGAAT
AAATCCGAGC TGCTGGCGCT GGCCTACGAT CGCTATCGCG CCACGCTCGA CATCCATCAT
CTCTATTGGG CGACCGCGGC CGGATATCAA ATGTCGCAGA TTTATAAAGA GTTCTGGGAC
GACGTCATCG CCGTGCCGGT GCCGCCGCAG CTCGCGCCCG AGGCCGCGCA ATTCTATCGC
CGCGAGGTCC ATGAGCGGGT GCGGCCCATG CTGGAGAAGG CCCTCGACGG CCACCTGCGC
AACCTCGATC TCGCCGACGC CTACGGCCAG GCCACCGAGT GGAGCCGGGC CTCGCGGGTG
CGCGCCGATG AGATCGCGAG GCTGCTCATG CGCGAGCACG CCGGCGAGCT GGTGAGCCCG
CCCGGGGTGA CGCCGAATTC GCCTCGCGCT GCCGACGATG CGCAGGAGCC GGATTCACGC
TTTGCGCCCG AGCGCTATGT CCCCGATAGA TTCACGCTCT GA
 
Protein sequence
MSRVCLALSL ALCTLAACAG PRQPAAAPGA AGPAQAGAGP VDSADAGADA GDDTLAAAGQ 
GEGQENHNAA AVPVYDLEGM RIEVAGRTAE GEPELVSYDA QSLLDEGNQA LADERFDAAA
ARYEQLLRMF PDSRLVPDAL YNLGLSYELR DQPERALAMY RQVGDLAAEL RSAAVLAEYA
RWSEARRVLE RAAEREQLTA AERIEVFARL GYVALSQEDD AAAELALGEA LADFDALTAA
PADLYYPAMA RYYLAQIPHR QLQRLTLRLP DAQLQRDLEN KSELLALAYD RYRATLDIHH
LYWATAAGYQ MSQIYKEFWD DVIAVPVPPQ LAPEAAQFYR REVHERVRPM LEKALDGHLR
NLDLADAYGQ ATEWSRASRV RADEIARLLM REHAGELVSP PGVTPNSPRA ADDAQEPDSR
FAPERYVPDR FTL