Gene Hoch_4958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4958 
Symbol 
ID8547366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6833992 
End bp6835065 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content69% 
IMG OID646389632 
ProductPEGA domain protein 
Protein accessionYP_003269340 
Protein GI262198131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCACC GTCGTTCCGC GCACATCGCC GTCCTGGTTG GCATTCTCTG CGCGCTGTCG 
TCCGCCCGCG TCAGCGCTCA AGCGTCGCCC GAGCAATCGG CGCAGGGAGT GGCTCTCATC
CGCATTGAGA TTCAGGGCGA TCATCCGCCC GGGCTCGCGG ACGAGCTCGA CGCCGCGGTG
ACCAAGACGC TCGGCGAGAT CGGCTTCTCG GGGATTTCGT ACAGCGAGGT CCGCGAGCGC
ATCCAGAGCC GGCTGGGCGC CATGCCCGAG CTGGCCAACT GCCAGAAGCC CGCGTGTCTG
CGCGAGATGC CGGAGCTGGT CGGGACCAAT CTATTTCTGC GCTTGAAGGT CGAGGCCAGC
AGCGCCATCT ACTCCTTCGA CATCGAGTTT CTCACCGCCG AGGATGGCGG GGTCAAGGAG
CGCAGCTCGG GCTCGTGCCC GGTGTGCACG GCAGACGAGT TCATCGACCG CGTGTCCAGC
AACGTGCGTT CGCTGCTCGA GCCGTTTCAG CCCATGCCGG TGATGATCGC GTCGGAGCCC
GCGGGCGCCA CCTTGATCAT CAATGGCCGC GAGGTGGGGC CGGCGCCGTT CTCGGGGCAG
CTCGCCCCGG GGCCGCATCA GGTGCGCGCC TCGCATCCCG GCTATGTCGA TGCGATGACG
ACCGTCGAGG TCGTGCCCGG GCAGGCCACG GTGATTCAAA ATTTCGATGT GCAGCTCACC
CCGGACGCGA GCGGCGGTGA TGGCGGTGAC GGCGGCGGCA GTGGCTTTGG CATGTGGAAG
TGGCCGACTG CGGCCGCAGC AGCCGTCAGC ATCGGCCTGG GCGCGACCTG GGTCGCGGTC
CACGATAGCG AGACCAACTG CGGCCCGGGG CAAAGCACCT GCCCCGAGGT GTACGACACC
CTCGGGCCGG GCCTGGTCGC GCTCGGCGTC GGAGCCGGCC TGGGCGCGTT GGCGGTGTGG
ATGTTCATCG ACGACAGCGG CAGCGACGCC AGCACCAGCG ACGCGACCAG CTTCGTCCCG
GGCGTGCATC CGGTCGAGGG CGGCGCCGTC GGCTCGCTGC GCCTGCGCTT CTGA
 
Protein sequence
MFHRRSAHIA VLVGILCALS SARVSAQASP EQSAQGVALI RIEIQGDHPP GLADELDAAV 
TKTLGEIGFS GISYSEVRER IQSRLGAMPE LANCQKPACL REMPELVGTN LFLRLKVEAS
SAIYSFDIEF LTAEDGGVKE RSSGSCPVCT ADEFIDRVSS NVRSLLEPFQ PMPVMIASEP
AGATLIINGR EVGPAPFSGQ LAPGPHQVRA SHPGYVDAMT TVEVVPGQAT VIQNFDVQLT
PDASGGDGGD GGGSGFGMWK WPTAAAAAVS IGLGATWVAV HDSETNCGPG QSTCPEVYDT
LGPGLVALGV GAGLGALAVW MFIDDSGSDA STSDATSFVP GVHPVEGGAV GSLRLRF