Gene Hoch_3578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3578 
Symbol 
ID8545968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4930355 
End bp4931731 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content76% 
IMG OID646388247 
Productcytochrome c family protein 
Protein accessionYP_003267973 
Protein GI262196764 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.628183 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0456283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATGG GAACCACTCG CAAGCGCGAT ACAGCGCGCG CGCTGGCGCC GCTCTTTGCC 
TTCGCGATCG CGCTGGGGCT GGCGGCCGCG GCCCTGGCGC AGCCCGAATC GCAGCCCGCG
GGCGAGCCCG CGCGCGATCC GGCCCAGCCG GCGCCAGCGC TGTTTCCCGC GCCGCTGCCG
CCGGCGCTCG CGCCCGACGC GGTGCCGCCG TCAGACCCGG CCGCAACGCC GGCAGCGGAC
GCGGCCGGGG ACGGCGTCTC GCCCGTGATC TACCCGCCGC AGGAGCTGCC GCTCTACTTC
TCGCACGCGG CTCATCTGCG TCTGCCCGAG GCGCCCGCGT GCCTCGACTG TCACCCGCGC
GCCGCCTCCT CGATGTCGTC CATCGACGAT CTGATGCCGC GCGAGGCCGC GTGCCGGCCG
TGCCACGCCA TCGATCGCGA CCAGCCCACC AAGGCCGTGG CCGCGGGCGC GCCGGCCGCG
CGCTGCGACG CCTGCCATCC GGGCTACGCG CCCGGCGATG TCGCGGTCGC GCGGCTGCGG
GTGCCGGTGC CCAACCTCAA GTTCCCGCAC CGCGTGCACG TCGCCCGCGG CCAGGCCTGC
ACCGGCTGCC ACGGCGACCT GGCGGCCGAG GGCGTGGCGC TGGCGACCCG GGCGCAGCTC
CCCGCGATGC GCTCGTGCCT GGCCTGTCAC GACGACCGCC AGGCCGCGCG CGCGTGCACC
ACCTGCCACC TGGCCGACGC CGGCGGCTTC GTGCGCACCC GCTTCGCCGA GGGCGCGCTC
ATGCCCTCGG GCACGCTGCG CGGCGCCGCC CACGATCTGA GCTTCCGCAG CGCGCACGCC
GGGGCCTCGC GCAGCGACCC CGACTACTGC GCGAGCTGCC ACCAGCAGTC GTTCTGCGTC
GATTGCCACG ACGGCGCGTT CAAGCCCATG GACTTCCACG GCGGCAACTA CGTGGCCCTG
CACGCCATCG ACGCGCGCCG CGACGCCAAC GAGTGCAGCG CGTGCCACCG CGCCCAGAGC
TTCTGCACCG GCTGCCACAG CCGCTCGGGC GTGAGCGCCG ACGGCCGCGG CTCCGAGTTC
GACGCCGAGC AGCCCGGCCG CGGCTTTCAT CCGCCCGGCT GGTCGCGGCC CGGCCTGGTC
GGCCCCGGCC ACCACGGCTT CGCCGCCCGG CGCAACATCG AGCAGTGCGC GAGCTGCCAC
CGCGAAGAGG ACTGCGTGGC CTGCCACAGC GGCAGCCCGA TGGGCGGGAT CTTCGGCGTC
AGCCCGCACC CGCCGGGCTG GGCCACGAGC CGGCGCTGCC GCGTCCTGCT GTCCAAAAAC
CCGCGCGTGT GCCTGCGCTG CCACATCGAT CGCGCCGAGC TGCGCTGCGC GCCCTGA
 
Protein sequence
MAMGTTRKRD TARALAPLFA FAIALGLAAA ALAQPESQPA GEPARDPAQP APALFPAPLP 
PALAPDAVPP SDPAATPAAD AAGDGVSPVI YPPQELPLYF SHAAHLRLPE APACLDCHPR
AASSMSSIDD LMPREAACRP CHAIDRDQPT KAVAAGAPAA RCDACHPGYA PGDVAVARLR
VPVPNLKFPH RVHVARGQAC TGCHGDLAAE GVALATRAQL PAMRSCLACH DDRQAARACT
TCHLADAGGF VRTRFAEGAL MPSGTLRGAA HDLSFRSAHA GASRSDPDYC ASCHQQSFCV
DCHDGAFKPM DFHGGNYVAL HAIDARRDAN ECSACHRAQS FCTGCHSRSG VSADGRGSEF
DAEQPGRGFH PPGWSRPGLV GPGHHGFAAR RNIEQCASCH REEDCVACHS GSPMGGIFGV
SPHPPGWATS RRCRVLLSKN PRVCLRCHID RAELRCAP