Gene Hoch_3969 details


Gene Information

Locus tag: Hoch_3969
Symbol:
ID: 8546365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5469768
End bp: 5471180
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 72%
IMG OID: 646388641
Product: peptidase domain protein
Protein accession: YP_003268361
Protein GI: 262197152
COG category:
COG ID:
TIGRFAM ID:
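
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5469768
END_BP = 5471180
GENE_LENGTH_BP = 1413
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1413, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 470, matches Protein Length
```

Under these assumptions the reported gene length and protein length agree with the coordinates.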


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0213029
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.430474
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGCGGCC TGGGCGCGTG CCTGAGCCTG CTGGCGCTCG GCGCCTGCGA CAAGCGTACG 
GGCACCGCCA AGCACATCCC GCGCGATGAC GACGCGCTGC CGGTCGTGGT GGTCGACGAG
GTCGCGAGCG CGACCGCGAC CATCGACGAG GTCGAGCCCA ACAACGAGCG CGCGCAGGCC
ACAGAGGTCG CGCTCGGCGA GGCTGGCAAA GGCGTGCTCG ACGGCGAAGA AGACGTCGAT
TTCTATCGGG TCGCCGTGGC CTCGGCCGAT GTCTTGAGCG TGCGGCTGAG CGGCATCGAG
GCGGTCGACC TGATGCTCGA GCTGCAGGAC CAGGGCGGCG AGGTGCTGGC CCGCTCCGAC
CGCGGCCCGG CGGGCACGCT CGAGGGCATA GCCAACTTCT ACGTCGAGCC CGGGAGCTAC
TTCCTGGCGG TGCGCGAGTT CGTGCCCAAG CGGCGCAAGC GCAAAGGCGA GCCGCGCACC
GGTCCATCGC CCGCGTACAC GCTCGAGCTC GCGCGGCTGA GCGAGATCGC CGAGACCCAG
GAGCGCGAGC CCAACCAGGA CGTCGAGGGC GCGCGCGAGC TGCTGGTCGG CGACGAGGGC
AGCGGCTTTA TCGGCTGGGG CGGCGACGTC GATCTGTGGA AGCTGCCGGT CGAGGGCTTC
ACCGAGCAGT ACAGCCTGGA TCTCGACCTC ACCGGCGTGC CCGCGGCGAC GCTCACGCTG
GAATTGCTCG ACAGCGGCGG CTCGGTGATC CTCAAGCGCA CGGGCGCCGC TGACAGCGCG
CTGGCCGTGC GCAACCTGGT GCCGGAGTCC GCCGGCGACG ACGCCACCGG GCCGACGCAG
CACACCTACT ACGCGCGCAT TTCGGCGCGC CGCTCCAATC CCGTGGATCC CTACCTGCTG
CGCGTGAGCT CACACCTGCT CGACCTCAGC GACGAGCGCG AGCCCAACGA CGTGGCCGCG
CAGGCCTCGT CGCTGCTCGG GGTCGATCCC GGACAGACGG CCGACAGCCG CTCGGGGCGC
GTGACCGGCA CGCTCACGGT TGGCGATACC GACGTGTTCA GCCTGCCGGC GCAGGGCGAG
GCGGTCGCGC TCACGGTCGA GCTGGTGCCG CGCGAGGACC TCGACGCCAC GCTCACCGTG
CTGAGCAACG GCGAGACCCG GGCCATGGCC AACGCCAACG GCAAGGGCGG CAAGGAATAC
CTGGCCGATG TGCGCATCGA CGCCGGCGCG TCGGCGGTGG TGCAGATCTC GGGCGAGGGC
GCGCTGGGCG AGGGCGCCGG GTACCTGCTG AACTGGTCGC TCGCGACCGC ATTCGAGCCC
CCGCAAGACA CCCTCGACAG TACCTGGGGC GAGCTGCCGC CGGAGCTGGG CGGCGACGCG
GCTGGGTCGG GCGATGACTT TCGCGGAGAC TGA
 
Protein sequence
MCGLGACLSL LALGACDKRT GTAKHIPRDD DALPVVVVDE VASATATIDE VEPNNERAQA 
TEVALGEAGK GVLDGEEDVD FYRVAVASAD VLSVRLSGIE AVDLMLELQD QGGEVLARSD
RGPAGTLEGI ANFYVEPGSY FLAVREFVPK RRKRKGEPRT GPSPAYTLEL ARLSEIAETQ
EREPNQDVEG ARELLVGDEG SGFIGWGGDV DLWKLPVEGF TEQYSLDLDL TGVPAATLTL
ELLDSGGSVI LKRTGAADSA LAVRNLVPES AGDDATGPTQ HTYYARISAR RSNPVDPYLL
RVSSHLLDLS DEREPNDVAA QASSLLGVDP GQTADSRSGR VTGTLTVGDT DVFSLPAQGE
AVALTVELVP REDLDATLTV LSNGETRAMA NANGKGGKEY LADVRIDAGA SAVVQISGEG
ALGEGAGYLL NWSLATAFEP PQDTLDSTWG ELPPELGGDA AGSGDDFRGD