Gene Hoch_4078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4078 
Symbol 
ID8546479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5604246 
End bp5605643 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content72% 
IMG OID646388754 
Productphospholipase D/Transphosphatidylase 
Protein accessionYP_003268469 
Protein GI262197260 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.59544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.179058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG GCGACGAGCG TTTCATTCCC CCTGTCCCCG CCGAGTCCGG CAGCTATCCG 
CTGCGCGAGG GCAACGCCGT GCGGCCCCTG GTCGACGGCG TGTCGGCGTT TACGCGCATC
TGCCAGGCGG TGGAAGCGGC CCGGCACAGC GTGTGGGTGA CGGTGGCCTT TCACGACGGC
GCCTTCCGCA TGCCCGAGCC CTTCGGCACG CTCATCGAGT TGCTCGATCG AGCGCGCGCG
CGCGGGGTGG ACGTGCGCGC GCTGTTCTGG CGTTCGTTCG AGCTCGAGGA CGACGAGCCC
GACACCCACT TCGCGGGCTC GCCGGCGCAG ATGGCGCGGC TCGAGGCCGC GGGGGCACGC
TTTCTGGCGC GCTGGGACCG GCTGCCCAAG CCGCTGTGCC ATCACCAGAA GAGCTGGCTC
ATCGATGCTG CCACGTCTGG CGAGGTGGCC TTTGTCGGCG GCATCAACCT CGACTGCGGC
TCGCTGGACG TGCCCGGGCA CGCGCCGCGT GAGGCCGGCA ATGTGCACGA TGTGTATCTC
GAGCTGCGCG GACCGGCGGC CACCGACGTG CACCACAACT TCGCGCAGCG CTGGAACGCG
GCCACCGAGC GCGCGCTCGC GGGCGGCGCC TGGCCGAGCG TGGAGGCCGC CGATGATCTC
GCGTTTCCCG AGGCGCTGAG CGCGCCCGCC GGCGAGGTGG CGGTGCAGAT CGCGCGCACG
CTCTCGCCCG GCTACCTCAG CGATGCCACG CCGGCCCCGC AGGCGCACGC CTACGCGGTC
GCCGAGGGCG AGTTTGGCAT CGAGGCCCAG TACGTGGCCG CGATCGACGC GGCCCAGGAG
GCCATCTACA TCGAGGACCA GCTCATCGCC TCGCCGCTGA TTCTCGGACA CCTGTACGGG
GCCATGCGGC GCGGCGTCGA GGTGGTATTT CTGGTGCCCG GCAAGCCGCA CCCCGAGTTC
GCCGAGGCCC GCGGCAACGA GGAGCACGCG CTGGCCTTTG CCTTCTTCGA CCGCCTGGCC
GACGAGGATC GCTTCACCTT GGCCGGCATC GCGTCGCACG CCGGCCCGGG CCAGTACTGC
GATGTCTACG TGCACGCCAA GATCATGCTG GTCGATGACG CCTGGGCGAC GATCGGCTCG
GCCAACGTCG CCGAGCGCTC GTTTCGCCAG GACACCGAGA TGAACGCCTC GCTGTGGCAC
GCGCCCACGG TGCGCGCGCT GCGCGAGCAG CTCCTGGGCG AGCACCTGGC GCGCGACACA
TCCGCGATGG ACGCGCGCGC GGCCCTGCGC TGCTTCCGCG AGGTCGCGCA GGCCAACCGC
GAGCGCCGCG CGCGCGGCGA GGCGCTCGAG GGGCTGGCCT TTGCCATCTC GCCGGCCGAG
TACGGCCTGT CGGTGTAG
 
Protein sequence
MSTGDERFIP PVPAESGSYP LREGNAVRPL VDGVSAFTRI CQAVEAARHS VWVTVAFHDG 
AFRMPEPFGT LIELLDRARA RGVDVRALFW RSFELEDDEP DTHFAGSPAQ MARLEAAGAR
FLARWDRLPK PLCHHQKSWL IDAATSGEVA FVGGINLDCG SLDVPGHAPR EAGNVHDVYL
ELRGPAATDV HHNFAQRWNA ATERALAGGA WPSVEAADDL AFPEALSAPA GEVAVQIART
LSPGYLSDAT PAPQAHAYAV AEGEFGIEAQ YVAAIDAAQE AIYIEDQLIA SPLILGHLYG
AMRRGVEVVF LVPGKPHPEF AEARGNEEHA LAFAFFDRLA DEDRFTLAGI ASHAGPGQYC
DVYVHAKIML VDDAWATIGS ANVAERSFRQ DTEMNASLWH APTVRALREQ LLGEHLARDT
SAMDARAALR CFREVAQANR ERRARGEALE GLAFAISPAE YGLSV