Gene Hoch_3839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3839 
Symbol 
ID8546232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5288174 
End bp5289697 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content66% 
IMG OID646388508 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_003268231 
Protein GI262197022 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.133481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCAGCA CAGGCGTCCG CGAGTGTATA AACAGCCCGC GACTCTCCCT GGTAACTGGC 
TCATCTATGT CCTTAGCGCG AAAAGCTGTA CATGGCGCCA TGTGGACGGT CGGTGCGAGC
CTCGGAGCTC GCGCCATCGG CCTCGTCGGC ACGGTGGTGA TCACGTATTT CCTCTCGCCC
ACCGTGGTCG CCGAGGTCAA CGCCGCCGCC ATTCTCGCCA TGTCGGCCAG TTTGCTCAGC
AATTTCGGCA TCGGCAACTA CTACATCGTC AAAGGTGATG ACCGCGAGGT CGCCTTCCAC
ATGACGGTGT ACAACCTTTT GCTCGGAGCC GTGGTTTTCG GCCTGGTATT GGCCTTTAAC
GAACCTCTGA GCGAGCTGCT GAATCTGCCC GCGATATCGG AGTTTGTCCC CGGTATGGTG
CTGGCCTGGT CCATCCGCCG CGTGGCCATG CAATCGCAAA AGGTGCTGGT GCGCGACATG
CGCTTCGGCC GCCTGAGCAT CGCGCGCGCG CTGGGTGAGA TTTCCTTCGT GCTCACCTCG
GTGGGTCTGG CCGCGCTCGA GTACGGCGGC ATGGCCATCG TCATCGGCAA CATCGTCCAG
TACTCGGTCG ACGGCGTCAT CACCATCACC TCGGTGCACT GGCGGACCTG GCTCGAGCCC
TGCAAATTGC GCTGGGAGCG CACTGTCGAC ATGTTCCGCT TCGGCTGGCC GCTGGGCGTC
AACGCCTTCG TCGGCTACGC CACGCATAGC TGGGACCGGC TGCTGTTCGC CAACCTGTTC
AACACCCACC TCATGGGTCT GTACAACTAT GCCTACCGCC TGGCCGAAAT ACCGGCCTCG
CAGGTGGGTG ACCAGATCAG CGACGTGCTG CTGCCGTCGA TGTCCAAGCT CGACGCCGAG
GGCCGCAAGC GCGCGCTCAT CCGCTCCACC GCCCTGCTGG GCGTGCTGCT GTTTCCGCTC
ACGGTCGGGC TGGCGGCCGT GGCCGAGCCG CTGATCACGC TGATCTTCGA CGAGGCCTGG
CACAGCACCG CGCCCATGGT GTCGGTGCTG GGCGCCTGCT TCGTGTTCGA GCCCATCGGC
AGCACCCTGG TCTCGTATCT GATGGCCCAG AGCCGCACGC GCACGCTCAT GATTCTGCAG
ATCATCAAGC TCGGCGCGCT GTTCGCCGGC ATGACCCTGC TGTCGACGCT GGGCCCGCTG
TGGGCGTGCG GCGGCGTCGG CGTCGGCTTC GCGGTCTACG GCTTGGTCAG CGCGTATCTG
TGCGTGCGCC GCGACAACAT CCCGGCCGGC AAGCTGCTGT CGGCCTTTGT CCAGCCGCTC
ACGGCCTGCG TGCCCATGGT CGGCGCGGTC CTGGGCGTGC GCTACGGGCT GCGCGCGGCC
GGCTTCGACA GCCCGGCGCT GTCGCTGGGA TGCGAGATCG TCGCGGGCGC CGCCGTGTAT
GTGCCCGCTG TGTTCCTGAC CGCGCCGGCG ACGGCGCGCG ACTTCCTCAG CCTGGTGCGC
AAGGCGCTCA AGCGCGGCGG CTGA
 
Protein sequence
MPSTGVRECI NSPRLSLVTG SSMSLARKAV HGAMWTVGAS LGARAIGLVG TVVITYFLSP 
TVVAEVNAAA ILAMSASLLS NFGIGNYYIV KGDDREVAFH MTVYNLLLGA VVFGLVLAFN
EPLSELLNLP AISEFVPGMV LAWSIRRVAM QSQKVLVRDM RFGRLSIARA LGEISFVLTS
VGLAALEYGG MAIVIGNIVQ YSVDGVITIT SVHWRTWLEP CKLRWERTVD MFRFGWPLGV
NAFVGYATHS WDRLLFANLF NTHLMGLYNY AYRLAEIPAS QVGDQISDVL LPSMSKLDAE
GRKRALIRST ALLGVLLFPL TVGLAAVAEP LITLIFDEAW HSTAPMVSVL GACFVFEPIG
STLVSYLMAQ SRTRTLMILQ IIKLGALFAG MTLLSTLGPL WACGGVGVGF AVYGLVSAYL
CVRRDNIPAG KLLSAFVQPL TACVPMVGAV LGVRYGLRAA GFDSPALSLG CEIVAGAAVY
VPAVFLTAPA TARDFLSLVR KALKRGG