Gene Hoch_2274 details


Gene Information

Locus tag: Hoch_2274
Symbol:
ID: 8544660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3163861
End bp: 3165651
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 69%
IMG OID: 646386979
Product: oligopeptide transporter, OPT family
Protein accession: YP_003266710
Protein GI: 262195501
COG category: [S] Function unknown
COG ID: [COG1297] Predicted membrane protein
TIGRFAM ID: [TIGR00728] oligopeptide transporters, OPT superfamily
            [TIGR00733] putative oligopeptide transporter, OPT family
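
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3163861
end_bp = 3165651

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # assumed to include the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1791 596
```

Under these assumptions the result agrees with the listed gene length (1791 bp) and protein length (596 aa).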


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.247185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0874432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCAGCG AGCCATCATC CCCATCGTCG CATCCGCCGC CGGGCGGGAA ATCGTCCACC 
CGTCCGGGTG GCGGGCGTCC CATCCGCCGC GGCGCCTATC CCGAGCTGAC CGCGACCGCG
CTGGTCGTCG GCTATCTACT CGGCACGGTC ATCGCCGTCA GTATCGGCTA CGCGGCCCTG
ATCCTCGGCT TTAGCATCGA GGGCTCAGAG CTGGCCGCGA TCCTCGGCTT CGCCATCTTG
CGCGGCTTGC TGGGGCGCAA CAGCATCGTC GAGAACAACA TCAACCAGAC CGTGGCCAGC
GCGGTCAACG GCGCCTCCTC GGGCATGATG TTCTCGGTGC CGGCGCTGTT CATCCTCGAC
TACGCCGACG CCTTCAATCC GCTGCTGATG GTGCTCGGCT GCATCGTCGG CGCGATCCTG
GGCATCGCCT TCATCATCCC GCTGCGCAAG CAGATGATCG ACTTCAACCG GCTCACCTTC
CCCGGCGGTG TGGCCGTGGC CGCGGTGCTC AAGTCGCCGG GCGCCGGCAT GCGCAAGGCC
ATGCTCATGC TCGGCGCGGC GCTGCTCAGC GGCGTGGTGC ACGTGTTCGC GCAGGTGGCC
GAGTTCCACG ACGCCCCGGT GGGCGCGTGG CTGGGGCTGC CCGAGTACCT CAACATCACC
TTCTACGCCT CGCTGCTGAC CGTGGGCGTC GGCTTCCTGG CCGGCAAGGG CGGCGTGTTC
TTCATCGTCG GCGGCTACGC CTGCTACTTC GTGCTGGCGC CGATTCTGGC CTCGATGGGC
AAGATCCCGG GGCCCGAGGT GCTGGCACTG GCCGACGAGC CCGCGTCCGA GTGGCTGCGG
CTCACGCTGT TCCGGCCGCT GGGCATCGGC ATGCTCATCG GCGGCGCGCT CACCGGCATC
GTGCTGGCGC TGCCGCTCAT CGCCTCGGCC ATCGGCAGCA TGCGCGCGGC GGCCAAGACG
CGCTCGGAGA TGTCGGCGGA CGAGATGCCG ATCAAGCTCC TGGGCGTGGC CGTGGGCGGC
GCGGTGGTGG TGCTGCTGAT CATCGCGATT CTGTCCACCG AGGCCATGGG CATCGGCCGC
GGCCTGGTCA TGGGCGTGCT CGGCACGGCC TGGATCTGGA TCGCGGGCGT GATCCTCTCG
GAGTGCATCG GCCGCACCAA CTGGTCGCCG CTCTCGGGCA TGACGCTCAT CGGTATCACC
ATCCTCATCT TCGTCGCCAG CGGGCTGGGC GAGGCCGAGG CCGTGGTCGC CTCGGTCATG
GTCGGCGCGG CCATGTGCGT GGCCATGTCG CAGGCCACCG ACCTGATGAT GGATCTCAAG
ACCGGCTACC TGGTGGGCGC GACCCCGCGC AAGCAGCAGC TCGCCCAGTT CGCGGGCTCG
TGGCTCGGGC CCATCGTCAT CATGGTGCTG ATCTTCGTGC TGCACCGCGA CGCCGGGCTC
GGCAGCGAGC GCTTGCCGGC GCCGCAGGGC CAGGCTCTGG CCAGCATGAT CCAGGGCATC
CTGGGCGCCG ACGTGCCCCA GCATCAATAC CTGGCCGGTG CCGGCATCGG CGCCATCCTC
AGCGCCTCCG GGGTCGGCGG TCTGGGCGTG CTGGTCGGCC TCGGCTTCTA CCTGCCCTTC
AACATCGTTC TCACCTACAC CATCGGCACG CTGCTGCGTC TGGCCTCGGA CCGCTTCCTG
GGCAAGGCCT GGAGCGAAGA GGTCGGCATC CCGATGGCGG CCGGTCTGCT GGTCGGCGAG
GCCCTGGTCG GCGTCGGCGC GGCCGTGGTC CAGGTCGCCA TGGGCTCGTG A
 
Protein sequence
MSSEPSSPSS HPPPGGKSST RPGGGRPIRR GAYPELTATA LVVGYLLGTV IAVSIGYAAL 
ILGFSIEGSE LAAILGFAIL RGLLGRNSIV ENNINQTVAS AVNGASSGMM FSVPALFILD
YADAFNPLLM VLGCIVGAIL GIAFIIPLRK QMIDFNRLTF PGGVAVAAVL KSPGAGMRKA
MLMLGAALLS GVVHVFAQVA EFHDAPVGAW LGLPEYLNIT FYASLLTVGV GFLAGKGGVF
FIVGGYACYF VLAPILASMG KIPGPEVLAL ADEPASEWLR LTLFRPLGIG MLIGGALTGI
VLALPLIASA IGSMRAAAKT RSEMSADEMP IKLLGVAVGG AVVVLLIIAI LSTEAMGIGR
GLVMGVLGTA WIWIAGVILS ECIGRTNWSP LSGMTLIGIT ILIFVASGLG EAEAVVASVM
VGAAMCVAMS QATDLMMDLK TGYLVGATPR KQQLAQFAGS WLGPIVIMVL IFVLHRDAGL
GSERLPAPQG QALASMIQGI LGADVPQHQY LAGAGIGAIL SASGVGGLGV LVGLGFYLPF
NIVLTYTIGT LLRLASDRFL GKAWSEEVGI PMAAGLLVGE ALVGVGAAVV QVAMGS
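
The gene and protein sequences are printed in blocks of 10 characters, so they need to be joined before any comparison. As a minimal sketch, the code below joins the final line of the gene sequence and translates it with the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons). The helper names `join_blocks` and `translate` are hypothetical. The initial `GTG` of this gene is an alternative start codon that is read as methionine, which a plain codon lookup like this one does not model, so only the 3' end is checked here.

```python
from itertools import product

# Standard amino-acid assignments in NCBI codon order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(text.split())


def translate(dna: str) -> str:
    """Translate codon by codon; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Last line of the gene sequence (51 nt, in frame with the full 1791 nt gene).
tail = join_blocks("GCCCTGGTCG GCGTCGGCGC GGCCGTGGTC CAGGTCGCCA TGGGCTCGTG A")
print(translate(tail))  # ALVGVGAAVVQVAMGS*
```

The translated tail matches the last 16 residues of the protein sequence and ends in the `TGA` stop codon.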