Gene Hoch_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1747 
Symbol 
ID8544129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2379265 
End bp2380656 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content71% 
IMG OID646386454 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003266189 
Protein GI262194980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.192343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGAGCG AGCCAGAATC CCCGCGCGCC GAGAGCGCGG CCGCGGCGGC CGCCGAGTCC 
GCCGCGGCCG CCGCGGGCTG GCGGCGCTTC GTGCTGCTGT GGCTGGGACA GTCGGTGTCC
CTCACGGGCT CGAGCCTGAC CTCGTTCGCG CTCGGGCTCT GGGTGTACCA GACCACCGGC
GCGGTGGCGC AGTTCGCGCT GATCATGCTG TGTAGCGCGC TGCCGCCGAT CCTGCTCACC
CCGGTCACCG GTCCGCTGAT CGACCGTCAC GACCGGCGGC GCGTCATCTT GCTCAGCGAC
TCCATCGCCG GCCTGGCCAC GCTGAGCATC GCGCTGTTGC TGTTCAGCGG CAAGCTCGCG
GTCTGGCACA TCTATCTCAA CGCCATCCTG GTGGCCGTGT GCGGCTCGTT TCAGGCGCCC
GCGTATGTGG CCTCGATACC GCGCCTGGTG CCCGACCAGC GCCTGTCGCG CGCCAACGGC
ATGGTGCAAG TCGGCCACGC CTTCGCGCAG CTCTTCACCC CGCTCGCGGC CACCTCACTG
CTGGCGCTGG CCGGTCTGCA CGCGGTGCTG CTCGTCGACG GCGTCACCTT CCTGTTCGCG
GTGACCACGC TCCTGCGCAT CCGCCTGCCG GGCCCCGCGA GCGCTCCGGC CGCGCACGCG
CAGCGCGACG ACTTGCGCAC CGCTCTGCGC GAGGGCCTGC GCTTCATCTG GCAGCACACC
GCGCTGCGCG CGCTGATTGC GTATCTCGCC GTCACCAACC TGGTCATCGG CATCGTCGAG
GTGCTGGTGA CGCCGCTGGT GCTGTCGCTG AGCACGGTGC AGATGCTCGG CGTCATCATG
ACCATCGGCG GCCTCGGCTT CCTGGCCGGC AGCTTGCTCG CCAGCCTGTG GGGCGGGCTG
CCGCAGCGCA TCCGCGTGGC CCTGGCCTTC GAGGGCCTGT GCGGCGTCAG CCTGGTCCTG
GCCGGTCTGG TCACCTGGGT GCCGGCGCTG CCCGTCATCG CGTTCTGCTT CTTCTTCGGC
GTCCCGCTGT TCAGCAGCAT CGCCACCACC CTGCTGCAGC GCCACGTCCC CGACAACCTG
CGTGGCCGGG TGTTCTCTCT GCTCGGTACC GTCACGCAGG CGTCGGCGCC GCTGGCGTAC
GCGGTTTCCG GACCGCTGGC CGATCTCGTC TTCGAGCCGG CCATGATGCC GGGCGGCGCA
CTGGCGGACA TCTTCGGCCC GGTCTTCGGC GTCGGTCCGG GCCGCGGTAT CGGCCTGATG
TTCGTCGTCT CGGGCGCTCT CACCATACTC ATCTGTGTGC TGGGCGCGCG CTATCGTCCG
CTGCTCCGGC TTGACACGCG CCCGGCCCAC GCCGACGCTC CCCCCTCCCA ACCACCTTCG
CGAGATCTAT GA
 
Protein sequence
MTSEPESPRA ESAAAAAAES AAAAAGWRRF VLLWLGQSVS LTGSSLTSFA LGLWVYQTTG 
AVAQFALIML CSALPPILLT PVTGPLIDRH DRRRVILLSD SIAGLATLSI ALLLFSGKLA
VWHIYLNAIL VAVCGSFQAP AYVASIPRLV PDQRLSRANG MVQVGHAFAQ LFTPLAATSL
LALAGLHAVL LVDGVTFLFA VTTLLRIRLP GPASAPAAHA QRDDLRTALR EGLRFIWQHT
ALRALIAYLA VTNLVIGIVE VLVTPLVLSL STVQMLGVIM TIGGLGFLAG SLLASLWGGL
PQRIRVALAF EGLCGVSLVL AGLVTWVPAL PVIAFCFFFG VPLFSSIATT LLQRHVPDNL
RGRVFSLLGT VTQASAPLAY AVSGPLADLV FEPAMMPGGA LADIFGPVFG VGPGRGIGLM
FVVSGALTIL ICVLGARYRP LLRLDTRPAH ADAPPSQPPS RDL