Gene Hoch_4836 details


Gene Information

Locus tag: Hoch_4836
Symbol:
ID: 8547243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6618673
End bp: 6619944
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 72%
IMG OID: 646389509
Product: major facilitator superfamily MFS_1
Protein accession: YP_003269218
Protein GI: 262198009
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
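
The start, end, gene length and protein length fields above are consistent with one another. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(6618673, 6619944)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```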


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0503623
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCGAAA TAAAACCCGA CTCGGAAGCA GCCGATTCGG AGCTCGGCGG CGCCGAACCG 
GCCGGTAAGT GGACCTACCT CGGCCTGCTG GCGGCGGTGG AGTTCTTGGC CATGACCGTG
TGGTTTTCGG CCACCGCGGT GGTGCCGCAG CTCGTCGAGG CGTGGTCGTT GGCGCCCGGC
CAGGTGAGCT GGCTGACCAT GAGCGTGCAG CTCGGATTCG TGTTCGGCGC CCTGGGCAGC
GCGTTTCTCA ACCTCCCCGA CCGGGTGGCG CTGCCGCGGC TGATCGGCGC CAGCGCGCTG
CTGGCCGCGC TCGCCAACGC CGCGATTCCG CTGCTCGAGC CGGTCCCCTC GGCCGTCATC
GCGCTGCGCT TTGTCACCGG CATGGCGCTC GCGGGCGTGT ATCCGCCGGG TATGAAGCTG
GTGGCGACCT GGAGCCTGCG CGCGCGCGGT CTCGGCATCG GTATCATGGT CGGCGCGCTC
ACCTTGGGTT CGGCGGCGCC GCATCTGCTC AACGCCTTTC CGCTGTTCGG CGACGGCGGC
ATGCCGCCTT GGCGCGCGGT GCTGGGGGCG ACCTCGGCGC TGGCCGCGCT GGCCGCGCTG
CTGTCCTTTG CCTGGCTGCG CGCGGGCCCG CTGCTGTCGG CGTCGGCGCC CTTTGACTGG
CGCTCGCTCA CCCAGGGGCT GCGCGACCGG CCCACGCGCC TGGCCAATTT CGGCTATCTC
GGCCACATGT GGGAGCTCTA CGCGATGTGG GCCTGGGTGC CGCTGTTCCT GTTGCAGCGC
TACCAGGCGG CCGGTCTGCC GACGGAGGCG GCGCGGCTGG CCGGCTTCGG CGTGGTCGCC
GTGGGCGCGC TCGGCTGTGT GGTGGCGGGC GTGATCGCCG ATCGCCTGGG ACGGACGCTG
GTGACCTCGC TGAGCATGAT CGTCTCGGGC GGCTGCGCGC TGGCTGTCGG ACTGGTGTAC
GATCACCCGC TGCTGCTCAC CGCGCTGTGT CTGCTGTGGG GGCTGGCCGT GGTCGCCGAC
AGCGCGCAGT TCAGCGCCGC GGTGTCCGAA CTGGCCGATC GCCGCTACGT GGGCACCGCG
CTCACGGTGC AGACAGCCAT GGGCTTCTTG CTCACGCTGA TCTCGATCCG CGCGGTGCCG
CCGCTGGCCG AGCTGGTGGG CTGGCGCTGG GTGTTCGCCA GCCTGGCGCT GGGGCCGCTC
TTCGGCACCG TGAGCATGCT GCGGTTGCGC GCGCTGCCGG CGGCCGCCGC CATGGCGTCC
GGACGTCGCT GA
 
Protein sequence
MSEIKPDSEA ADSELGGAEP AGKWTYLGLL AAVEFLAMTV WFSATAVVPQ LVEAWSLAPG 
QVSWLTMSVQ LGFVFGALGS AFLNLPDRVA LPRLIGASAL LAALANAAIP LLEPVPSAVI
ALRFVTGMAL AGVYPPGMKL VATWSLRARG LGIGIMVGAL TLGSAAPHLL NAFPLFGDGG
MPPWRAVLGA TSALAALAAL LSFAWLRAGP LLSASAPFDW RSLTQGLRDR PTRLANFGYL
GHMWELYAMW AWVPLFLLQR YQAAGLPTEA ARLAGFGVVA VGALGCVVAG VIADRLGRTL
VTSLSMIVSG GCALAVGLVY DHPLLLTALC LLWGLAVVAD SAQFSAAVSE LADRRYVGTA
LTVQTAMGFL LTLISIRAVP PLAELVGWRW VFASLALGPL FGTVSMLRLR ALPAAAAMAS
GRR
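
The gene sequence begins with TTG while the protein begins with M, which is what one would expect under translation table 11, where TTG is one of the permitted start codons. The sketch below shows one way to strip the 10-base grouping from a sequence, compute its GC fraction, and translate it. It is a hedged illustration, not the method used to build this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino acid ordering, and ambiguous bases are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11; an initiating codon is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "TTGTCCGAAA TAAAACCCGA CTCGGAAGCA GCCGATTCGG AGCTCGGCGG CGCCGAACCG"
print(translate(first_line))  # MSEIKPDSEAADSELGGAEP
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be checked against the values listed in this record.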