Gene Hoch_1561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1561 
Symbol 
ID8543943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2128771 
End bp2130033 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content74% 
IMG OID646386270 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003266005 
Protein GI262194796 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.224234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCGA GCATCGGAAT CGCCAACACC GTCGAGCGCA GCGAGGTGCA CGCCGGCCAT 
CGCGCCCCCG CGACTGCCGC CGCGGCGCCC GCGCCCGACA CGGGACCGCC GGCGTCGTTG
GTGGCGCTGC TGGCGGCCAG CGCCGGTTTT GCGGTGGCCG CGCTGTACTA CAGCCAGCCG
ATCCTGGGCG TGCTGGGCGC GGACCTGGGC GCGTCGGCGA GCACGATGGG CCTCTTGCCG
ACGCTCACGC AGCTCGGCTA CGCGCTCGGC ATCTTGTTTC TGGTGCCGCT CGGCGACCGC
TGGGACCGCC GCCGCGTCAT CGTGGCCAAG GCACTATTAC TAATGATGGC CCTGGTCGGC
GCCGCGCTCG CGCCTTCGAC CGCGTGGCTG CTGGCGGCGA GTCTGGCCAT CGGTCTGTGC
GCGACCCTGG CCCAGGACAT CGTGCCGGCG GCGGCGACGC TGGCCCCGGG CGCCAGCCGC
GGCCGCGTGG TGGGCGCGAC CATGACCGGC CTGCTGCTGG GGATTCTGCT GTCGCGCGTG
GTCGGCGGCG TGGTCGCCGA GGCCTTTGGC TGGCGCGTGA TGTTCGCGGG CGCGGCGCTG
AGCATCGCCG CGGTGGCGCT GGCCTCGTGG CTGTGGCTGC CGCGCTTCGC GCCGACTACG
ACGCTCGGCT ATCGCGCGCT GCTGGCCTCG TTGTTGGCGC TGTGGCGGCG CTATCCGGCG
CTGCGCCGGG CGACCGCGGC GCAGGCGCTC CTGGCCGTGG GCTTCAGCGC GTTCTGGTCG
ACGCTGGCCA TCATGCTGCA CGAGCCGCCG TTTGAGCTCG GCAGCGCGGC TGCGGGCGCG
TTTGGCATCG CGGGGGCGGC CGGTGCCCTG GCCGCGCCGT TGGCCGGACG CCTGGCCGAT
CGCCGGGGAC CGCGCTGGGT CGCGCAGTCC GGCGCGCTCA TCGCCTGCGT GTCGTTTGCG
GCCATGTTGC TGGCGCCGCT GGTGTCGCCG CAGATGCAAC TCGGCCTGCT CATGGCGGCC
GCCCTGGGCT TTGATCTCGG CATCCAATCG GCCCTCATCG CCCATCAGAC CATTGTCTAT
GATCTCGAGT CCGGGGCCCG CAGCCGCCTC AACGCCGTGC TCTTCGTCGG CATGTTCGCG
GGCATGGCGG CCGGCGCCGC GCTCGGCGGT GTGGCCCTGG CCCGCTGGGG CTGGCAGGCG
GTCGTCGCCC TGGCCGCGCT CACGGCCGGC GGCGCCTACG CGCTGCGCCG CTGGGCGCGC
TGA
 
Protein sequence
MQASIGIANT VERSEVHAGH RAPATAAAAP APDTGPPASL VALLAASAGF AVAALYYSQP 
ILGVLGADLG ASASTMGLLP TLTQLGYALG ILFLVPLGDR WDRRRVIVAK ALLLMMALVG
AALAPSTAWL LAASLAIGLC ATLAQDIVPA AATLAPGASR GRVVGATMTG LLLGILLSRV
VGGVVAEAFG WRVMFAGAAL SIAAVALASW LWLPRFAPTT TLGYRALLAS LLALWRRYPA
LRRATAAQAL LAVGFSAFWS TLAIMLHEPP FELGSAAAGA FGIAGAAGAL AAPLAGRLAD
RRGPRWVAQS GALIACVSFA AMLLAPLVSP QMQLGLLMAA ALGFDLGIQS ALIAHQTIVY
DLESGARSRL NAVLFVGMFA GMAAGAALGG VALARWGWQA VVALAALTAG GAYALRRWAR