Gene Hoch_6620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6620 
Symbol 
ID8549037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9076135 
End bp9077259 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content64% 
IMG OID646391280 
ProductD-xylose transporter subunit XylF 
Protein accessionYP_003270979 
Protein GI262199770 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00486776 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCGTA TCTCGCGTAC GAAGTCGATA GCCCCCTTGT TCGTTGCCGC ACTCGCCATG 
GTCGCGGTAC CCGGATGTAA GAAAGACGAA GAGGCGCCCG CCACGGGTGA AGGCGCCGAG
GCCACCGGCG AAACGCCGGC CGAAGAGGGA CCGCTCAAAG TCGGTTTCCT GCTCAAGACC
ATGCAGGAAG AGCGCTATCA GCGCGACAAG AAGGCCTTCA TCGACAAGGC GCAATCGCTC
GGCGCCGAGG TGCTGTTCGA TTCGGCGAAC AACAACGAGC AAACCCAGCT CTCCAAATTC
GAAACCATGC TCGCGCAGGG TGCCAAGGTG ATCGTGCTGC AGCCGGTCAA CACCGGCACC
GCCGGCAACA TGGTCAAGAT GGCCAATGAG GAGGGCGTCC GCGTGGTCGG CTACGACTCG
ATGCTGGTCA ACGGCCCGCT CGACGTCCAG GTCATGCAGG ATAGCTGGGC CGTCGGCAAG
CTCCAGGGCG AGGCCATGGT CGAGTGGCTC AAGGCCAAGA ACGACGGCAA GGTCGAGGGC
AAGGTCGCCC TGATCAAGGG CCAGCCCGGC GACTCCAACG CCAACGCCAT GTCCGAGGGC
GCGCTGACCA TCATCAACGA GAACGAGGGC CTCGAGCTGG TCGCCGAGGA GTCGCACGAG
GGCTGGTCGT CCGACAAGGC CATGGCCACC GCCGAGAACG TGCTGACCAA GTACGAGAAC
GGCGTCGACG CCTTCATCGC CAACAACAGC GGCATGGCCC GCGGCGTCAT CGCGGCGCTG
CAGAATCAGG GCCTCGACGA CGCCACCAAG GTGTTCGTCG CCGGCTCCGA CGCCGACCTG
GTCAACATCC AGTACGTGGC CCAGGGCAAG CAGGCGGTCG AGATCTGGAA GAAGATCACG
CCGCTGGCCG AGACCGCGGC CGAGATCGCG GTGACCCTGG CCAAGAGCCC CGACAAACCC
GTGACCGAGC TGGTCGAGGC CGATCGCACC ATCAACAACG GCGCGGTCGA GGTGCCCACC
ATCGTCACGC CGGTGGTGCT CGTGACCAAG GATAACGTCG AGGACACCGT GGTCGCCGGC
GAGTTCTACA CCAAAGAGCA GGTCTTCGGC GCCGAGGCCG AGTAA
 
Protein sequence
MKRISRTKSI APLFVAALAM VAVPGCKKDE EAPATGEGAE ATGETPAEEG PLKVGFLLKT 
MQEERYQRDK KAFIDKAQSL GAEVLFDSAN NNEQTQLSKF ETMLAQGAKV IVLQPVNTGT
AGNMVKMANE EGVRVVGYDS MLVNGPLDVQ VMQDSWAVGK LQGEAMVEWL KAKNDGKVEG
KVALIKGQPG DSNANAMSEG ALTIINENEG LELVAEESHE GWSSDKAMAT AENVLTKYEN
GVDAFIANNS GMARGVIAAL QNQGLDDATK VFVAGSDADL VNIQYVAQGK QAVEIWKKIT
PLAETAAEIA VTLAKSPDKP VTELVEADRT INNGAVEVPT IVTPVVLVTK DNVEDTVVAG
EFYTKEQVFG AEAE