Gene Dshi_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1999 
SymbolxylH 
ID5712994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2117998 
End bp2119305 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID641267923 
Productxylose transport system permease protein 
Protein accessionYP_001533339 
Protein GI159044545 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.222102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.247707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATG CATCCCCGAC CGAGAGCCGC CTGCCCAGCC GGGCCAAGCG CAGTTTCCTG 
CAGACCCTGG AGTTGGATAC CCGCCTTCTG GGCATGATCG GGGCCTTCGT TCTGGTGTGC
CTGGTCTTCA ACCTGCTGAC AGACGGGCGG TTCCTGACGG CGCGGAACAT CTTCAACCTG
TCGATCCAGA CCGTGAGCGT GGCGATCATG GCCACGGGCA TGGTGTTCAT CATCGTCACG
CGGCATATCG ACCTGGCGGT GGGGGCGTTG CTGGCCACCT GTTCGGCGGC CATGGCGATG
ACCCAGACCG CGATCCTGCC GCAGGTCTTC GGGCTGGAGT TGGGCCATCC GGCGATCCCG
TGGATCGCGA TGCTGGTGGG CCTGGTCACG GGCACGGTGA TCGGGGCGTT CCAGGGCTAC
CTGGTGGGGT ATCTGATGAT CCCGGCCTTC ATCGTGACCC TGGGCGGTCT GCTCGTGTGG
CGCAACGTGG CCTGGTACAT GACAAACGGG CAGACCATCG GGCCGCTCGA TCCGACCTTC
ATGACCCTTG GCGGCATCAA CGGGACGCTG GGCGCGACCC TGAGCTGGAT CGTGGGTCTC
GTCGCGGTGG TCGCGGCCTG CTGGGCGCTG TGGTCGGGGC GCAAGAACAA GATCGCCCAT
GACGCGCCGG TCAAGCCCGT CTGGGCGGAA CTGACGGTGA TGGGGGTGGT GTCGGTCGCG
ATCCTCGGGT TCGTGGCGAT CCTGAACAGC TACGAGGTGC CCACGGCGCG GCTGCGGCGG
CTCTTCGAGG CGCGCGGCGA GGTGATGCCC GAGGGCTATA CCGAGGTTTA CGGCATTCCC
TATTCCGTGC TGCTGCTGAT CGCGGTGGCG GTGGCGATGA CGGTGATCGC CAAGAAGACC
CGGTTCGGGC GCTACATCTT CGCCACTGGC GGCAACCCGG ACGCGGCCGA GTTGTCGGGG
ATCAACACGC GGATGCTGAC GGTGAAGGTA TTCGCGTTGA TGGGGGCGCT CTGTGCGATT
TCGGCCATCG TGGCCTCGGC GCGGCTGACA AACCATTCCA ACGATATCGG CACGCTCGAC
GAGCTGCGCG TGATCGCGGC GGCGGTGATC GGGGGCACGG CGCTGGCGGG CGGTATCGGC
ACGATCTATG GCGCGATCCT CGGTGCGCTG ATCATGCAGT CGCTGCAATC GGGCATGGCC
ATGGTCGGCG TCGATGCGCC GCTGCAGAAC ATCGTCGTGG GCACGGTTCT GGTGGCCGCG
GTTCTGATCG ACATCCTCTA TCGCAAGCGG ATGGGAGCGA AGTCATGA
 
Protein sequence
MSDASPTESR LPSRAKRSFL QTLELDTRLL GMIGAFVLVC LVFNLLTDGR FLTARNIFNL 
SIQTVSVAIM ATGMVFIIVT RHIDLAVGAL LATCSAAMAM TQTAILPQVF GLELGHPAIP
WIAMLVGLVT GTVIGAFQGY LVGYLMIPAF IVTLGGLLVW RNVAWYMTNG QTIGPLDPTF
MTLGGINGTL GATLSWIVGL VAVVAACWAL WSGRKNKIAH DAPVKPVWAE LTVMGVVSVA
ILGFVAILNS YEVPTARLRR LFEARGEVMP EGYTEVYGIP YSVLLLIAVA VAMTVIAKKT
RFGRYIFATG GNPDAAELSG INTRMLTVKV FALMGALCAI SAIVASARLT NHSNDIGTLD
ELRVIAAAVI GGTALAGGIG TIYGAILGAL IMQSLQSGMA MVGVDAPLQN IVVGTVLVAA
VLIDILYRKR MGAKS