Gene Ndas_1386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1386 
Symbol 
ID9245236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1702544 
End bp1703929 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content77% 
IMG OID 
Productxylulokinase 
Protein accessionYP_003679324 
Protein GI297560350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.188757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0480029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCTGG TCGCCGGAAT CGACAGCTCG ACCCAGTCCT GCAAGGTCGT GGTCTGCGAC 
GCCGACAGCG GCGCGGTGGT GCGCGAGGCC CGCGCCCCGC ACCCCGACGG GACCGAGGTC
CACCCCGACG CCTGGTGGTC GGCCCTGGAA CAGGCCTCCT CCGGCCTGCT CGACGACGTG
GCCGCCGTCT CCGTCGCCGG ACAGCAGCAC GGCATGGTCG CCGTGGACGA GGTCGGCGCG
GTGGTCCGCC CGGCGCTGCT GTGGAACGAC ACCCGTTCGT CCGGGACGGC CCTGGACCTG
ATCGAGGAGC TGGGCGGCCC CGCCGAGTGG GCCAAGGCCG TGGACAACGT GCCCAACGCC
AGCCTGACCG TGAGCAAGCT GCGCTGGCTG GCCCGGCACG AGCCGGAGCA CGCCGACCGC
ACGACCGCGG TGATGCTCCC CCACGACTGG CTCACCTGGC GGCTGACCGG AGCCGGATCC
GAGCCCACCA CCGACCGCGG CGACGCCTCC GGCACCGGTT ACTGGTCGGC GACCGACGAC
GCCTACCGCC CCGACCTGCT GGCGCGGGCC TTCGGACGCG AGATCCGGGT GCCGCGCGTG
GCCGGACCGG CCGAGGCCGT GGGGCGCACC CCCTCGGGCG CGCTGGTGGC CCCGGGCACC
GGGGACAACA TGGGCAGCGC CCTGGGCCTG GGACTGCGCC CGGGGGACGC GGTGGTCTCC
CTGGGCACCA GCGGCACCGT GTTCGCCGTG AGCGAGGGCC CCACCCAGGA CCCCAGCGGC
ATCATCTGCG GGTACGCCGA CGCGACGGGC CGGTACCTGC CGCTGGTGTG CACCCTCAAC
GCGGCCCGGG TGCTCGGCGC CACCGCCGCG ATGCTGGGCG TGGACCTGGC GGGCCTGGAC
GAGCTGGCAC TGGCCGCCGA GCCGGGCGCC GAGGGCGTCG TGCTGCTGCC CTACCTGGAC
GGCGAGCGCA CGCCGGTACT GCCCGACGCC GCGGGTTCGC TGCACGGGCT GCGCCGGTCC
AACATGCGCC CGGAGAACAT CGCCCGCGCC GCGGTGGAGG GCATGCTGTG CGGGCTGGCC
GACGGGCTGT CCGCGATGAC CGACACCGGG ATCCCGGTCA GCCGGGTGCT GCTCGTGGGC
GGGGCGGCCC GGTCGGCCGC CGTGCGCGCG GTCGCGCCGA CCGTCCTGGG CGCGCGGGTG
GTCGTGCCGA GGCCCGCCGA GTACGTGGCC GTGGGCGCCG CCCGGCAGGC GGCCTGGACG
CTGGCGGGGA CCGCCGAGGC GCCGAACTGG GACCCGGGCG AGCCGGAGGC CGCCGCGGAG
CCCGTGGACG CGCCGCACGT GCGCGAGCGC TACGCCGAGG CGCGCCGTTC CGCCCACGGC
CTCTAG
 
Protein sequence
MPLVAGIDSS TQSCKVVVCD ADSGAVVREA RAPHPDGTEV HPDAWWSALE QASSGLLDDV 
AAVSVAGQQH GMVAVDEVGA VVRPALLWND TRSSGTALDL IEELGGPAEW AKAVDNVPNA
SLTVSKLRWL ARHEPEHADR TTAVMLPHDW LTWRLTGAGS EPTTDRGDAS GTGYWSATDD
AYRPDLLARA FGREIRVPRV AGPAEAVGRT PSGALVAPGT GDNMGSALGL GLRPGDAVVS
LGTSGTVFAV SEGPTQDPSG IICGYADATG RYLPLVCTLN AARVLGATAA MLGVDLAGLD
ELALAAEPGA EGVVLLPYLD GERTPVLPDA AGSLHGLRRS NMRPENIARA AVEGMLCGLA
DGLSAMTDTG IPVSRVLLVG GAARSAAVRA VAPTVLGARV VVPRPAEYVA VGAARQAAWT
LAGTAEAPNW DPGEPEAAAE PVDAPHVRER YAEARRSAHG L