Gene Ndas_0751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0751 
Symbol 
ID9244593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp920770 
End bp922206 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content74% 
IMG OID 
ProductL-arabinose isomerase 
Protein accessionYP_003678702 
Protein GI297559728 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTAC CCACGCCCCC CGCCACCGAG CACCCTCCCG TGCGCGCCCG CGCCCCCCGC 
GTCGGCCTGC TCGGCATCAT GCAGCCGCTC TACGACGACA TGATCCCCGG CATCACCGAG
CACCAGGCGG CGTACGCGGC CCGGGTGGCC GAACGCCTCT CCGGCGTCGC CGAGTGGACC
GTGGCCCCGC CGGTGCGCGG GCGCGCCGAC GCCGAGGAGG CGATCCGCGG CTTCGAGGCC
GCCGGACTCG ACGGGGTCCT GGTCGTCATG CTCACCTACG GCCCCTCCCT GCGCGTCACC
CGCGCGCTGG CCCGTACCCA CCTGCCCCTG GCGCTGGCCA ACATCCAGCC CGACCCCGCG
GTCAGCCCGT CCTGGGACAT GGACGACATG ACCTACAACC AGGGCATCCA CGGGGCCCAG
GACACCGCCA ACGCCATGGT GCGGGCGGGG CTGCCCTTCG AGGTCCTCAC CGGCGAGTGG
CAGAGCCCGG AGTTCGCCGC GCGCGTGGAC CGCTGGGCGC GGGCGGCCCG CGCCGTCACC
GCCCTGCGCG GGCTGCGGGT CGGCGTGTTC GGCTACCCCA TGAACGGCAT GGGCGACGCC
AGGGTGGACG AGACCGCGCT GCTGCGCAGG CTCGGCCCCG AGGTCCACGT CATCGCGCCC
GGAGCCCTGC ACCGGACCAT GGCCGACCTG CCCGAACAGG CCGTGCGCGA CCTCATGGCC
TGGGAGGACG GGGCCCTGGA GGTGGACGGC CGCCTGTCCG AGGAGGAGCG CGAGGACCAC
GCCCGCATGC AGCTGGGCAT CGAACGGCTG CTGGAGGAGT CCGGGTGCGG CGCCTACTCC
ACCCACTTCG ACGCGATCGG GGAGGACGGC CGCTTCGCCC GGCTGCCGAT GGCCGCCGCC
TCCACCCTGA TGGCCCAGGG GTACGGGTTC GCCGGTGAGG GCGACGTCCT GGCCGCCTCG
ATCGTCTACG CCGGGCACCA GCTCGCCGGG GACGGCCACT TCACCGAGAT GTACGCGATG
GACTTCCCCA GCGACTCCAT CCTCATGAGC CACATGGGCG AGGGGAACTG GAGGGTGGCC
CGCGAGGACG AGCCCATCCG GCTGGTCAAG CGCCCGCTGG GCATCGGCGG CCTGGGCGAC
CCGCCCACCA TCGTGTTCCG CTACCGGCCG GGCCCGGCCA CCCTGGCCTC CCTGGTCGCC
CTCGGCGGAG AGGAGTTCCG CCTGGTCGTG GCCGAGGGCG AGGTCATCGA CGCCCCCGAA
CTGCCCTCCC TGGAGATGCC CTACGGCCAG TTCCGCCCCG AGACGGGTGT GCGGGCCTGC
ATGGACGCCT GGCTCCGCGC GGGCGGCACC CACCACATGG TCATGAACAC CGGGGCGCGT
GCGCAGGACT GGCGGGTCCT GTGCGAGCTG TCCGGAATCG AGTACGTCCG GGTCTGA
 
Protein sequence
MTVPTPPATE HPPVRARAPR VGLLGIMQPL YDDMIPGITE HQAAYAARVA ERLSGVAEWT 
VAPPVRGRAD AEEAIRGFEA AGLDGVLVVM LTYGPSLRVT RALARTHLPL ALANIQPDPA
VSPSWDMDDM TYNQGIHGAQ DTANAMVRAG LPFEVLTGEW QSPEFAARVD RWARAARAVT
ALRGLRVGVF GYPMNGMGDA RVDETALLRR LGPEVHVIAP GALHRTMADL PEQAVRDLMA
WEDGALEVDG RLSEEEREDH ARMQLGIERL LEESGCGAYS THFDAIGEDG RFARLPMAAA
STLMAQGYGF AGEGDVLAAS IVYAGHQLAG DGHFTEMYAM DFPSDSILMS HMGEGNWRVA
REDEPIRLVK RPLGIGGLGD PPTIVFRYRP GPATLASLVA LGGEEFRLVV AEGEVIDAPE
LPSLEMPYGQ FRPETGVRAC MDAWLRAGGT HHMVMNTGAR AQDWRVLCEL SGIEYVRV