Gene Ndas_2267 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2267 
Symbol 
ID9246117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2709862 
End bp2711691 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content76% 
IMG OID 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003680195 
Protein GI297561221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.767056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAAG CCCAGACCAC CCCGACCACG TCCGACGCCC TCGCCGCCCC CGCCGTCGCG 
GGGGTCGGCG TCCGCGGCGA CTTCGCCGTG TGGGCGCCCC ACCGCGAACG CGTCCGCCTG
CGCCTGTACG GCACCGCGGA GCACGGCGGC ACGGGGGAGC GCGACGTCGC CATGCGGCCC
GACGACGACG GCTGGTGGCG CGTCCGCGTC GAGGACGCCG GGCCCGGCAC CGAGTACGCC
TACCTCCTGG ACGACGACCC CCAGCCCCTG CCCGACCCCC GCTCACTCCA CCAGCCCCAC
GGCGTCCACG GCCCCAGCCG CGTCCACGAC CACGCAGCCT TCGCCTGGAC CGACGCCGAC
TGGACCGGGC GCCCCCTCGC GGGCGCCGTC GTCTACGAAC TCCACGTGGG CACCTTCACC
CCCCAGGGCA CCCTCGCCGC CGTCGCCGAC CACCTCGACC ACCTGGCCGA CCTCGGCGTC
ACCCACGTCG AACTCATGCC GGTCAACGCC TTCGACGGCA CCCACGGCTG GGGCTACGAC
GGCGTCCTGT GGGCGGCTGT CCACGACCCC TACGGCGGCC CCGACGCCCT CAAGGCCCTC
GTCGACGCCT GCCACCGCCG CGGCCTGGCC GTCCTGCTCG ACGTGGTCTA CAACCACCTC
GGCCCCTCGG GCGCCTACAT GCCCCGCTTC GGCCCCTACT TCCGCGGCGA GAACGCCTGG
GGCCCCTCCC TCAACCTGGA CGGCCCCGAC TCCGACCCGG TCCGCCGCAC GGTCCTGGAC
AACGCCCTGG ACTGGCTGCG CCACTACCAC CTGGACGGGC TGCGCCTGGA CGCCGTGCAC
GCCCTGCGCG ACGACCGCGC CACCCCGCTG CTGGCCGAAC TCGCCGAGGA GGTGGACGCC
CTCGCCACCG CCCTGAACCG GCCGCTGTCC CTGGTCGCCG AGTCCGACCG CAACGACCCC
CGCACCGTCC TGCCCCGCGA GGCGGGCGGC CTGGGCATGA CCGCCCAGTG GTCCGACGAC
CTCCACCACG CCCTGCACGT CGCCCTCACC GGGGAGACGC ACGGCTACTA CGCCGACTTC
GCCGACCCGG GGGCGCTGCC CGCCGCGCTC ACCCGAGCGT TCTGGCACGC GGGCACCCGC
TCCAGCTTCC GCGGCCGCAC CCACGGCGCG CCCGTGGACA CCGCGCGCGT CCCCGGCAGC
CGCTTCCTGG CCTACCTGAG CACCCACGAC CAGATCGGCA ACCGGGCCCG GGGCGACCGC
ATGGGCGAAC ACCTCTCCCC TGGCCTGCTC GCCTGCGGCG CCGCGCTGGT GCTGTGCTCC
CCCTACACCC CCATGATCTT CATGGGGGAG GAGTGGGGGG CCGCCACGCC CTGGCCGTTC
TTCGCCTCCT TCACCGACCC CGACCTGGTC AGGGGCGTGC GCGAGGGACG CCGCCGCGAG
TTCGCCGCGC TGGGGTGGGC CGAGGAGGAG ATCCCCGACC CCATGGACCC GGCCACCCGC
GACGGCGCCG TCCTGGACTG GTCCGAGCCC GGACGCGAAC CGCACGGGCT GGTCCTGGAC
ACCTACCGGG CGCTCATCGC CCTGCGGCGC GTGGAACCGG AGCTGTCCGA CCCGCGCCTG
GACCGCTCCT CGGTCGAGGT GGGCGGCGGC GGACGCCTGC TCGTCCTGGC CCGGGGAAGC
CTGCGCGTGG TGTGCAACCT GGACGCCGAC GGCGCCGAGG TGGAGCTGGA CGCGGCCCCG
CGCGAACTCC TGCTCGCCAA CGGCGAGCCC AGGACCGCGG GGTCCACCGT CACCGTGCCG
GGGGAGTGCT TCGCCGTCCT GAGGGTGTAG
 
Protein sequence
MSEAQTTPTT SDALAAPAVA GVGVRGDFAV WAPHRERVRL RLYGTAEHGG TGERDVAMRP 
DDDGWWRVRV EDAGPGTEYA YLLDDDPQPL PDPRSLHQPH GVHGPSRVHD HAAFAWTDAD
WTGRPLAGAV VYELHVGTFT PQGTLAAVAD HLDHLADLGV THVELMPVNA FDGTHGWGYD
GVLWAAVHDP YGGPDALKAL VDACHRRGLA VLLDVVYNHL GPSGAYMPRF GPYFRGENAW
GPSLNLDGPD SDPVRRTVLD NALDWLRHYH LDGLRLDAVH ALRDDRATPL LAELAEEVDA
LATALNRPLS LVAESDRNDP RTVLPREAGG LGMTAQWSDD LHHALHVALT GETHGYYADF
ADPGALPAAL TRAFWHAGTR SSFRGRTHGA PVDTARVPGS RFLAYLSTHD QIGNRARGDR
MGEHLSPGLL ACGAALVLCS PYTPMIFMGE EWGAATPWPF FASFTDPDLV RGVREGRRRE
FAALGWAEEE IPDPMDPATR DGAVLDWSEP GREPHGLVLD TYRALIALRR VEPELSDPRL
DRSSVEVGGG GRLLVLARGS LRVVCNLDAD GAEVELDAAP RELLLANGEP RTAGSTVTVP
GECFAVLRV