Gene Ndas_0248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0248 
Symbol 
ID9244082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp305922 
End bp307970 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content69% 
IMG OID 
Producttrehalose synthase 
Protein accessionYP_003678203 
Protein GI297559229 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAG ACGAGCCCGG CCCCGGCGGA CACGGTCACG CCGAACACGG TCCGCTCGCC 
CCGGCCACCG GGGCCGCCAC CGGCCCGCTG ATGTCCTCCG GCGGCACCAG TGATCCCCAC
TGGTACAAAC ACGCCGTCTT CTACGAGGTC CTCGCCCGGG GCTTCTTCGA CTCCAACGGC
GACGGCACCG GCGACCTCGC CGGGCTGGTG CAGAAGCTGG ACTACCTCCA GTGGCTGGGC
ATCGACTGCG TGTGGCTGTT GCCGATGTAC GAGTCGCCCC TGCGCGACGG CGGCTACGAC
ATCTCCGACT ACTTCAAGAT CCTCCCGGAG TTCGGCCGCA CCGCCGACTT CGTGGAGCTC
CTGGACGAGG CGCACCGGCG CGGGATCCGG GTCATCACCG ACCTGGTCAT GAACCACACC
AGTGACCAGC ACCCGTGGTT CAAGGCCTCC CGCGAGGACC CGGACGGGCC CTACGGGGAC
TTCTACGTGT GGTCGGACAC CGACGACCGC TACGACGAGG CCCGCATCAT CTTCGTCGAC
ACCGAGACGT CCAACTGGAC CTACGACGAG GTCCGCGGCC AGTACTACTG GCACCGGTTC
TTCTCCCACC AGCCCGACCT CAACTTCGAG AACCCGGCCG TCCAGGAGGC GATCCTGGAG
GTCCTGCGCT ACTGGCTGGA CCTGGGCATC GACGGGTTCC GCCTGGACGC CGTGCCGTAC
CTGTACGAGC GCGAGGGCAC CAACTGCGAG AACCTCAAGG AGACGCACGA GTTCCTCAAG
CGCGTGCGCG CCGAGGTGGA CCGGCTCTAC CCCGACCGCG TGCTGCTGAG CGAGGCCAAC
CAGTGGCCGT CCGACGTCGT CGACTACTTC GGCGACTTCG AGTCCGGCGG CGACGAGTGC
CACATGAACT TCCACTTCCC GCTGATGCCG CGCATGTTCA TGGCGGTCCG GCAGGAGCAG
CGCTTCCCGA TCTCGGAGAT CCTCGCCCAG ACGCCGCCGA TCCCCCGCAA CTGCCAGTGG
GCGATCTTCC TGCGCAACCA CGACGAGCTG ACCCTGGAGA TGGTCACCGA CGAGGAGCGC
GACTACATGT ACTCCGAGTA CGCCAAGGAA CCCCGCATGC GCGCCAACGT GGGGATCCGC
AGGCGGCTCG CGCCCCTGCT GGACAACGAC CGCAACCAGA TCGAGCTGTT CACGGCCCTG
CTGCTGTCCC TGCCGGGCTC GCCCGTCCTG TACTACGGCG ACGAGATCGG CATGGGCGAC
AACATCTGGC TGGGCGACCG CGACGCCGTG CGCACGCCCA TGCAGTGGAG CTCGGACCGC
AACGCCGGGT TCTCCAAGGG CGACCCGGCC CGCCTGTACC TGCCGCTGAT CATGGACCCG
GTCTACGGGT ACCAAGCGCT CAACGTGGAG TCCCAGCGCG ACAACCCCGG TTCGCTGCTG
CACTGGACGC GGCGCATGAT CCAGATCCGC AAGCGCCACC CCGTGTTCGG CACCGGCGCC
TTCACCGAAC TCAACGCCAC CAACCCGAGC GTGCTGGCCT TCATCCGCGA GCACGGCGAC
GACCGGATGC TCTGCGTCAA CAACCTGTCG CGGTACCCGC AGCCCGTGGA GCTGGACCTG
GGGCGCTACG CCGGGGTCGC CCCGGTGGAG TGCGTGGGCG GTGTGCGCTT CCCCGAGATC
GGGGAGCTGC CCTACCTGCT CACCCTGCCC GGGCACGGCT TCTACTGGTT CCAGCTGCCC
ACCACCGCCG ACCGGGAGTC CGCGGGCACG AACCGGGACG GTATGCCGTC CTACGGACTG
CCCGAGGCCG GGGCCCGGGT GCACGCCGCC TTCGAGACCT CGCCGGTGTC GGCGGCCGCG
GCCCGGACGC ACGCACCGGA CCATCCAGTG AGCACGACGT CCAGCATCCC GGCCGGGTTC
TCCCCCGCGT TCGGCGACGG CGCCCGCACC CCCTCCCACG GGAGCGCGGT GCGCGCGTCG
GGCCTCTCCG GCCCGGCCGC GGAGCGCACG GGCCGGGGTG GGGGCGAGAA GGGAAACGGG
TCGCTCTGA
 
Protein sequence
MSKDEPGPGG HGHAEHGPLA PATGAATGPL MSSGGTSDPH WYKHAVFYEV LARGFFDSNG 
DGTGDLAGLV QKLDYLQWLG IDCVWLLPMY ESPLRDGGYD ISDYFKILPE FGRTADFVEL
LDEAHRRGIR VITDLVMNHT SDQHPWFKAS REDPDGPYGD FYVWSDTDDR YDEARIIFVD
TETSNWTYDE VRGQYYWHRF FSHQPDLNFE NPAVQEAILE VLRYWLDLGI DGFRLDAVPY
LYEREGTNCE NLKETHEFLK RVRAEVDRLY PDRVLLSEAN QWPSDVVDYF GDFESGGDEC
HMNFHFPLMP RMFMAVRQEQ RFPISEILAQ TPPIPRNCQW AIFLRNHDEL TLEMVTDEER
DYMYSEYAKE PRMRANVGIR RRLAPLLDND RNQIELFTAL LLSLPGSPVL YYGDEIGMGD
NIWLGDRDAV RTPMQWSSDR NAGFSKGDPA RLYLPLIMDP VYGYQALNVE SQRDNPGSLL
HWTRRMIQIR KRHPVFGTGA FTELNATNPS VLAFIREHGD DRMLCVNNLS RYPQPVELDL
GRYAGVAPVE CVGGVRFPEI GELPYLLTLP GHGFYWFQLP TTADRESAGT NRDGMPSYGL
PEAGARVHAA FETSPVSAAA ARTHAPDHPV STTSSIPAGF SPAFGDGART PSHGSAVRAS
GLSGPAAERT GRGGGEKGNG SL