Gene Ndas_1544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1544 
Symbol 
ID9245394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1891073 
End bp1892296 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content74% 
IMG OID 
Productcysteine/1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase 
Protein accessionYP_003679479 
Protein GI297560505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.145189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.965113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCAT GGTCTGCGCC TGACATCGTC CCCCTGCCGG GTACCGGCGG TCCCCTTCGT 
GTCCACGACA CCGCCACCGG CCGGATAAGG ACGACGACGC CGGGCCCCCG GGCCGGGATG
TACGCCTGCG GCATCACCCC CTACGACGCC GCCCACCTGG GCCACGCCTT CACCTACCTC
ACCTTCGACC TGGTCAACCG GGTCTGGCGC GACGCGGGCC ACGACGTCAA CTACGTGCAG
AACACCACCG ACATCGACGA CCCGCTCCTG GAGCGCGCGG AGGCCACCGG CGTCGACTGG
CGCGACCTCG CCCACCGCGA GATCGACGTC TTCCGCGAGG ACATGGCCGC CCTGCGGATC
ATCCCCCCGA CCTCCTACGT CGGCGTGGTG GAGTCCGTCG ACCTCATCAG CGACCTCGCC
GCCCGCATCC GCGACACCGG CGCCGCCTAC GAGCTGGACG GGGACCTGTA CTTCTCCGTC
GCCGAGGCGC CCGAGTTCGG CGAGATCAGC AACCTGGACC GCGGGCAGAT GCTGGAGCTG
TTCGGAGAAC GCGGCGGCGA CCCCCAGCGC ACCGGCAAGA AGGACCCGCT CGACTGGCTG
CTCTGGCGTG CCGAGCGCCC CGGCGAGCCC GCCTGGGACA GCCCCCTGGG CCGCGGGCGC
CCCGGCTGGC ACATCGAGTG CAGCGCCATC GCCCTGGACC GGCTCGGCCC GGCCTTCGAC
CTCAACGGCG GCGGCAGCGA CCTGATCTTC CCCCACCACG AGATGGGCGC GGCCGAGACC
CGATGTGCCA CGGGCGGACC CAACGCCCAC AACCACCTGC ACGTGGGCAT GGTCGGCCTC
GACGGCGAGA AGATGTCCAA GTCCCTGGGC AACCTGGTCT TCGTCTCCAA GCTGCGCCAG
CAGGGCGTGG ACCCGGCCGT CATCCGCCTG GCCATGCTCG CCCACCACTA CCGCGCCCCG
TGGGAGTGGA CCGACGCCGA ACTCCCCGCC GCCACCGCCC GCGCCGAGCG CTGGCGCTCC
GCCCTCGCCC TGGGCGCGGC GCCCGACGCC GCCCCGGTGC TCGCCGCCGT GCGCGCGGCC
CTGTCCGAGG ACCTGGACTC CCCGGCGGCC CTGGCCGCGG TGGACGCCTG GGCCGACACC
GCCCTCACCG AGGGCGGCGC CGACACCGGC GCGCCCGCCC TGGTGCGCGC GACCGTGGAC
ACCCTGCTGG GCGTGCGCCT GTAA
 
Protein sequence
MRSWSAPDIV PLPGTGGPLR VHDTATGRIR TTTPGPRAGM YACGITPYDA AHLGHAFTYL 
TFDLVNRVWR DAGHDVNYVQ NTTDIDDPLL ERAEATGVDW RDLAHREIDV FREDMAALRI
IPPTSYVGVV ESVDLISDLA ARIRDTGAAY ELDGDLYFSV AEAPEFGEIS NLDRGQMLEL
FGERGGDPQR TGKKDPLDWL LWRAERPGEP AWDSPLGRGR PGWHIECSAI ALDRLGPAFD
LNGGGSDLIF PHHEMGAAET RCATGGPNAH NHLHVGMVGL DGEKMSKSLG NLVFVSKLRQ
QGVDPAVIRL AMLAHHYRAP WEWTDAELPA ATARAERWRS ALALGAAPDA APVLAAVRAA
LSEDLDSPAA LAAVDAWADT ALTEGGADTG APALVRATVD TLLGVRL