Gene Ndas_0756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0756 
Symbol 
ID9244598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp927046 
End bp928140 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content68% 
IMG OID 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003678707 
Protein GI297559733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0825121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCC ATCCCCGACG CACACCCCTC CTGGCCGCCG CACTGTGCGG CACACTCCTG 
CTCACCGCCT GCGGAGGCGT CGGCGACGCC GCGCGGCCCG AGGGCGAGGC CGGCGTCGTC
GGCATCGCCA TGCCCACCCA GTCCTCCGAG CGCTGGATCA ACGACGGCGA GAACATGGTC
GCCGAGTTCG AGGCCCGCGG CTTCGGCACC GACCTCCAGT ACGGCGAGGA CGTCGTGGAG
GACCAGGTCT CCCAGATCGA GAACATGATC ACCCGGGGCG CCGACGTCCT GGTCATCGCC
TCCATCGACG GCGAGGCCCT CGGCGACGTG CTCGACATGG CCGCCTCCAG CGACATCCCC
GTCATCGCCT ACGACCGCCT CATCCTCGGC AGCGAGCACG TCGACTACTA CGCCACCTTC
GACAACTTCC AGGTGGGCGT CCTCCAGGGC GAGTACATCG TGCGAGCCCT CGACCTGGAG
AACGAGGAGG GTCCCTTCAA CATCGAGCTG TTCGGCGGCT CGCCCAACGA CAACAACTCC
TCCTACTTCC TCGACGGGGC GATGTCGGTG CTCCAGCCCC ACATCGACGA CGGCCGCCTC
GTCGTCCGCA GCGGCCAGAC CTCCATGGAG CAGATCGCCA CCCAGGAGTG GTCCGGCGCC
GTCGCCCAGG ACCGCATGGA CAACCTGCTC AGCGCCCACT ACTCCGAGGA GGAGGTGCAC
GCGGTCCTGT CGCCCTACGA CGGCATGAGC CTCGGCGTGA TCGAGTCCCT GCGCGCCGTC
GGCTACGGCA CCGAGGACCG GCCGCTGCCC GTCATCACCG GCCAGGACGC CGAGGCCGCC
TCGGTCCGGT CCATCATCGC CGGGGAGCAG ACCCAGACCG TCTTCAAGGA CATCCGGACC
CTGGCCACCC AGACCGTGGA CATGGTCGAG GCCCTGGTAC AGGGTGAGGA GGTCCCGGTC
AACGACACCG AGAGCTACGA CAACGGGGTC AAGGTCGTCC CCTCCTACCT GCTCGACCCC
GTCTCGGTGG ACGCCGACAA CTACCACGAG GTCCTGGTCG AGAGCGGCTA CTACGAGGAG
TCCGAGCTCC AGTGA
 
Protein sequence
MTPHPRRTPL LAAALCGTLL LTACGGVGDA ARPEGEAGVV GIAMPTQSSE RWINDGENMV 
AEFEARGFGT DLQYGEDVVE DQVSQIENMI TRGADVLVIA SIDGEALGDV LDMAASSDIP
VIAYDRLILG SEHVDYYATF DNFQVGVLQG EYIVRALDLE NEEGPFNIEL FGGSPNDNNS
SYFLDGAMSV LQPHIDDGRL VVRSGQTSME QIATQEWSGA VAQDRMDNLL SAHYSEEEVH
AVLSPYDGMS LGVIESLRAV GYGTEDRPLP VITGQDAEAA SVRSIIAGEQ TQTVFKDIRT
LATQTVDMVE ALVQGEEVPV NDTESYDNGV KVVPSYLLDP VSVDADNYHE VLVESGYYEE
SELQ