Gene Ndas_3984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3984 
Symbol 
ID9247855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4765232 
End bp4766524 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content74% 
IMG OID 
Productlipolytic protein G-D-S-L family 
Protein accessionYP_003681887 
Protein GI297562913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.767273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCCA CCCGCGCCCG GCGAGGCGTG GCCGTCGTCA TGGCCGGCAC ACTGCTCCTG 
CTCGGCGGCT GGAACGCCAC CGGCGCCGCC GCCGCACCGG AGCCTGGCAC CGGGAACTGG
GTCGGCACCT GGGGGGCCGC GCCGACCGCC ACGCCCGCCA CCGGCACCCC GGTCCTGCAC
GACGAGACCG TCCGGCAGGT CGTGCGCACC AGCGTCGGCG GCGACCGGCT GCGCCTGCGC
CTGACCAACG AGTTCGGGGA GAGCGCGCTG CGGGTGGGCG AGGTGCACGT GGCCCTGCGC
GCGGGCGACT CGGGCACCGA CATCGACCCG GCCACCGACC GCACGGTCAC CTTCGGCGGC
CGTACGTCGG TGACCGTCCC GGCGGGCGCC CCCATGGTCA GCGACCCGGT GGCGCTGGAA
CTGCCCGCCC GCTCCGACCT GGTCGTGAGC ATCCACCTGC CGGAGGAGAC ACCGGTGACC
ACCCTGCACG CCTCCTCCTT CCAGGAGAAC GTGGTGGCCG CGGGCGACGT CACCGGTGAC
ACGTCCGTCG AGGCCATCCG CACGCTCACC CAGTGGCACT TCCTGTCGGG GGTCAGCGTG
CGGTCGCGCA CGCGGCTCGC CGACGCGGTC GTGGCCCTGG GCGACTCCAT CACCGACGGC
TCCGAGACCC GGGTCAACGC CAACCACCGG TGGCCGGACC TGCTCGCCCG GCGCCTGCCG
CACACCGGCG TGCTCAACTC GGGGATCGCG GGCAACCGCC TGCTCCACGA CCCCAACCCG
CCCGAGGGCG GCGACGCCGA GGACTTCGCG GCCTTCTTCG GGCAGAGCGC CCTGCGCCGC
TTCGACCGGG ACGTGCTGGC GCACCCGGGC GCGGGCCACG TCATCGTCCT TCTGGGGGTC
AACGACCTGG GCCACCCCGG CACCGTGGCC CCGGTGTCGG AGACCGTCAC GGCAGAGGAG
GTCATCGGCG CCCACCGCCA GATCATCGCC CGCGCCCGTG CGGCGGGGCT GCGGGTCTAC
GGGGGCACCA TCCTGCCGTT CAAGGGCGAC ACCCTCGGTT TCCACACACA GGAGAACGAG
GCCAAGCGCC AGGCGGTCAA CGCGTGGATC CGCACGGGCG GGGAGTACGA CGCGGTCATC
GACTTCGACG AGGTGATGCG CGACCCGGCC GACCCGCTGA GCCTGCTGCC CGCCTACGAC
AGCGGCGACG GCCTGCACCC CAACGACGCC GGTATGGCCG CGATGGCCGA CGCCGTACCG
GCCCGGCTCT TCCGCCCGGA CCGGGAGGGA TAA
 
Protein sequence
MSPTRARRGV AVVMAGTLLL LGGWNATGAA AAPEPGTGNW VGTWGAAPTA TPATGTPVLH 
DETVRQVVRT SVGGDRLRLR LTNEFGESAL RVGEVHVALR AGDSGTDIDP ATDRTVTFGG
RTSVTVPAGA PMVSDPVALE LPARSDLVVS IHLPEETPVT TLHASSFQEN VVAAGDVTGD
TSVEAIRTLT QWHFLSGVSV RSRTRLADAV VALGDSITDG SETRVNANHR WPDLLARRLP
HTGVLNSGIA GNRLLHDPNP PEGGDAEDFA AFFGQSALRR FDRDVLAHPG AGHVIVLLGV
NDLGHPGTVA PVSETVTAEE VIGAHRQIIA RARAAGLRVY GGTILPFKGD TLGFHTQENE
AKRQAVNAWI RTGGEYDAVI DFDEVMRDPA DPLSLLPAYD SGDGLHPNDA GMAAMADAVP
ARLFRPDREG