Gene Ndas_0466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0466 
Symbol 
ID9244305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp560688 
End bp562178 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content75% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003678419 
Protein GI297559445 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.202142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCACG CACTGACCCT GCTCCGCGGG GGCCGGGCGC GCCTGCGCGA CTGGCGGCGC 
GGCCGCCCGT TCTGGGGCGG GCTCCTCCTC GTCATCGCCG GAGTCGAGCT GCTGGTCGCA
CCGGCCGCGC AGAGCCTGAT CCTGCCCATC GACCTGATCG CCTACACCGG TATCGCCGGT
GTGTCGGGTC CGCTCATCGC GGTACTGCTG ATCACCCTGG GAGCGCTGAG CTGGTTCCAG
CCCGCGCAGC ACCTCTTCTT CGGGGTGGTC GGGCTCATGC TCGCCCTGGT GTCCTTCGTG
ACCTCCAACT TCGGCGGCTT CGTCATCGGG ATGCTGCTCG GCATCGTCGG CGGCTCCCTC
GTCTTCGCGT GGGCGCCCCG CGTGGTCCGC CGCCGAAGGC GCGGCCGGGG ACGCCGACGG
GTCGCGGACG CCGACGCCGC CGGTCCAGGG GTGTCTGCCG GGGCCCCTCC AGGGGCCGCC
GTCCCCCGAG GCCCCGAAGC GCCCGACGGG ACCGGTGGGC GCGCGGCCGG GGAGGAACCC
CCGACCGCCG TCCCCGCCGA GACCGCGCCC GACGCGGGCG TTCCCGGGAC CCCTCCCGGC
AGGGACACCG GTTCCGCGAC ACCCGACGGG TCCGTCAGCG CCCCCCGGCC TCCGTCCCGG
CCGCTGGCCG CGCTCGCCCT GCCCCTGGCA CTGGCCGTGA CCCTGGTCGG CGCCGCCGCG
CCCGCCGACT GGCCCTGGGA CTGGTTCCTG CCCCCGGGCG AGGAGGAGGA GCAGCCCTCG
CCCTCCCCCT CCCCCTCGGA CGAGCCCTCG GCGAGTCCCA CCGACCGGCC CACCCCGCCG
GGGCCCGGGC CCGGCGCGGG CGAGGGGGAC GGTCCGGACG AGCGGCCCGA GGACGGGGAG
ACCGAGGAGG AGCCGGAGGA GGACGGTCGG GACCGGGAGG CGAACCCGGA CGAGTGCGAG
ATGGGCACCG GTGAGTCCGC CCTGGCGGGG TCAGAGGAGG AGTTCCTGGA CGCCGTCCGC
GCCTGCCAGG CGGCCCAGGA CGCGGGGGAG CTGCCCGAGG TTCCGCTGGA GGAGGCCCAC
GACTGCTCCA CCGGCTCGGT CCGCGCCTCC GGCCTGACCG CCGACCGGCT GACGATGAGC
GGCGCCCGCT ACGACGGCGT GGTGGAGTGC CCCACCCTCG ACGGCCCCCG CAGGTACATC
CGGCTGACCA TGAGCCGGGC CGACTTCGTC AACGCCGAAC TGTGGTTCGA GGACGCCGGA
ACCCGGATGA GCCTGGGCCT GCCCACCATG GTCATGGACG GGTCCGTCCA GATGCACATC
ACCCGCATGC ACGTGCGCAT CCTGGGGATC CCGCTCACCT TCACACCGGA CTTCCCGCCC
CCGCTGCTGC TGCCGTACAT GATCGTCACC GACGTGGACG TGGACGACCC GCTGGCCAGC
ACCGACGTCA TGAACATCCC CGACCTCAAC GGCCGCTACG GCGGCGCCTG A
 
Protein sequence
MAHALTLLRG GRARLRDWRR GRPFWGGLLL VIAGVELLVA PAAQSLILPI DLIAYTGIAG 
VSGPLIAVLL ITLGALSWFQ PAQHLFFGVV GLMLALVSFV TSNFGGFVIG MLLGIVGGSL
VFAWAPRVVR RRRRGRGRRR VADADAAGPG VSAGAPPGAA VPRGPEAPDG TGGRAAGEEP
PTAVPAETAP DAGVPGTPPG RDTGSATPDG SVSAPRPPSR PLAALALPLA LAVTLVGAAA
PADWPWDWFL PPGEEEEQPS PSPSPSDEPS ASPTDRPTPP GPGPGAGEGD GPDERPEDGE
TEEEPEEDGR DREANPDECE MGTGESALAG SEEEFLDAVR ACQAAQDAGE LPEVPLEEAH
DCSTGSVRAS GLTADRLTMS GARYDGVVEC PTLDGPRRYI RLTMSRADFV NAELWFEDAG
TRMSLGLPTM VMDGSVQMHI TRMHVRILGI PLTFTPDFPP PLLLPYMIVT DVDVDDPLAS
TDVMNIPDLN GRYGGA