Gene Ndas_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5333 
Symbol 
ID9249233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp499466 
End bp500950 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content76% 
IMG OID 
Productprotein of unknown function DUF309 
Protein accessionYP_003683219 
Protein GI297564246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0481251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGGACG ACTCGGCCGC CTACGTCGAC GGCCCCTGGA CCCACAGGAC CGTCAGCGCG 
GCGGGGGCCC GGTTCCACGT CGCCGAGGCC GGTGACGGGC CGCTGGTGCT CCTCCTCCAC
GGCTTCCCCC AGTTCTGGTG GGCCTGGCGC GCCCAACTGA CCGCGCTCGC CGACGCCGGT
TACCGCGCGG TCGCCGCGGA CCTGCGCGGC TACGGCGCCA GCGACAAGAC CCCGCGCGGC
TACGACCTCG TCACCCTCGC CCAGGACGCC GCCGGACTGG TCCGCGCCCT CGGCTCACGG
GACGCGGCCG TGGTCGGGCA CGGCCTGGGC GGCCTCGTCG CCTGGACGAT GACCGCCTAC
CACCCCGGCA CCGTGCGCGC CCTGGCCGCG GTGTCGTCGC CGCACCCGCT GCGAGCGGCC
CGCGTCCTGG CCTCCGGCGG TCCCGGCGTC CGCCACATGC TCCGGGCACA GCTGCCGATC
CTCCCCGAGC ACCGGCTCCT GAGCGACGGG TGCGAACGCG TGGGCGACCT GCTCCGGGAG
TGGTCGGGCC CCGGCTGGCC CGACACCGAG GCCGAGGAGC ACTACCGCCG CGCCTTCGCC
ATCCCCAAGG TCTCCCACTG CTCCCTGGAG AGCCACCGCT GGATCTTCCG GTCGCGGTGG
CGCACGGACG GACTCCGCTA CGACGCGCGG ATGCGCCGTG CTCCCGTGCG CGTCCCCGTC
CTCCAGCTCC ACGGGACCCT CGACCCGGTG TGCCCGCCCG GACCGGCACG CGCCTCACGG
GGCGTGGTGA CGGGGCCCTA CCGTTGGAGG CAGGTCCACG GCGCGGGACA CTTCCCGCAC
GAGGAACGAC CGGAGGAGGT CTCCCGCGCG CTCGTCGAGT GGCTCGCGGA GGTCTCGGCG
GTGAGGCGGA CGGAGTCCTC GCAGGCGAGG GGGAACGGCG GTGAGGCACT GATGGCGACG
GAGACCGCGG ACGGGCACCG GGAGTCCGGG GGCCGAGGAG GCGGGCGGGA CCGCAACGAG
ACCGGGCAGG CGCAGAACCA GCGCCCCCGG GACCGGTACG GGCGCCCGAT GCCGCACGGG
AGCCGGGGCG AGGTGGAGCG GGTCCCCGAC GACGCCGAGT TCTCCGCGGA GGAAGGTCTG
GAGGAGGCCC AGCGGCTGCT CGACCAGGGG TACGCTTTCA CCGCCCACGA GGTCCTCGAA
GCCGTGTGGA AGTCCGCGCC CGACCCCGAG CGGGAACTGT GGCGCGGCCT CGCCCAGACG
GCCGTGGGGG TCACCCACGC GCAGCGCGGC AACATGGTCG GCGCGGCACG TCTGCTGAGG
CGGGGCGCGG ACCGCGTGGA GCCGTTCGGG CCGGACGCCC CCCACGGGGT GGACGTGGCG
GGGGTGGCGG CGTTCGCCCG CGCCCTGGCC GACGACCTCG ACGCCGGGCG CGCCCGCCCG
GGTGACGGGA TCGACCCTTC GGGGATGCGC CTGCTCGGCG GGTAA
 
Protein sequence
MLDDSAAYVD GPWTHRTVSA AGARFHVAEA GDGPLVLLLH GFPQFWWAWR AQLTALADAG 
YRAVAADLRG YGASDKTPRG YDLVTLAQDA AGLVRALGSR DAAVVGHGLG GLVAWTMTAY
HPGTVRALAA VSSPHPLRAA RVLASGGPGV RHMLRAQLPI LPEHRLLSDG CERVGDLLRE
WSGPGWPDTE AEEHYRRAFA IPKVSHCSLE SHRWIFRSRW RTDGLRYDAR MRRAPVRVPV
LQLHGTLDPV CPPGPARASR GVVTGPYRWR QVHGAGHFPH EERPEEVSRA LVEWLAEVSA
VRRTESSQAR GNGGEALMAT ETADGHRESG GRGGGRDRNE TGQAQNQRPR DRYGRPMPHG
SRGEVERVPD DAEFSAEEGL EEAQRLLDQG YAFTAHEVLE AVWKSAPDPE RELWRGLAQT
AVGVTHAQRG NMVGAARLLR RGADRVEPFG PDAPHGVDVA GVAAFARALA DDLDAGRARP
GDGIDPSGMR LLGG