Gene Ndas_3537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3537 
Symbol 
ID9247406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4247657 
End bp4248712 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content77% 
IMG OID 
ProductEndonuclease/exonuclease/phosphatase 
Protein accessionYP_003681444 
Protein GI297562470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.291858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.737818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGCG TACAGGCACC GGGAAGCGGC ACGGGACCGG CGGGCGGTGG GGCCTCCGGG 
GCCTCCGGGG CCTCCGGCCG CTCCCCCGCG GTGACGGTCC TGGTCGCGCT GGCCGCGGCG
GGGCTCGCGG TGTTCTCCCT GCTGCGGGTC CTGGGCCTGG AACGGGGCTG GCCCCTGGTG
CCCGCGCTGG CCTTCACGCC CTACGCGCTG GCCGCCGCGC CGCTGGCGGC GGTGGTGGCC
GGGCTGCTGC GGCGCTGGGG GTCCATGGCG GTCCTGACCG CGGTGACGTT GGCGCTGGCC
GCCGTGGTCG TGCCCCGCGC TGTGCCGTTC GGCGTGACCT CGGCGGGCGG TCCGGTCGTG
CGGATCATGA CCCTGAACAC GCTCGGCGGC GGCGCCGACG CGGCGCGGGT GGTGTCGCTG
GTGCGCGACC GGGAGGTGGA CGTGCTCACC CTCCAGGAGG TCACCCCCGA CCTGGTGCGC
GAGCTGTCCG CCGCCGGGCT GGACGACCTG CTGCCGCACG CCGTCGACCG CTCCGACCCC
CGGGGCGTGC ACGGGAGCAC CGTCCACTCG GCCCTCCCGC TCACCGACAC CGGCGGCGGG
GAGGCGGGCC ACGACACGTT CGCGATGCCG ACCGCCGTCG TGCGGCTGCC CGAGGACGGC
GACGGGGACG GGGAGGACGA CGTGCTGGAG GTGACGTCGG TGCACGTCCC GCCTCCGCTG
TCCCCCGCCT ACACCGCCTC CTGGCGCGGG GAGCTGGAGG GCTTGGCCGA GGTGGGCGAC
CCGGACACGA TGCGGGTCCT GGCCGGGGAC TTCAACGCCA CGCTCGACCA CGCGGCGCTG
CGCGAGGTCC TGGAGGCGGG GTACCTGAGC TCGGCCGCGG TCCTGGGTGA GGGGTTGGAG
CCCACGTGGC CGGTGGGGCG AGCGGTGCCG GGCCTGGCCA TCGACCACGT GCTGGTGCAC
CCCCGGATGG GCGTCGACGG CCTGGAGGTC GTGGAGGTGA CCGGCACTGA CCACAGGGCG
GTGCTGGTCG AGGTGTCCCT GCCCTCGCCC CGGTAG
 
Protein sequence
MERVQAPGSG TGPAGGGASG ASGASGRSPA VTVLVALAAA GLAVFSLLRV LGLERGWPLV 
PALAFTPYAL AAAPLAAVVA GLLRRWGSMA VLTAVTLALA AVVVPRAVPF GVTSAGGPVV
RIMTLNTLGG GADAARVVSL VRDREVDVLT LQEVTPDLVR ELSAAGLDDL LPHAVDRSDP
RGVHGSTVHS ALPLTDTGGG EAGHDTFAMP TAVVRLPEDG DGDGEDDVLE VTSVHVPPPL
SPAYTASWRG ELEGLAEVGD PDTMRVLAGD FNATLDHAAL REVLEAGYLS SAAVLGEGLE
PTWPVGRAVP GLAIDHVLVH PRMGVDGLEV VEVTGTDHRA VLVEVSLPSP R