Gene Ndas_3800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3800 
Symbol 
ID9247671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4563412 
End bp4564899 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content74% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681704 
Protein GI297562730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0830819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0502051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACC TGCCATTCGG TTTCAGCATG CCGAACGATC CCGATGACGA GTCCGGACGC 
CGTTCTGGCG ACTCCGGCTC GGGTGCCGGC AGCGGCGGCC CGGGCGGAGG AGGATCCGGA
ACGCCGGACG GCTTCCCGTT CGGCGACCCG CAGCAGATGG CCAACATGCT GCGTCAGTTC
GCCGACATGA TGTCGGCCCA ACCCGGCCCC GCTCCCGGTT CCAGCGGCGA CCAGAGGTCG
TCGTCCGGTG TGAACTGGGA CATGGCCAAG GACGTCGCGC GCCAGACGGT CGCCCAGGAG
GGCGACCCGA GCGTTCCGGC GTCCGACTAC GCCAGGGTCG AGGAGGCCCT GCGCCTGGCC
GACCTGTGGC TCGACCAGGC TACCGACCTG CCGTCGGGCG TCCACACCGC CCAGGCCTGG
AGCCGGTCGG AGTGGATCGA GCGGACGATG GACTCGTGGG CGCAGCTGTG CGACCCGCTG
ACCAGCAAGA CCGTCCGGTC GATGGGGCAG AACCTGCCCG AGGAGATGCG GTCCGTGGCC
GGTCCGCTGC TGGGCATGAT CCAGCAGATG GGCAGCATGA TGGTCGGCCG CCAGGCGGGC
CAGGCCGTGG GCGAGCTGTC CCGCGAGGTG GTCGGCACCG CCGACATCGG ACTGCCCCTG
GCCGGTGAGG GCCGTGCGGC GCTGCTGCCC TCCGGCGTGG CGCGCTTCAG CGAGGGCCTG
GAGGTCCCCG AGGACGAGGT GCGCCTGTAC CTGGCCGCCC GCGAGGCGGC CGTGCACCGG
CTGTACTCGC ACGTGCCGTG GCTGCGCTCG CACGTGTCCC GGCTGGTCAA CGAGTACGCG
GACGGGATGT CCTTCGACAT CAGCGGCCTG GAGGACCGGC TCGGCGAGAT CGACCTCACC
AACCCCGAGG CGCTCCAGGA GGCCCTGGGC GGTGTCGGGG GCGAGGGGCT GTTCCAGCCC
GAGGACACCC CGCAGCAGAA GGCGGCGCTG GCCCGGTTGG AGACCACCCT GGCGCTGATC
GAGGGCTGGG TGGCCACCGT GGTCTCGTCC GCGGTGTCGG GCCGTCTGCC GCAGGCGGAC
GCCCTGGCCG AGGCGACCCG GCGCCGCCGG GCGACCGGCG GGCCCGCCGA GCACACGTTC
GCCGCCCTGG TCGGCCTGGA GCTGCGCCCG CGCCGCCTGC GCGAGGCGTC CGCGCTGTGG
TCCGCCCTGG AGGAGGCGCG CGGTGTGGAG GGCCGGGACG CGGTCTGGGA GCACCCGGAC
CTGATGCCCA CCGGCGACGA CCTGGACGAT CCCGAGGCCT TCGTGCGCGG GGGCGGTGAC
GGGTTCGGCG ACGCCGACTT CGACATCTCC TCGCTGACCG GGGACGCTCC CGGCCGGGAG
AGGGCGGGGA CCGACGGGGA CGCGTCCGGG GAGGGGCCTT CCGACGAGGG CGCCCCCGGC
GGTGACCGGG ACGGCGACGG GGACGACGAC CGGAGAGAGG GCGCGTAG
 
Protein sequence
MSDLPFGFSM PNDPDDESGR RSGDSGSGAG SGGPGGGGSG TPDGFPFGDP QQMANMLRQF 
ADMMSAQPGP APGSSGDQRS SSGVNWDMAK DVARQTVAQE GDPSVPASDY ARVEEALRLA
DLWLDQATDL PSGVHTAQAW SRSEWIERTM DSWAQLCDPL TSKTVRSMGQ NLPEEMRSVA
GPLLGMIQQM GSMMVGRQAG QAVGELSREV VGTADIGLPL AGEGRAALLP SGVARFSEGL
EVPEDEVRLY LAAREAAVHR LYSHVPWLRS HVSRLVNEYA DGMSFDISGL EDRLGEIDLT
NPEALQEALG GVGGEGLFQP EDTPQQKAAL ARLETTLALI EGWVATVVSS AVSGRLPQAD
ALAEATRRRR ATGGPAEHTF AALVGLELRP RRLREASALW SALEEARGVE GRDAVWEHPD
LMPTGDDLDD PEAFVRGGGD GFGDADFDIS SLTGDAPGRE RAGTDGDASG EGPSDEGAPG
GDRDGDGDDD RREGA