Gene Ndas_0817 details

Gene Information

Locus tag: Ndas_0817
Symbol:
ID: 9244662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1007907
End bp: 1009553
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: protein of unknown function DUF885
Protein accession: YP_003678767
Protein GI: 297559793
COG category:
COG ID:
TIGRFAM ID:
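
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted as a residue. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1007907
end_bp = 1009553

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1647 548
```

Under these assumptions the result agrees with the listed Gene Length (1647 bp) and Protein Length (548 aa).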


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0452595
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCCGGC GGTTCCGCGA GGTGGCCGAG CGCGTGCTCG ACTCCCTGCT CCACGACGCC 
CCCGAGTGGG CGCTGGACCT GGGCGACACC CGTGGCGCCT CCCGTCTGTC CGACCACTCG
GCCGAGGCCG ACGTGCGCCG CGTCTCCGTG CTCACCGACG CCCTCGGATC CCTGGACGAG
ATCGACCCCG ACCTCATCCC GGCCGGCGAC CGGGTCGACC TGGAGGTGCT GCGGACCCGC
GTCAGCGCCG ACCTGTGGCA CACCGCGGAA CTGCGCCCGC ACACGTGGGA TCCGCTGCTG
TACTCGCCGG GCGAGGCCCT GCACGCCCTC GTGGAGCGTG AGGTCCTGCC CCTCCCCGAA
CGCCTGTCGG CGCTCGCCGC GCGCTGCGCG GCCCTGCCCG TCCACCTCGC CACCGCGCGC
TCGCGCCTGT CGGAGGGCCC CGGCATGCCC CGCGTGCACG TGGAGACGGC CCTGGCCCAG
GCGGCCGGGG CCCGCGCCAT GCTCACCTCC GACGTGCCCG CCCCGGCCGA GGGAGCGCCC
TCCGCCCTGG AACCGGCCCG CGAGGCCGCC CTGGCCGCCG TGGAGGAGCA CGCCGCCTGG
CTGCGGGACC GTCTGGAGAC CGCCACCGCC GACCCCCGTC TGGGCGAGCG CGACTTCGCC
GCCCAGCTCT GGTACACCCT CGACTCCGAG CTCTCGCCCG AGGCGCTGCT GGTGCGCGCC
GAGAGCGACC TGCTGGCCAC GGAGGAGGCG ATCGCCGAGA CGGCGGCCGA GTACCTGGGC
GGGGCGCGCC GCCGGGAGGG GGTGGCCGAG GCGCTCGCCG AGCTGGCCGC ACGGGGCGCC
ACCGACGCCG ACACCGTCCG CCCCGCCTGC GCCGACGCCC TCCTGCACCT GAACGAGCGG
GTGCGCGCGC TGGACATCGT GACGGTCCAC GACGACCCGG TCCGGATCGT GCCGATGCCC
GAGGCCCGCC GCGGGGTGTC GGTGGCCTAC TGCGAGCCCC CCGGCCCCCT CGACCCGCGG
TCCGGGGAGC AGCCGACCCT GGTAGCGGTG GCCCCGCCGC CGGAGGACTG GCCCGCCGAG
CGCAGGGAGT CCTTCTTCCG CGAGTACAAC GCGGTCATGC TGCGCGACCT CATGGCCCAC
GAGGCCGTTC CCGGGCACGC TCTCCAGCTC GCCCACGCCG CCCGGCACGA GGGCGGCACC
CGGGTGGGCC GGGCCCTGTG GAGCGGCACC TTCGTGGAGG GCTGGGCGGT CTACGCCGAG
GAGGTGCTGG CCCGCCACGG CTGGTCCGGC GACCGGCGCG AGGACCTGGC GCTGCGCCTG
GTGCAGCTCA AGATGCGCCT GCGGATGATC ATCAACGCGA TCCTGGACGT GCGCCTGCAC
ACCGGCGACC TCACCGAGGC CGAGGCGATC TCCCTGATGA CCCGGCGCGG ACACCAGGAG
GAGGGCGAGG CCGTCGGCAA GTGGCGCCGC GCCCAGCTCA CCAGCGCCCA GCTGTCCACC
TACTACGTGG GCTACGCGGA GGTCTCCGAC ATCGCCAGCG ACCTGGCCCT GGCCCGGCCC
GCGCTCACCG AGCGCGAGCG CCACGACGCG ATGCTCGCCC ACGGCAGCCC GCCCCCGCGC
CACCTGCGCA CCCTGCTGGG GCTGTAG
 
Protein sequence
MTRRFREVAE RVLDSLLHDA PEWALDLGDT RGASRLSDHS AEADVRRVSV LTDALGSLDE 
IDPDLIPAGD RVDLEVLRTR VSADLWHTAE LRPHTWDPLL YSPGEALHAL VEREVLPLPE
RLSALAARCA ALPVHLATAR SRLSEGPGMP RVHVETALAQ AAGARAMLTS DVPAPAEGAP
SALEPAREAA LAAVEEHAAW LRDRLETATA DPRLGERDFA AQLWYTLDSE LSPEALLVRA
ESDLLATEEA IAETAAEYLG GARRREGVAE ALAELAARGA TDADTVRPAC ADALLHLNER
VRALDIVTVH DDPVRIVPMP EARRGVSVAY CEPPGPLDPR SGEQPTLVAV APPPEDWPAE
RRESFFREYN AVMLRDLMAH EAVPGHALQL AHAARHEGGT RVGRALWSGT FVEGWAVYAE
EVLARHGWSG DRREDLALRL VQLKMRLRMI INAILDVRLH TGDLTEAEAI SLMTRRGHQE
EGEAVGKWRR AQLTSAQLST YYVGYAEVSD IASDLALARP ALTERERHDA MLAHGSPPPR
HLRTLLGL
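
The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it allows. The helper `translate` is hypothetical, and it does not handle alternative start codons such as GTG or TTG, which is adequate here because this gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGACCCGGC GGTTCCGCGA GGTGGCCGAG"
print(translate(first_blocks))  # MTRRFREVAE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function gives the complete translation for comparison.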