Gene Ndas_5484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5484 
Symbol 
ID9249387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp678622 
End bp679806 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003683369 
Protein GI297564396 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.188368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA TGTTGCCGGG CGGCGGACTC CAGCAGGACA GGTTTCCGAC GGTGCGCAAG 
GGCGGCTACG ACAAGGCGCA CGTCGACGAC TACTTCGTGC GGACGGACAA CCAGGTCAAG
AGTCTGCGCG AGCGCCTCCA GCGGCTGGAC GACGAGCTGG AGCAGTACAA GCGCGACCTC
GCCATCGCCC GCGAGAAGGC GCAGGTCAAG CCGGAGCACG AGCAGATCAG CGAGCGCATG
GCCGAGATCC TGCGCATCGC CGAGGAGGAG GCCCAGGAGC GCCGCTCCAA GGTCGAGTCC
GAGGTGAAGG AGGCCGAGAA GAAGGCCCAG GACGAGATCG CCAAGTACCG CAAGGACGCC
GAGGAGCACG CCGAGCGGAT CCTGTCCTCC GCGCGCTCGG AGGCCCACTC GATGGTCGAC
AGCGCCAAGA AGGAGTCCGA CCAGCTCCGG GAACAGGCCA AGCAGGAGGG CGAGCGCCGC
CTGAACGAGG CCGAGGCGCG CGCGAAGAAG ATCCACGACA CCGCGGACCG CAGGCTCGCC
ACCCTCACCG CCACGCACGC CGAGGCGCTG CGCCGCCTCA AGGACATGCA CTCGACCCTG
GCCGACCTGG TCGCGGCCGA GGACAAGGCG GGCGCGCTGG AGAGCGGGCT CTCCCGCGAC
GAGGTGGCCG CCGCGTCCGC CCCGGCCAAG CCGGCCCCGG CCAAGCCCGC CGCGGCGGCC
AAGGCCCCCG AGCGCGCCGC GGAGCCCTCC CGGCCCCAGC CCCGCCCCGA GGCGGCCGCG
CCCGCTCCGG CCAAGCCGGC CCCGGACGCG CCCGGGGACA AGGACGAGGC CACCACCAAG
CTGCCTCCGC TCCAGCAGCA GCCGGACGAG GCGACCGTGC GCATCAGGCC GGTGGCCAAG
CCGGAGCAGA CCGGCCAGGA CCCGGCGGGG AACTCCTCCC CGAAGGACCA GGCCCAGTCC
GCTCCCCGCC CCCCGCAGGG CGCCCGCTTC ACCGGTCCCG CCCCCGAGGG CGACCAGAAG
CCGCAGCCCG GCCCGCAGCA GCCGCAGTCC GGTCCGCAGC AGGGCCAGAA GGGCCAGGAG
CAGGGCGGCG ATCCGGGCAT CACCGGTATC TACCGCCACC CGGAGTCCGG CCACAACCAG
CCGCCCAAGG GCGAGGACGG CGTCCGGGTC ATCCGCAAGC CCTGA
 
Protein sequence
MSDMLPGGGL QQDRFPTVRK GGYDKAHVDD YFVRTDNQVK SLRERLQRLD DELEQYKRDL 
AIAREKAQVK PEHEQISERM AEILRIAEEE AQERRSKVES EVKEAEKKAQ DEIAKYRKDA
EEHAERILSS ARSEAHSMVD SAKKESDQLR EQAKQEGERR LNEAEARAKK IHDTADRRLA
TLTATHAEAL RRLKDMHSTL ADLVAAEDKA GALESGLSRD EVAAASAPAK PAPAKPAAAA
KAPERAAEPS RPQPRPEAAA PAPAKPAPDA PGDKDEATTK LPPLQQQPDE ATVRIRPVAK
PEQTGQDPAG NSSPKDQAQS APRPPQGARF TGPAPEGDQK PQPGPQQPQS GPQQGQKGQE
QGGDPGITGI YRHPESGHNQ PPKGEDGVRV IRKP