Gene Ndas_0470 details

Gene Information

Locus tag: Ndas_0470
Symbol:
ID: 9244309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 566641
End bp: 568074
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003678423
Protein GI: 297559449
COG category:
COG ID:
TIGRFAM ID:
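
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. The function names are hypothetical, and the calculation assumes inclusive 1-based coordinates and a single terminal stop codon; the record does not state either convention.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(566641, 568074)
print(length, protein_length(length))  # prints: 1434 477
```

Both values match the Gene Length and Protein Length fields of this record.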


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.896601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCGA CCACCGACCC ACACGACGGC GCGGAGAACG TGGACTGGGC GACCCTGGCC 
CCCGAGGGCG GCGCTCGCAT CCCCGAGCTC CTCGGCCAGC TCGCCGGGAG CGACAGCGCG
CTCTCCGAAC TCCACGACAT CATCCACTTC CCCGCGCCCG GACACCTCGC CGCACCCAAG
GCGGTCGACT TCCTGGTCGA CCTCGCCTGC GCCGAGTCCA CCCCGGCCAC CGACCGCTGG
CGTCCCCTCA GCCTGCTCCT GGAGCTGGTG GCCGGGCACG CCGAGGACCG CTTCCCCGAG
ACCCGCGACC TGGACCAGTG GCGCGACGAG GTGGCCTGGG CGGCCTCCAA CGACGCCGAG
AAGGTCCGCC AGCAGTACCG CGACTGGGCC GAGGAGGCCC CGGACGAGCA GCACTACCAC
CGCATGCGCA ACCGGCTCGC CGTGATGGAG CAGCCCGACG GCGCCGCGAT CCTCCAGGCC
GAGCTGGACA CCTACGAGGC CGTCCGCGCC CGCGTCCCCG ACCTGGTAGC ACTGCTCCAC
GGCGGCGCCA ACCGCGGCGG CATGGACCGC GCGGGGGAGT GGGTGAGCTA CCTGCTGGCC
TTCCTGCCCG ACGACGCCGA GCGCATCACC ACCGAGATCA CCGGAGCCTC CCAGGTCCTG
CTCGCCAAGG ACCTGCGCCC CGCCCCGCAG GCGCCCACGA GCCTGCGCGA CGTGATGAAC
GCGCTGGACA GCGGCGGCGA CCCGCTCCCC GCGGAACTGT TCGCCCTCGG CCTGCTGGCC
AGCCCGAACG ACATCGACGT CTCGGTCGGC CTCACCCACC AGATGGCGGG CGGGAACCTG
TACAACTCCT TCGCCGCCTC GGTGGCGATG CTGCTCATCC ACGGCGAGAA GACCCCGCGC
GAGGCGCTGC GCCGCGTCGG CCGCGGCGGC GGCACCTCCA TGGGCTACCA GGGCCTGTTC
AACGAGTCCT GGCCGCACTG CGGCGGGCAC TCCCCGCAGG TCCTGGGCTT CCTGGCCCTG
GGCCGCGCGG GCGACCGCGC CCGCCGTCTG CGCCTGGACA TCCTGCCCGG CCTGATCAAT
GGCGAGGAGG ACTCGCGCGC CCTCGTGACG GGTGTCGGCC TCGAACTGGT GCTCGGCCCC
CGCAGCAAGG GCCACACCGC CGAGGAGCAC GCCGAGGCCG ACTACGACGA GGAGACCCTC
AAGGTCCTCT GGACGATCGC CGAACTGCCC GCCTCCGCCT GGGAGGACGA GGAGTTCACG
CGGACCCTGA GCGCCTGGGC GCTGCCGGAC AACGCCGAGG ACTTCTGCGC GCTCGTGGGT
GTGGAGTCCC AGCCCGAACC GGAGCCGGAG CGGGCCGCGG TCCCGCAGCC GCAGCAGCCC
GCCGCCCCCC AGCCCGGCGG GCTGCTGGGG CGCCTCTTCG GCGGAGGACG GTAG
 
Protein sequence
MNATTDPHDG AENVDWATLA PEGGARIPEL LGQLAGSDSA LSELHDIIHF PAPGHLAAPK 
AVDFLVDLAC AESTPATDRW RPLSLLLELV AGHAEDRFPE TRDLDQWRDE VAWAASNDAE
KVRQQYRDWA EEAPDEQHYH RMRNRLAVME QPDGAAILQA ELDTYEAVRA RVPDLVALLH
GGANRGGMDR AGEWVSYLLA FLPDDAERIT TEITGASQVL LAKDLRPAPQ APTSLRDVMN
ALDSGGDPLP AELFALGLLA SPNDIDVSVG LTHQMAGGNL YNSFAASVAM LLIHGEKTPR
EALRRVGRGG GTSMGYQGLF NESWPHCGGH SPQVLGFLAL GRAGDRARRL RLDILPGLIN
GEEDSRALVT GVGLELVLGP RSKGHTAEEH AEADYDEETL KVLWTIAELP ASAWEDEEFT
RTLSAWALPD NAEDFCALVG VESQPEPEPE RAAVPQPQQP AAPQPGGLLG RLFGGGR
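
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below shows one way to do that in Python and then compute GC content or translate the gene sequence. All function names are hypothetical. The codon table is the standard one: translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAACGCGA CCACCGACCC ACACGACGGC GCGGAGAACG TGGACTGGGC GACCCTGGCC"
print(translate(clean(first_line)))  # prints: MNATTDPHDGAENVDWATLA
```

Passing the full gene sequence to `clean` and then `translate` gives a result that can be compared against the listed protein sequence, and `gc_content` can be compared against the GC content field.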