Gene Ndas_4154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4154 
Symbol 
ID9248028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4959131 
End bp4960495 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682055 
Protein GI297563081 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGTTA CGCCAGACGT GGCAACTTCC CCGTCCGCCG GACGTCGGTT TCGTGCCTAC 
ACCACGAAAC ACCTCGACGA GCTCACCACG CGAGCCGGGC TCGCCGCCGA CGAGCGGCTC
GCGGTGCAGG CGGTGGCCAC CGTGCTGCCG TTCCGGGTCA ACAGCTACGT CGTCGACGAG
CTGATCGACT GGGACGCGGC TCCCGACGAT CCGATCTACC GCCTGGTCTT CCCGCAGGCG
GACATGCTGC CCCAGGACGA CGTGTCCCGG ATCGCCGACC TGCTGCGCTC TGGCGCCCAG
CGCAAGGAGC TGAACGAGGC CGCCAACCAG ATCCGCGCAC GCCTGAACCC GCACCCCGCG
GGCCAGATGG ACCTCAACGT GCCCAAGCTG GCCAACGAGG AGCCCATCCC CGGCGTCCAG
CACAAGTACA AGGAGACCGT GCTCTTCTTC CCCAAGCAGG GGCAGACCTG TCACGCGTAC
TGCACGTACT GCTTCCGCTG GGCCCAGTTC GTCGGCGACG CCGACCTGAA GTTCGCCTCC
AGCGAGATCG ACCAGCTCGT CGACTACGTC CGCTCGCACC CCGAGGTCAC CAGCGTCCTG
TTCACCGGCG GCGACCCGAT GATCATGGGC GAGGGGGTCA TCTCCAAGTA CATCGAGCCG
CTGCTGGAGA TCGAGCACCT GGAGGCCATC CGCATCGGCA CGAAGGCGCT GGCCTACTGG
CCGCAGCGCT TCGTCACCGA CCCGGACGCC GACGACACCC TGCGCCTGTT CGAGAAGGTC
GTGGCCTCGG GCAAGAACCT CGCGTTCATG GCCCACTTCT CCCACCCCAA CGAGATGCGG
CCCGAGCTGG CCCAGGAGGC GGTGCGCCGC ATCCGCGCGA CCGGCGCCGT CATCCGCACG
CAGGCGCCGC TGATCCGCAC GATCAACGAC GACTCCGCCG TGTGGGAGAG CATGTGGCGC
ACCCACCTGC GGCACGGCAT GGTCCCGTAC TACATGTTCG TCGAGCGTGA CACGGGTCCG
CAGGACTACT TCGCGGTGCC GCTGGCGGAG GCCTACGAGA TCTTCCGCGG CGCCTACAAG
AGCGTCTCGG GACTGGCCCG CACGGTGCGC GGCCCGTCGA TGTCGGCGAC CCCGGGCAAG
GTCTGCGTGG ACGGCGTCAC CGAGGTGGCG GGCCAGAAGG TCTTCGTCCT GCACTTCATC
CAGGCGCGCG ACCCCGAACT GGTCGGCAGG CCCTTCTTCG CCGAGTACGA CGAGAAGGCC
GCGTGGCTGT TCGACCTCAA GCCCGCCCTG GGCGCGACCC ACCTGCCGTG GGAGCAGTCC
CCGGTCGGCG CTCCCGGCGG CCTGGTCGAC CCCACCCGCC TGTAG
 
Protein sequence
MSVTPDVATS PSAGRRFRAY TTKHLDELTT RAGLAADERL AVQAVATVLP FRVNSYVVDE 
LIDWDAAPDD PIYRLVFPQA DMLPQDDVSR IADLLRSGAQ RKELNEAANQ IRARLNPHPA
GQMDLNVPKL ANEEPIPGVQ HKYKETVLFF PKQGQTCHAY CTYCFRWAQF VGDADLKFAS
SEIDQLVDYV RSHPEVTSVL FTGGDPMIMG EGVISKYIEP LLEIEHLEAI RIGTKALAYW
PQRFVTDPDA DDTLRLFEKV VASGKNLAFM AHFSHPNEMR PELAQEAVRR IRATGAVIRT
QAPLIRTIND DSAVWESMWR THLRHGMVPY YMFVERDTGP QDYFAVPLAE AYEIFRGAYK
SVSGLARTVR GPSMSATPGK VCVDGVTEVA GQKVFVLHFI QARDPELVGR PFFAEYDEKA
AWLFDLKPAL GATHLPWEQS PVGAPGGLVD PTRL