Gene Ndas_3395 details

Gene Information

Locus tag: Ndas_3395
Symbol:
ID: 9247260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4059510
End bp: 4060886
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: secreted protein
Protein accession: YP_003681306
Protein GI: 297562332
COG category:
COG ID:
TIGRFAM ID:
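
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names gene_length_bp and protein_length_aa are hypothetical helpers introduced for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4059510, 4060886)
print(length, protein_length_aa(length))  # prints: 1377 458
```

Under these assumptions the result agrees with the Gene Length (1377 bp) and Protein Length (458 aa) fields.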


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.201979
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.899466
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGTCGG AGACGCGAAG GCCCGCGAAC GGACTGCGGT GGGGGACGCT CGGAACGGCA 
CCCCTCCTGG TCACCGCCCT CCTGGCCGTT CCCGGGGCCA GGGCCGACGG GGTGAGCACC
GGCGGGGGCG TCGTGCACGT GGCCAACCTG CCCAGGCCGG AGGGGCTGGA CGGCACCGCG
TCCTACAACT CCGACCTGGC GTTCAGCGGC GACTACGCGA TCGGCGGCAA CTACGACGGC
TTCGTCGTCT ACGACATCTC CGATCCGGAG AACCCGAGCC GGGTCTCCAC GGTCGTGTGC
CCCGGGAGTC AGGGCGACGT GTCGGTCAGC GGCGACCTGC TCTACCTCTC GGTGGACTAC
CCGCGCGCCA GCAGCGAGTG CGGCGCGCCC TCGGTGTCGG CCACCGACCC GGACGGCTTC
GAGGGGATCC GGATCTTCGA CATCTCCGAC AAGGCCAACC CGCAGTACGT GTCGGCGGTG
CGCACCGACT GCGGTTCGCA CACCAACACG CTGGTGCCGG GCGAGGACGG TGAGCGCGAC
TACGTGTACG TCTCCTCGTA CTCGCCCTCG AAGGACTTTC CCAACTGCCA GCCGCCGCAC
GACAGGATCT CCGTCGTCGA GGTTCCGCTG GCGGACCCGG CCTCGGCCCG GGTGGTCAGC
GAACCGGTGC TGCTGCCGGA CGGCGGCCAC GGGACCACCA CCGGCTGCCA CGACATCACC
GCCTACCCCG AGCGGAACCT GGCCGCGGCG GCCTGCCTGG GGGACGGACT GCTGCTCGAC
ATCTCCGACC CGGTGAACCC GGTCGTCACC GAGGCGGTCC AGGACGAGAA CTTCGCGTTC
TGGCACTCGG CGACCTTCAC CAACGACGCC CGCACCGTGG TCTTCACCGA CGAGCTCGGC
GGCGGGCACG CGGCCACCTG CGACGCCGGG ACGGGGCCCG AACGGGGAGC CAACGCGGTC
TACTCTCTGG ACCAGAGCGG GGCGGAACCG AAGCTCGAAT TTCGGAGCTA CTACAAACTG
CCTCGTCATC AGGCCGAAAC TGAGAACTGT GTGGCGCACA ACGGCTCGCT GATCCCGGTG
CCCGGCCAGG ACTACTTCGT GCAGTCCTGG TACCAGGGCG GCGTCTCGGT GATCGACCTC
AACGACCCGG CCGACCCCCG GGAGATCGGC CACTTCGACC GCGGACCCTG GAACCCCGAC
GCCCTGACGA CGGCGGGCTC GTGGTCGGCC TACTACTACA ACGGCTACGT CTACTCCTCC
GACATCAGGC GCGGGCTCGA CGTGCTCCGG CTGACCGACT CCCGCCTGGC GGGGGCCGAG
GAGGTGCGGA TGGAGGAGTT CAACCCGCAG TCGCAGCCGT CCACGCCCTC CGGCTGA
 
Protein sequence
MVSETRRPAN GLRWGTLGTA PLLVTALLAV PGARADGVST GGGVVHVANL PRPEGLDGTA 
SYNSDLAFSG DYAIGGNYDG FVVYDISDPE NPSRVSTVVC PGSQGDVSVS GDLLYLSVDY
PRASSECGAP SVSATDPDGF EGIRIFDISD KANPQYVSAV RTDCGSHTNT LVPGEDGERD
YVYVSSYSPS KDFPNCQPPH DRISVVEVPL ADPASARVVS EPVLLPDGGH GTTTGCHDIT
AYPERNLAAA ACLGDGLLLD ISDPVNPVVT EAVQDENFAF WHSATFTNDA RTVVFTDELG
GGHAATCDAG TGPERGANAV YSLDQSGAEP KLEFRSYYKL PRHQAETENC VAHNGSLIPV
PGQDYFVQSW YQGGVSVIDL NDPADPREIG HFDRGPWNPD ALTTAGSWSA YYYNGYVYSS
DIRRGLDVLR LTDSRLAGAE EVRMEEFNPQ SQPSTPSG