Gene Ndas_1488 details


Gene Information

Locus tag: Ndas_1488
Symbol:
ID: 9245338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1822385
End bp: 1823902
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: protein of unknown function DUF245 domain protein
Protein accession: YP_003679424
Protein GI: 297560450
COG category:
COG ID:
TIGRFAM ID:
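
The coordinate and length fields above agree with one another, and the check can be sketched in a few lines of Python. This is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1822385, 1823902)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1518 505
```

Under these assumptions, 1518 bp corresponds to 506 codons, which leaves 505 residues once the stop codon is excluded. This matches the Gene Length and Protein Length fields.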


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.283032
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.138828
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTGC GGCGGATTAT GGGTATCGAG ACCGAGTACG GGATCTCCGT ACCAGGCAAC 
CCCGGGGCCA ACGCGATGGT CACCTCCACC CAGGTGGTCA ACGCCTATCT GGCCGCCTCG
GCGGCCCGGG CGCGGCGCGC CCGCTGGGAC TTCGAGGAGG AGAACCCCCT GCGCGACGCG
CGCGGCTTCG ACCTGGCGCG TGAGGTCGCC GATCCCACGC AGCTCACCGA CGAGGACCTG
GGCCTGGCCA ACGTCATCCT CACCAACGGC GCCCGCCTGT ACGTGGACCA CGCCCACCCC
GAGTACTCGG CGCCCGAGGT CACCAACCCG CGCGACGCGG TGCTGTGGGA CAAGGCGGGG
GAGCGGGTGA TGGCCGACGC CGCCGCGCGG GCGGGCGCCA CCCCGGGCAT CGAGCCGATC
CAGCTGTACA AGAACAACAC CGACAACAAG GGCGCCTCCT ACGGCTGCCA CGAGAACTAC
CTGATGCACC GGTCGACGCC GTTCGGCGAC ATCGTCCGGC ACCTGATCCC GTTCTTCGTC
TCCCGCCAGG TGGTGTGCGG GGCGGGCAAG GTCGGCATCG GCTCCGACGG GCAGGGCAGC
GGGTTCCAGA TCTCCCAGCG CGCCGACTTC TTCGAGGTCG AGGTGGGGCT GGAGACCACC
CTCAAGCGCC CCATCATCAA CACCAGGGAC GAACCGCACG CCGACCCCGA CCAGTACCGG
CGGCTGCACG TCATCATCGG CGACGCCAAC ATGAGCGAGA TCTCCACCTA CCTCAAGCTG
GGGACGACGG CGCTGGTGCT GTCGATGATC GAGGACGGGT TCCTGAGCGC GGACCTGTCC
CTGGAGACCC CGGTGGCCGA CCTGCGGGAG GTCTCGCACG ACCCCGGCCT CACCCACCGG
GTGCGGCTAC GCGACGGGCG GCGGATGACC GCGCTGGAAC TCCAGTGCGA GTACCTGGAC
CAGGCCCGCA AGTACGTGGA GGACCGCTTC GGCACCGACG TGGACCCCGA CACCGCCGAC
GTCCTGGACC GCTGGGAGTC GGTGCTCAAC CGGCTGGGCG CCGACCCGAT GAGCCTGGCC
GACGAGCTGG ACTGGGTGGC CAAACTCAAG GTGCTGGAGG GCTACCGCTC CCGTGACTCC
CTGGAGTGGT CGCACCCCCG CCTGCAACTG GTGGACCTGC AGTACTCCGA CGTGCGGGCG
GACAAGGGCA TCTACAACCG GCTGGTGGCG CGCGGGCGGA TGAAGCGCCT GCTGGAGGAG
GCCCAGGTGG ACCGGGCGGT CACCGAGCCG CCGGAGGACA CCCGCGCCTA CTTCCGCGGG
CGCTGCCTGG CCAAGTACGC CGACGAGGTG GCCGCCGCCT CCTGGGACTC GGTGATCTTC
GACCTGCCGG GCTACGACTC GCTCCAGCGG GTGCCGACCC TGGAGCCGCG CCGGGGCACC
AAGGAACACG TGGGGAAGCT GCTGGACGCC TCCCCGACCG CGGCCGACCT GGTGGCGGTG
CTCACCGGAG GCCGCTGA
 
Protein sequence
MSVRRIMGIE TEYGISVPGN PGANAMVTST QVVNAYLAAS AARARRARWD FEEENPLRDA 
RGFDLAREVA DPTQLTDEDL GLANVILTNG ARLYVDHAHP EYSAPEVTNP RDAVLWDKAG
ERVMADAAAR AGATPGIEPI QLYKNNTDNK GASYGCHENY LMHRSTPFGD IVRHLIPFFV
SRQVVCGAGK VGIGSDGQGS GFQISQRADF FEVEVGLETT LKRPIINTRD EPHADPDQYR
RLHVIIGDAN MSEISTYLKL GTTALVLSMI EDGFLSADLS LETPVADLRE VSHDPGLTHR
VRLRDGRRMT ALELQCEYLD QARKYVEDRF GTDVDPDTAD VLDRWESVLN RLGADPMSLA
DELDWVAKLK VLEGYRSRDS LEWSHPRLQL VDLQYSDVRA DKGIYNRLVA RGRMKRLLEE
AQVDRAVTEP PEDTRAYFRG RCLAKYADEV AAASWDSVIF DLPGYDSLQR VPTLEPRRGT
KEHVGKLLDA SPTAADLVAV LTGGR
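
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before reuse. The following is a minimal Python sketch of that cleanup, together with a simple GC-fraction helper. It is an illustration only: the function names are hypothetical, and the GC calculation is a plain count of G and C over sequence length, which may not be the exact method behind the GC content field of this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence block, copied from the record.
first_line = "ATGAGTGTGC GGCGGATTAT GGGTATCGAG ACCGAGTACG GGATCTCCGT ACCAGGCAAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the whole gene sequence block through `clean_sequence` should give a string whose length can be compared with the Gene Length field.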