Gene Ndas_0556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0556 
Symbol 
ID9244397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp685777 
End bp687894 
Gene Length2118 bp 
Protein Length705 aa 
Translation table11 
GC content72% 
IMG OID 
Productprotein of unknown function DUF187 
Protein accessionYP_003678509 
Protein GI297559535 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG GTGTGACGGA GAGCGGGCCC GGGAGCGCGG CCAGCGCGCC GCGCGAAGCC 
GACTGGCATC TGGGCGCGAC GCGCTGGACG CAGCTGACGC TGACCGAGAA CGACCCCGGA
CGCTTCGACC CCGACTTCTG GATCCAGGTC ATGCGCGAGA CCAGGTCCAA CGCGGCCTGC
ATCAGCGCGG GCGGGTACGT CGCCTTCTAC CCGACCGACG TCCCCTACCA CCACCGCAGC
GTCCACCTCG GCGACACCGA CCCCTTCGGC GCCCTGGTCG AGGGCGCGCG CGGCCTGGAC
ATGCACGTCA TGGCCAGGGT GGACCCCCAC GCCGTCCACG CCGACGCCGC GAGCGCCCAC
CCCGAGTGGC TGGCGCGTAC CGCCGACGGC AGCCCGGTGG AGCACTGGGG CTACCCCGGT
ATCTGGCTGA CCTGTCCGTT CGGCCCGTAC AACCGCGACT TCATCACCGA GGTCGCCCGC
GAGATCGTCA CCCGCTACGA CGTGGACGCC GTCTTCGCCA ACCGCTGGCA GGGCCACGGC
ATCTCCTACA GCGAGGCGGC GCTGCGCAGC TTCCGCGACG AGACGGGGTT CGACCTGCCG
CGCCGGGAGG GGGACACCTC CGACCCCGCC TGGCGCGCCT ACGTGGTGTG GCGCAGGCGC
AAGCTGAGCG ACCTGGTGAG CCTCTGGGAC CAGGCGGTCC GCGACATCCG CCCGCACGCC
CGCTTCATCC CCAACCTGGG CAGCATCGCC GCGCGCGACC TCGACCGCGG CATGCTGGCC
CGGCACTTCC CGTTCTTCCT CATCGACAAG CAGGGCCGTT CCGGGGTGGA GGCGCCCTGG
TCGGCCGGGC GCAACGGCAA GCGCAGCCGG GCGGTCTTCC GGGACAGGCC GGTCGGACTC
ATCACCTCCA TCGGTCCCGA GCACCACCAG CACCGCTGGA AGGACTCGGT ATCGCCCGGG
GCCGAGACCA CCATGTGGAT CGTCGACGGC TTCGCGCAGG GCGCCTTCCC CTGGTTCACC
AAGTTCAACG GCATGGTGCC CGACCGGCGG TGGGTGCGCC CCGTAGCCGA GGCCTTCGCG
CTCCACGAGC GCCTGGAGCC CGAACTCGCG GGCCGCCGCA TCACGGCGGA CGTGGCGCTC
CTGGAGACGG GAGGCCCCGA GAGGGGGCGC TCGCACGAGG ACGGCTTCTA CCACGCCCTG
GTCGAGGCCC GCATCCCCTT CGAGATCGTC GCCGAGCAGA ACCTGTCCGA GAGCGAGTTG
GGCCGGTTCA GGGTGCTCGT CCTGCCCGAC GCCGAGCGGT TGAGCCAGGA CCAGTGCCGC
GCGATCCGGG CCTTCACCGA GTCGGGGGGA AGCGTCGTGG CCGCCCACCG CTCCTCCCTG
GACGACGAGT ACGGCACCCC GCGGGCCAAC TTCGGGCTCG CCGACGTGTT CGGCGTGGAC
CTGAGCATGC CGGTGCGGGG ACCCGTCAAG AACAACTACG TCGCGCTCAC CGGAGAGCAC
CCCACCACGG ACGGATTCGG GGGAGCGGAG CGCGTCATCG GCGGCACCGA ACTCATCGGT
GTGACCGCCC GCCCGGGGAC CGGCGTGCCC CTGCGCTTCG TGCCCGACTA CCCCGACCTG
CCGATGGAGG AGGTCTACCC GCGGGAGGAG CCCGCGAGCC CGGCCCTGGT CACCCGGGAG
CTGCCCGGAG GCGGGCGCTC GGTCTACGTC GCCTTCAACC TCGGCTCCCT GTTCTGGGAG
GCGCTCCAAC CCGACCACGG GACGCTCGTC GCCAACGCCG TCCGCTGGGC CCTCGGGGAA
CCGGAACGGG TCCGGGTCCG CGGACGCGGA CTGGTCGACC TCGCGCTCTG GGAGGACCCG
GATTCGGTCG CCGTGGTGAT CGTCAACCTC ACCAACCCCA TGGCACTCAA GGGGCCCATG
CGCGAGATCA TCCCGCTGCC CGCGCAGAAG GTCTCGGTCG CCCTGCCGGA GGGCGCTGTG
GGCGCCGACG CGCGCCTCCT GGTCTCCGGG TCCGACGTCC CGGTGCGCGT GGGCGCGGGG
CGCGCCGAGG TGACCGTGGA CTCCGCGGAC CTGCTGGAGG CCGTCCGCTT CACGTGGATC
CGGGGTGAGC GGGCATGA
 
Protein sequence
MAQGVTESGP GSAASAPREA DWHLGATRWT QLTLTENDPG RFDPDFWIQV MRETRSNAAC 
ISAGGYVAFY PTDVPYHHRS VHLGDTDPFG ALVEGARGLD MHVMARVDPH AVHADAASAH
PEWLARTADG SPVEHWGYPG IWLTCPFGPY NRDFITEVAR EIVTRYDVDA VFANRWQGHG
ISYSEAALRS FRDETGFDLP RREGDTSDPA WRAYVVWRRR KLSDLVSLWD QAVRDIRPHA
RFIPNLGSIA ARDLDRGMLA RHFPFFLIDK QGRSGVEAPW SAGRNGKRSR AVFRDRPVGL
ITSIGPEHHQ HRWKDSVSPG AETTMWIVDG FAQGAFPWFT KFNGMVPDRR WVRPVAEAFA
LHERLEPELA GRRITADVAL LETGGPERGR SHEDGFYHAL VEARIPFEIV AEQNLSESEL
GRFRVLVLPD AERLSQDQCR AIRAFTESGG SVVAAHRSSL DDEYGTPRAN FGLADVFGVD
LSMPVRGPVK NNYVALTGEH PTTDGFGGAE RVIGGTELIG VTARPGTGVP LRFVPDYPDL
PMEEVYPREE PASPALVTRE LPGGGRSVYV AFNLGSLFWE ALQPDHGTLV ANAVRWALGE
PERVRVRGRG LVDLALWEDP DSVAVVIVNL TNPMALKGPM REIIPLPAQK VSVALPEGAV
GADARLLVSG SDVPVRVGAG RAEVTVDSAD LLEAVRFTWI RGERA