Gene Ndas_2564 details


Gene Information

Locus tag: Ndas_2564
Symbol:
ID: 9246415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3055754
End bp: 3057778
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: protein of unknown function DUF839
Protein accession: YP_003680489
Protein GI: 297561515
COG category:
COG ID:
TIGRFAM ID:
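
The coordinates and lengths listed above are consistent with one another, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 3055754
END_BP = 3057778


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end inclusive."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2025
print(protein_length(length_bp))  # 674
```

Both results match the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption.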


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.200844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0537939
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGAAT CCGCGCCCCG CCGCCGCCAG CTGCCGTTGA TCGGCCAGGT CGGTGGCGGT 
CGTTCCCGTG CGACATGCAG ACTGCGCTGC GGCGACGCCT GTTTCCACCC GGCTCCCAAC
ACCAGCGACA ACCCCTACTT CGGGGACATC TTCGCCAAGG CGCTCTCCCG CCGCTCCGTC
ATCCAGGCCG GTGCCGTGAG CGCCGGAGCG GGCGCGCTCG GACTCGCCGC GCTCAGCCCG
GCCTCGGCCG ACCGCCGCGG CCCCGACCGC CCCCACCCCT CGCCGCGGCC CACCTTCACC
AGCGTCCAGC CCAACAACGA CGACAAGATC ACCATCCCCC GGGGCTACGA CCAGCACGTC
ATCATCCGCT GGGGCGAGCC GGTCCTGCCC GGCGCTCCCG AGTTCGACGT GTACGACCAG
ACCGCCGAGG CGCAGGCCCA GCAGTTCGGC TACAACTGCG ACTACGTCGG CTTCCACCAG
CTCGACGACG ACCACGCCCT GCTGTGGGTC AACCACGAGT ACACCAACGA GGAACTCATG
TTCCCGGGGT ACGCCGGGGG CGACGCCGCC ACCGAGGAGC AGGTCCGCAT CGCCATGGCC
GCCCACGGCG GTTCCATCGT CGAGCTCCAG CGCGAGGGCC GCACCGGCAA GTGGGTGCTC
TCCACGGGCG AGCGCTCCTT CAACCGCCGC ATCACCGCCG ACACCCCCAT GCGCCTCACC
GGCCCCGCCG CCGGGCACGA CCTGCTCAAG ACGGCGGAGG ACCCCACCGG CACCCTGGTC
AGGGGCATGC TCAACAACTG CGCGGGCGGC ATGACCCCGT GGGGCACCTT CCTCACCGCC
GAGGAGAACT TCAACCAGTA CTTCGCCAAC GGCTCCGGGT CCGCCGAGAC CAGGCGCTAC
GGCGTCGGCA CCGGTGCCAC GGGGCGCCGC TGGGAGCGTT TCGAGGAGCG CTTCGACCTC
TCCAAGCACC CCAACGAGAT CAACCGCTTC GGCTACATCG TCGAGGTCGA CCCGCTCGAC
CCCGGGTCCG AGCCGCTCAA GCGCACCATG CTCGGCCGCT TCAAGCACGA GGGCGCCACC
ACCCGCCTGG CCGACGACGG CCGGGTCGTG GCCTACATGG GCGACGACGA GCGCTTCGAC
TACATGTACA AGTTCGTCAG CGCCAAGAAG TACGTCGAGG GCTCCCGGCG CCACAACCTC
TCCCTGCTGG ACACCGGCAC GCTGTACGTG GCGCGCCTGT CCGGCAACAG CCCCGCCGAG
GAGTTCGACG GCTCCGGCGC GCTGCCCGCC GACGGCGAGT TCGACGGTTC CGGCGAGTGG
GTCGCGCTGT GCACCGACAC CGAGAGCTTC GTCCCCGGCT TCAGCGTCGC CGAGGTGCTC
ATCCACACCC GTCTGGCCGC CGACGCCGTG GGCCCCACCA AGATGGACCG CCCCGAGGAC
TTCGAGCCCA GCCCGGTCAC CGGCAAGGTC TACTGCGCGC TGACCAACAA CTCCGCCCGC
GAGCCCGGCC AGGCCGACGA GCCCAACCCG CGCGGCCCCA ACCGGCACGG CCACGTCCTG
GAGATCGTGG AGTCCGGCAA CGACGCGGCC GCCACCACCT TCGCCTGGAA CGTGCCGCTG
GTGTGCGGCG ACCCCGAGGA CGACGACACC TACTACGCGG GCTTCGACAA GTCCAAGGTC
ATGCCGATCT CCGCGCCGGA CAACCTGACC TTCGACAAGG ACGGCAACCT GTGGATCTCC
ACGGACGGCC AGCCGGGCGC CCTGGGGATC AACGACGGCC TGCACGTCAT GCCGGTCGAG
GGCCGCTTCC GAGGTGAGCT GAAGACCTTC GCCACCGTCC CGGTCGGCGC GGAGGCCTGC
GGCCCCTTCG TCACCGAGGA CAGCAAGACG GTGTTCCTGG CCCCCCAGCA CCCCGGTGAC
GGCGGCAGCT TCGAGGCTCC CACCAGCACC TGGCCCGACG GCGAGTTCCC GCGCCCGTCC
GTGGTGTGCA TCTGGCACAC CGCGGGCCGC GAGGTCGGCA GGTAG
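
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. As a rough sketch of how a GC content figure like the one in the Gene Information table could be recomputed, the hypothetical helpers below strip the whitespace and count G and C. How the record rounds its percentage is not stated, so no particular rounding is assumed here.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = (
    "GTGCCCGAAT CCGCGCCCCG CCGCCGCCAG CTGCCGTTGA TCGGCCAGGT CGGTGGCGGT"
)
print(len(clean_sequence(first_row)))             # 60
print(f"{gc_content(first_row):.1f}% GC in the first row")
```

To recompute the value for the whole gene, pass the complete sequence block to `gc_content` instead of a single row.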
 
Protein sequence
MPESAPRRRQ LPLIGQVGGG RSRATCRLRC GDACFHPAPN TSDNPYFGDI FAKALSRRSV 
IQAGAVSAGA GALGLAALSP ASADRRGPDR PHPSPRPTFT SVQPNNDDKI TIPRGYDQHV
IIRWGEPVLP GAPEFDVYDQ TAEAQAQQFG YNCDYVGFHQ LDDDHALLWV NHEYTNEELM
FPGYAGGDAA TEEQVRIAMA AHGGSIVELQ REGRTGKWVL STGERSFNRR ITADTPMRLT
GPAAGHDLLK TAEDPTGTLV RGMLNNCAGG MTPWGTFLTA EENFNQYFAN GSGSAETRRY
GVGTGATGRR WERFEERFDL SKHPNEINRF GYIVEVDPLD PGSEPLKRTM LGRFKHEGAT
TRLADDGRVV AYMGDDERFD YMYKFVSAKK YVEGSRRHNL SLLDTGTLYV ARLSGNSPAE
EFDGSGALPA DGEFDGSGEW VALCTDTESF VPGFSVAEVL IHTRLAADAV GPTKMDRPED
FEPSPVTGKV YCALTNNSAR EPGQADEPNP RGPNRHGHVL EIVESGNDAA ATTFAWNVPL
VCGDPEDDDT YYAGFDKSKV MPISAPDNLT FDKDGNLWIS TDGQPGALGI NDGLHVMPVE
GRFRGELKTF ATVPVGAEAC GPFVTEDSKT VFLAPQHPGD GGSFEAPTST WPDGEFPRPS
VVCIWHTAGR EVGR
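
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (the NCBI bacterial, archaeal and plant plastid code), where GTG is one of several alternative initiation codons and is read as methionine at the start of a CDS. The sketch below is a minimal illustration of that rule, not the tool used to produce this record. The function name is hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# NCBI translation table 11: amino acids in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS; the first codon is read as M if it is a start codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    protein = "".join(residues)
    # Drop a single terminal stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence, copied from the record.
first_row = (
    "GTGCCCGAAT CCGCGCCCCG CCGCCGCCAG CTGCCGTTGA TCGGCCAGGT CGGTGGCGGT"
)
print(translate_cds(first_row))  # MPESAPRRRQLPLIGQVGGG
```

The output matches the first two blocks of the protein sequence above. Because the start-codon rule applies only to the first codon passed in, translate the full gene in one call rather than row by row.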